
Evaluation Methods for Agent Systems

GitHub:sickn33/antigravity-awesome-skills · evaluation

Trusted

This skill documents methods for evaluating agent systems, which are complex and non-deterministic. It focuses on producing actionable feedback and enabling continuous improvement.

Source: Workspace import

Originally ingested from a local workspace copy.

version 34e4ed70d75f

2 findings

static analysis only

requires secrets

no human review yet

Safety

Quality

Transparency

100

Operational

Automated result: Trusted

Current public label: Trusted

The skill is mostly documentation. It mentions secrets, but the automated analysis still labels it Trusted.

Human review: none yet

The current public label still rests on automated analysis alone; no human has reviewed it yet.

Severity mix: 2 low
