Category

evaluation skills

Browse 1 skills in the evaluation category, with labels, findings, and runtime evidence where available.

1 matching skill
Page 1 of 1
category: evaluation
clear filters

Evaluation Methods for Agent Systems

GitHub:sickn33/antigravity-awesome-skills · evaluation
Trusted

This skill provides documentation about evaluating agent systems, which are complex and non-deterministic. It focuses on how to provide actionable feedback and enable continuous improvement.

Source: Workspace import

Originally ingested from a local workspace copy.

version 34e4ed70d75f
2 findings
static analysis only
requires secrets
no human review yet
Safety
94
Quality
94
Transparency
100
Operational
92

Automated result: Trusted

Current public label: Trusted

The skill is mostly documentation, and it mentions secrets, so it's labeled as trusted.

Human review: none yet

The current public label is still relying on automation. A human has not weighed in yet.

Severity mix: 2 low