Category

advanced-evaluation skills

Browse 1 skill in the advanced-evaluation category, with labels, findings, and runtime evidence where available.

Advanced Evaluation

GitHub:sickn33/antigravity-awesome-skills · advanced-evaluation
Trusted

This skill provides production-grade techniques for evaluating LLM outputs. It uses LLMs as judges and synthesizes research into actionable patterns for building reliable evaluation systems.

Source: Workspace import

Originally ingested from a local workspace copy.

version fc836bcc2c4b
2 findings
static analysis only
requires secrets
no human review yet
Safety: 94
Quality: 94
Transparency: 100
Operational: 92

Automated result: Trusted

Current public label: Trusted

The skill's documentation mentions secrets, which is a low-severity concern. Because no higher-severity issues were found, the skill is labeled Trusted.

Human review: none yet

The current public label still relies on automation alone. No human has reviewed it yet.

Severity mix: 2 low