🔎 Evidence browser

Browse the trust index

Search by skill, publisher, category, or trust summary — then use the runtime filters to find cards with live test evidence. The two main lanes are baseline safety checks first and deeper follow-on functionality checks after that.

✨ Quick picks

Security | GitHub | Weather | Trusted with current evidence | Higher-confidence picks | Trusted + tested + source-scanned | Review before installing | Runtime-tested | Handled fake credentials cleanly | Needs real credentials / access | Could not fully test yet | Fresh runtime evidence | Older runtime evidence | Hall of Shame | Stronger evidence imports | Needs review

🏷 Categories · search-and-research

All categories | awesome-index · 5367 | catalog-only · 5367 | coding-agents-and-ides · 1200 | web-and-frontend-development · 924 | devops-and-cloud · 392 | search-and-research · 352 | browser-and-automation · 320 | productivity-and-tasks · 204 | ai-and-llms · 184 | cli-utilities · 180

🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.

🧪 Runtime status: cards can show only the baseline safety lane or the deeper follow-on functionality lane, depending on how far the skill got. Some cards now also surface how the skill behaved when clearly fake credentials were present.

📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.

⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.
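The three freshness tiers can be sketched as a simple age check. This is a minimal illustration, not the site's actual logic: the function name `freshness_bucket`, the inclusive boundaries, and the snapshot time used in the example are all assumptions.

```python
from datetime import datetime, timedelta, timezone


def freshness_bucket(tested_at: datetime, now: datetime) -> str:
    """Bucket a runtime receipt by age (hypothetical helper, assumed boundaries)."""
    age = now - tested_at
    if age < timedelta(0):
        raise ValueError("receipt timestamp is in the future")
    if age <= timedelta(hours=24):
        return "within 24 hours"
    if age <= timedelta(days=7):
        return "within 7 days"
    return "older"


# Assumed snapshot time; the page does not state when it was rendered.
now = datetime(2026, 3, 17, 6, 0, tzinfo=timezone.utc)

# Receipt timestamps taken from two cards on this page.
print(freshness_bucket(datetime(2026, 3, 16, 11, 30, tzinfo=timezone.utc), now))
print(freshness_bucket(datetime(2026, 3, 16, 1, 58, tzinfo=timezone.utc), now))
```

With the assumed snapshot time, the 11:30 UTC receipt lands in "within 24 hours" and the 01:58 UTC receipt in "within 7 days", which matches the labels those two cards carry.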

🩺 Failure confidence: distinguishes a first seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.
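One way to picture the three failure states is as a classification over a lane's run history. The sketch below is an assumption about how such a label could be derived; the function name, the boolean history format, and the tie-breaking order are hypothetical and not taken from the site.

```python
def failure_confidence(history: list[bool]) -> str:
    """Classify the latest run of a lane.

    history holds run outcomes oldest-to-newest; True means the run passed.
    The precedence (repeated before regression) is an assumed convention.
    """
    if not history:
        return "no runs"
    if history[-1]:
        return "passing"
    earlier = history[:-1]
    if earlier and not earlier[-1]:
        # The run just before this one also failed.
        return "repeated failure"
    if any(earlier):
        # At least one earlier run passed, so this failure is a step backwards.
        return "regression after earlier pass"
    return "first seen failure"


print(failure_confidence([False]))              # first seen failure
print(failure_confidence([True, True, False]))  # regression after earlier pass
print(failure_confidence([True, False, False])) # repeated failure
```

Under this reading, a red row backed by a single failed run carries less weight than one that has failed twice in a row or that broke after passing.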

🧪 Fake-auth behavior: when available, this tells you whether a skill handled clearly fake credentials cleanly, needed real access to continue, or behaved badly around credential-like input.

Results

Showing 10 of 10 skills in the browsable catalog view · runtime: passed · auth behavior: handled-fake-creds · category: search-and-research · sort: score

Page evidence snapshot: runtime-passed: 9 · runtime-failed: 1 · source-scanned: 10 · fresh <24h: 8 · manual review: 0

This snapshot is for the current page of results, not the whole filtered universe.
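The per-page snapshot can be reproduced by tallying the cards shown on this page. This is a rough sketch: the `Card` fields and the `page_snapshot` helper are hypothetical, and counting trifle-auth's "could not be fully tested" result as the single runtime failure is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Card:
    name: str
    runtime_passed: bool
    evidence_level: str
    freshness: str


def page_snapshot(cards: list[Card]) -> dict[str, int]:
    """Tally only the cards on the current page, not the whole filtered set."""
    return {
        "runtime-passed": sum(c.runtime_passed for c in cards),
        "runtime-failed": sum(not c.runtime_passed for c in cards),
        "source-scanned": sum(c.evidence_level == "source-scanned" for c in cards),
        "fresh <24h": sum(c.freshness == "within 24 hours" for c in cards),
    }


# (name, runtime passed, freshness label) as listed on this page.
rows = [
    ("web-search-pro", True, "within 7 days"),
    ("supermarket", True, "within 24 hours"),
    ("asia-twitter-api-v1", True, "within 24 hours"),
    ("cirf", True, "within 24 hours"),
    ("crif", True, "within 24 hours"),
    ("trifle-auth", False, "within 24 hours"),  # assumed to be the one failure
    ("glittercowboy", True, "within 24 hours"),
    ("gsd", True, "within 24 hours"),
    ("skill-store", True, "within 7 days"),
    ("bagman", True, "within 24 hours"),
]
cards = [Card(n, ok, "source-scanned", fresh) for n, ok, fresh in rows]
print(page_snapshot(cards))
```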

Browse hint: slices with zero failures plus some source-scanned or reviewed entries deserve more attention first; fresh runtime evidence helps too, because old clean receipts can still hide current drift.

web-search-pro

zjianru · source-scanned


Trusted · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 01:58 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · fake-auth behavior: handled cleanly · passed, handled fake credentials cleanly · output 128 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2329 ms · baseline-v3 8/8

fake-auth behavior: handled cleanly. Clearly fake credentials were exercised and handled normally.

RatioDaemon on this skill: Web Search Pro is trying to handle web search. Follow-on functionality checks currently pass without failed checks, and setup looks advanced.

Observed: skill-structure-ok

Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.

Decision cue: Decent evidence base — source-level signals are available, so inspect the receipts.

supermarket

niemesrw · source-scanned

Search grocery products, find store locations, add items to cart, and view profile across all Kroger-family stores — Kroger, Ralphs, Fred Meyer, Harris Teeter, King Soopers, Fry's, QFC, Mariano's, Pick 'n Save, Metro Market, and more. Use when user asks about groceries, food shopping, store locations, or wants to manage their grocery cart.

High Risk · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 11:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 129 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2251 ms · baseline-v3 8/8

RatioDaemon on this skill: Supermarket is built for supermarket. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

asia-twitter-api-v1

renning22 · source-scanned

Search X (Twitter) in real time, monitor trends, extract posts, and analyze social media data—perfect for social listening and intelligence gathering. Safe read-only operations by default.

High Risk · follow-on functionality checks passed · 9/9 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 14:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · fake-auth behavior: handled cleanly · passed, handled fake credentials cleanly · output 161 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3470 ms · baseline-v3 8/8

fake-auth behavior: handled cleanly. Clearly fake credentials were exercised and handled normally.

RatioDaemon muttered: asia-twitter-api-v1 looked ordinary in the good, boring way. 9/9 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

cirf

kudodefi · source-scanned

Interactive crypto deep-research framework with human-AI collaboration for superior research outcomes

High Risk · follow-on functionality checks passed · 11/11 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 18:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 29.2 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3440 ms · baseline-v3 8/8

RatioDaemon on this skill: Cirf looks aimed at cirf. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

crif

kudodefi · source-scanned

Interactive crypto deep-research framework with human-AI collaboration for superior research outcomes

High Risk · follow-on functionality checks passed · 11/11 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 15:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 29.2 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3507 ms · baseline-v3 8/8

RatioDaemon on this skill: Crif is built for crif. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

trifle-auth

okwme · source-scanned

Authenticate with the Trifle API using Sign-In with Ethereum (SIWE). Manages wallet-based authentication, JWT token storage, and session management for the Trifle ecosystem.

High Risk · follow-on functionality checks could not be fully tested · 8/9 · confidence: source evidence

Runtime receipts + what blocked setup · 2026-03-16 17:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, did not make it clear what the test should run · output 2.1 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2840 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🧭 did not make it clear what the test should run

RatioDaemon muttered: trifle-auth never made it clear what the test was even supposed to run. 8/9 functionality-v2 checks passed before the stumble. The package.json entrypoints are the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

glittercowboy

oleg-schmidt · source-scanned

Get Shit Done - Full project planning and execution workflow. Handles project initialization with deep context gathering, automated research, roadmap creation, phase planning, and execution with verification.

High Risk · follow-on functionality checks passed · 5/5 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 20:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 80 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1724 ms · baseline-v3 8/8

RatioDaemon muttered: glittercowboy cleared the baseline safety checks without trying anything cute. 5/5 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

gsd

oleg-schmidt · source-scanned

Get Shit Done - Full project planning and execution workflow. Handles project initialization with deep context gathering, automated research, roadmap creation, phase planning, and execution with verification.

High Risk · follow-on functionality checks passed · 5/5 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 18:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 80 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1695 ms · baseline-v3 8/8

RatioDaemon muttered: gsd cleared the baseline safety checks without trying anything cute. 5/5 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

skill-store

yx2601816404-sys · source-scanned

Smart skill installation advisor for ClawHub. Searches for skills matching your needs, evaluates candidates on security (via skill-shield), code quality, and documentation, then produces a comparison report with a recommendation. Use when: looking for a skill to do something specific, comparing similar skills, or wanting a safety-checked recommendation before installing. Zero external dependencies.

High Risk · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 02:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · passed · output 116 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2532 ms · baseline-v3 8/8

RatioDaemon muttered: skill-store cleared the baseline safety checks without trying anything cute. 7/7 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

bagman

zscole · source-scanned

Secure key management for AI agents. Use when handling private keys, API secrets, wallet credentials, or when building systems that need agent-controlled funds. Covers secure storage, session keys, leak prevention, prompt injection defense, and MetaMask Delegation Framework integration.

High Risk · baseline safety checks passed · 8/8 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 20:30 UTC

baseline-v3 · evidence depth: baseline checks only · tested recently: within 24 hours · fake-auth behavior: handled cleanly · passed, handled fake credentials cleanly · output 484 B · artifacts 2 · worker oc-sandbox · source stage: fresh copy · suite 2422 ms

fake-auth behavior: handled cleanly. Clearly fake credentials were exercised and handled normally.

RatioDaemon on this skill: Bagman sits in the secure key management for AI agents lane. Baseline safety checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: 8 /workspace/source-files.txt

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.