🔎 Evidence browser

Browse the skill radar

Search by skill, publisher, category, or trust summary — then use the runtime filters to find cards with live test evidence. The two main lanes are baseline safety checks first and deeper follow-on functionality checks after that.

Security GitHub Weather Trusted only Higher-confidence picks Lower-friction local candidates Review-first installs Sandbox-tested Fresh runtime Stale runtime Hall of Shame Stronger evidence imports Needs review Clear filters

All categories awesome-index · 5367 catalog-only · 5367 coding-agents-and-ides · 1200 web-and-frontend-development · 924 devops-and-cloud · 392 search-and-research · 352 browser-and-automation · 320 productivity-and-tasks · 204 ai-and-llms · 184 cli-utilities · 180

✨ Quick picks

Security GitHub Weather Trusted only Higher-confidence picks Lower-friction local candidates Review-first installs Sandbox-tested Fresh runtime Stale runtime Hall of Shame Stronger evidence imports Needs review Clear filters

🏷 Categories

All categories awesome-index · 5367 catalog-only · 5367 coding-agents-and-ides · 1200 web-and-frontend-development · 924 devops-and-cloud · 392 search-and-research · 352 browser-and-automation · 320 productivity-and-tasks · 204 ai-and-llms · 184 cli-utilities · 180

🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.

🧪 Runtime status: cards can show only the baseline safety lane or the deeper follow-on functionality lane, depending on how far the skill got.

📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.

⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.

🩺 Failure confidence: distinguishes a first seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.

Results

Showing 21 of 45 skills in the browsable catalog view · runtime: failed · freshness: fresh · sort: score

page evidence snapshotruntime-passed: 0 runtime-failed: 21 source-scanned: 21 fresh <24h: 14 manual review: 0

This snapshot is for the current page of results, not the whole filtered universe.

Browse hint: slices with zero failures plus some source-scanned or reviewed entries deserve more attention first; fresh runtime evidence helps too, because old clean receipts can still hide current drift.

nova-act-usability

zouchaoqun · vsource-scanned

AI-orchestrated usability testing using Amazon Nova Act. The agent generates personas, runs tests to collect raw data, interprets responses to determine goal achievement, and generates HTML reports. Tests real user workflows (booking, checkout, posting) with safety guardrails. Use when asked to "test website usability", "run usability test", "generate usability report", "evaluate user experience", "test checkout flow", "test booking process", or "analyze website UX".

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-16 13:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 544 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 2344 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: nova-act-usability made it to runtime and then fell apart on contact.6/7 functionality-v2 checks passed before the stumble. The python syntax is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

sergei-mikhailov-stt

bzsega · vsource-scanned

Speech recognition from voice messages using Yandex SpeechKit (with an extensible architecture for other providers). Use when you need to convert a voice message to text.

High Riskfollow-on functionality checks failed · 7/10confidence: source evidence

Runtime receipts + what failed2026-03-16 12:30 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, expectation_failed, runtime_failed, crashed_with_fake_credentialsoutput 117 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 4096 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave sergei-mikhailov-stt a chance to act normal. It declined and talked a big game, then missed its own proof signal.7/10 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo .

Decision cue: Review first — functionality-v2 already found trouble.

sys-updater

spiceman161 · vsource-scanned

Production-safe Ubuntu maintenance orchestrator: runs daily apt security updates, tracks non-security updates across apt/npm/pnpm/brew with quarantine + auto-review, applies only approved updates, rotates logs/state, and generates clear 09:00 MSK Telegram reports (including what was actually installed).

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-15 21:30 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3162 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: sys-updater made it to runtime and then fell apart on contact, which is not ideal for a skill asking to be trusted.6/7 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

pyzotero

killgfat · vsource-scanned

Python scripts for Zotero - supports search, browse, add items, and full collection management. Both local API and online Web API modes.

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-15 07:30 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 2533 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon on this skillPyzotero is trying to handle python scripts for Zotero -. Functionality-v2 is currently first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo .

Decision cue: Review first — functionality-v2 already found trouble.

agent-commerce-engine

nowloady · vsource-scanned

A production-ready universal engine for Agentic Commerce. This tool enables autonomous agents to interact with any compatible headless e-commerce backend through a standardized protocol. It provides out-of-the-box support for discovery, cart operations, and secure user management.

High Riskfollow-on functionality checks failed · 6/8confidence: source evidence

Runtime receipts + what failed2026-03-16 15:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failed, crashed_with_fake_credentialsoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3192 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: agent-commerce-engine made it to runtime and then fell apart on contact.6/8 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

moltpho

unifiedh · vsource-scanned

Shop autonomously on Amazon via Moltpho - search products, manage credit, and purchase items using mUSD on Base mainnet

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-16 14:45 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, expectation_failedoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 2474 msbaseline-v3 8/8

🕵️ expected proof signal was missing

RatioDaemon muttered: moltpho talked a big game, then missed its own proof signal.6/7 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

jellyfin-control

titunito · vsource-scanned

Control Jellyfin media server and TV. Search content, resume playback, manage sessions, control TV power and apps. Supports Home Assistant and direct WebOS backends. One command to turn on TV, launch Jellyfin, and play content.

High Riskfollow-on functionality checks failed · 9/10confidence: source evidence

Runtime receipts + what failed2026-03-16 15:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, crashed_with_fake_credentialsoutput 175 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3452 msbaseline-v3 8/8

🕵️ expected proof signal was missing

RatioDaemon muttered: jellyfin-control left receipts, just not the ones it was supposed to, which is not ideal for a skill asking to be trusted.9/10 functionality-v2 checks passed before the stumble. The node entrypoint bogus env is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

clawdhub-contributor

starbuck100 · vsource-scanned

Contribute to the ClawdHub ecosystem by scouting unknown skills, reporting bugs, and sharing skill recipes. Three modes (passive/active/full) let you control how much you contribute.

High Riskfollow-on functionality checks failed · 7/9confidence: source evidence

Runtime receipts + what failed2026-03-16 00:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, expectation_failed, runtime_failedoutput 117 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3373 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: clawdhub-contributor talked a big game, then missed its own proof signal.7/9 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

grazer

scottcjn · vsource-scanned

Multi-Platform Content Discovery for AI Agents

High Riskfollow-on functionality checks failed · 9/12confidence: source evidence

Runtime receipts + what failed2026-03-15 23:45 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, ambiguous_entrypoint, expectation_failed, runtime_failedoutput 2.2 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 4217 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave grazer a chance to act normal. It declined and left receipts, just not the ones it was supposed to.9/12 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

guava-guard

koatora20 · vsource-scanned

Runtime security guard + scanner for OpenClaw agents. Part of the guard-scanner ecosystem. Detects reverse shells, credential theft, and sandbox escapes in real-time. For full static scanning with 150+ patterns, install guard-scanner.

High Riskfollow-on functionality checks failed · 5/6confidence: source evidence

Runtime receipts + what failed2026-03-15 09:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 314 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 1922 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon on this skillGuava Guard is built for runtime security guard + scanner for OpenClaw agents. Functionality-v2 is currently first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

skill-vettr

britrik · vsource-scanned

Static analysis security scanner for third-party OpenClaw skills.

High Riskbaseline safety checks failed · 7/8confidence: source evidence

Runtime receipts + what failed2026-03-16 16:15 UTC

baseline-v3evidence depth: baseline checks onlytested recently: within 24 hoursfirst failed run seen for this laneexpectation_failed, passed, handled_fake_credentials_cleanlyoutput 452 Bartifacts 2worker oc-sandboxsource stage: fresh copysuite 2442 ms

🕵️ expected proof signal was missing

RatioDaemon muttered: The runtime lane gave skill-vettr a chance to act normal. It declined and talked a big game, then missed its own proof signal.7/8 baseline-v3 checks passed before the stumble. The source-mount check is the part that made this interesting.

Observed: 11 /workspace/source-files.txt

Take: Potentially suspicious implementation signals detected: eval(, rm -rf, password.

Decision cue: Review first — baseline-v3 already found trouble.

tide-watch

chrisagiddings · vsource-scanned

Proactive session capacity monitoring and management for OpenClaw. Prevents context window lockups by warning at configurable thresholds (75%, 85%, 90%, 95%), automatically backing up sessions before resets, and managing session resumption prompts. Use when working on long-running projects, managing multiple conversation channels (Discord, Telegram, webchat), or preventing lost work from full context windows. Includes CLI tools for capacity checks, cross-session dashboards, archive management, and session resumption. Supports any model or provider.

High Riskbaseline safety checks failed · 7/8confidence: source evidence

Runtime receipts + what failed2026-03-16 00:45 UTC

baseline-v3evidence depth: baseline checks onlytested recently: within 24 hoursfirst failed run seen for this laneexpectation_failed, passedoutput 591 Bartifacts 2worker oc-sandboxsource stage: fresh copysuite 2351 ms

🕵️ expected proof signal was missing

RatioDaemon muttered: tide-watch talked a big game, then missed its own proof signal, which is not ideal for a skill asking to be trusted.7/8 baseline-v3 checks passed before the stumble. The source-mount check is the part that made this interesting.

Observed: 12 /workspace/source-files.txt

Take: Potentially suspicious implementation signals detected: rm -rf, sudo , password.

Decision cue: Review first — baseline-v3 already found trouble.

veille

romain-grosos · vsource-scanned

RSS feed aggregator, deduplication engine, LLM scoring, and output dispatcher for OpenClaw agents. Use when: fetching recent articles from configured sources, filtering already-seen URLs, deduplicating by topic, scoring with LLM, dispatching digests to Telegram/email/Nextcloud/file. Enhanced by mail-client (email output) and nextcloud-files (cloud storage).

High Riskfollow-on functionality checks failed · 0/1confidence: source evidence

Runtime receipts + what failed2026-03-15 21:31 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfailure repeated in more than one runregression after earlier passblocked_on_external_serviceoutput 375 Bartifacts 1worker oc-sandboxsource stage: cache hitsuite 5621 msbaseline-v3 8/8

RatioDaemon muttered: veille left receipts, just not the ones it was supposed to.0/1 functionality-v2 checks passed before the stumble. The forced external check is the part that made this interesting.

Take: Potentially suspicious implementation signals detected: eval(, rm -rf, password.

Decision cue: Review first — functionality-v2 already found trouble.

qa-patrol

tahseen137 · vsource-scanned

>

High Riskfollow-on functionality checks failed · 8/10confidence: source evidence

Runtime receipts + what failed2026-03-16 17:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 13.9 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 3211 msbaseline-v3 8/8

🚫 skill exited with an error

RatioDaemon muttered: qa-patrol made it to runtime and then fell apart on contact, which is not ideal for a skill asking to be trusted.8/10 functionality-v2 checks passed before the stumble. The yaml parse is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

glitch-dashboard

chris6970barbarian-hue · vsource-scanned

Unified web terminal for task management, queue processing, and system monitoring.

High Riskfollow-on functionality checks failed · 8/9confidence: source evidence

Runtime receipts + what failed2026-03-15 09:45 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 2.5 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 2875 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave glitch-dashboard a chance to act normal. It declined and made it to runtime and then fell apart on contact.8/9 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo .

Decision cue: Review first — functionality-v2 already found trouble.

trifle-auth

okwme · vsource-scanned

Authenticate with the Trifle API using Sign-In with Ethereum (SIWE). Manages wallet-based authentication, JWT token storage, and session management for the Trifle ecosystem.

High Riskfollow-on functionality checks failed · 8/9confidence: source evidence

Runtime receipts + what failed2026-03-16 17:30 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, ambiguous_entrypointoutput 2.1 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 2840 msbaseline-v3 8/8

🕵️ expected proof signal was missing

RatioDaemon muttered: trifle-auth left receipts, just not the ones it was supposed to.8/9 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

deai-image

swaylq · vsource-scanned

Detect and remove AI fingerprints from AI-generated images. Strip metadata, add film grain, recompress, and bypass AI image detectors. Works with Midjourney, DALL-E, Stable Diffusion, Flux output.

High Riskfollow-on functionality checks failed · 9/10confidence: source evidence

Runtime receipts + what failed2026-03-15 15:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 171 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3499 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: deai-image made it to runtime and then fell apart on contact.9/10 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo .

Decision cue: Review first — functionality-v2 already found trouble.

expanso-email-triage

aronchick · vsource-scanned

AI-powered email triage with calendar sync and response drafting

High Riskfollow-on functionality checks failed · 11/12confidence: source evidence

Runtime receipts + what failed2026-03-16 01:50 UTC

functionality-v2evidence depth: includes fixture-backed checkstested recently: within 24 hoursfailure repeated in more than one runpassed, runtime_failedoutput 9.2 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 3651 msbaseline-v3 8/8

🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave expanso-email-triage a chance to act normal. It declined and made it to runtime and then fell apart on contact.11/12 functionality-v2 checks passed before the stumble. The yaml parse is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

coworker

sarthib7 · vsource-scanned

Connect to Hannah and Elena agents from Serviceplan - specialized AI coworkers for marketing research and operations planning. Access via email or OpenAI-compatible API.

Use Cautionfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-15 07:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 2.1 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 2365 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon on this skillCoworker is trying to handle coworker. Functionality-v2 is currently first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

google-keep

tag-assistant · vsource-scanned

Read, create, edit, search, and manage Google Keep notes and lists via CLI.

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-15 06:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 2480 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: google-keep made it to runtime and then fell apart on contact, which is not ideal for a skill asking to be trusted.6/7 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

safe-backup

hacksing · vsource-scanned

Backup OpenClaw state directory and workspace with security best practices.

High Riskfollow-on functionality checks failed · 5/6confidence: source evidence

Runtime receipts + what failed2026-03-15 16:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 7 daysfirst failed run seen for this lanepassed, runtime_failedoutput 227 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 1976 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave safe-backup a chance to act normal. It declined and made it to runtime and then fell apart on contact.5/6 functionality-v2 checks passed before the stumble. The shell syntax is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Review first — functionality-v2 already found trouble.

« First ← Prev 1 2Page 2 / 2