🔎 Evidence browser

Browse the trust index

Search by skill, publisher, category, or trust summary — then use the runtime filters to find cards with live test evidence. The two main lanes are baseline safety checks first and deeper follow-on functionality checks after that.

✨ Quick picks

Security · GitHub · Weather · Trusted with current evidence · Higher-confidence picks · Trusted + tested + source-scanned · Review before installing · Runtime-tested · Handled fake credentials cleanly · Needs real credentials / access · Could not fully test yet · Fresh runtime evidence · Older runtime evidence · Hall of Shame · Stronger evidence imports · Needs review

🏷 Categories

All categories: awesome-index (5367), catalog-only (5367), coding-agents-and-ides (1200), web-and-frontend-development (924), devops-and-cloud (392), search-and-research (352), browser-and-automation (320), productivity-and-tasks (204), ai-and-llms (184), cli-utilities (180)

🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.

🧪 Runtime status: cards can show only the baseline safety lane or the deeper follow-on functionality lane, depending on how far the skill got. Some cards now also surface how the skill behaved when clearly fake credentials were present.

📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.

⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.
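The bucketing behind the freshness cue could be sketched roughly as follows. This is only an illustration, not the index's actual logic: the function name `freshness_bucket`, the inclusive boundaries, and the reference time used in the usage lines are all assumptions.

```python
from datetime import datetime, timedelta, timezone


def freshness_bucket(tested_at: datetime, now: datetime) -> str:
    """Map a runtime receipt timestamp to a freshness label.

    Hypothetical helper: the boundary handling (<=) is an assumption.
    """
    age = now - tested_at
    if age <= timedelta(hours=24):
        return "within 24 hours"
    if age <= timedelta(days=7):
        return "within 7 days"
    return "older"


# Assumed "current time" for the example; the page does not state one.
now = datetime(2026, 3, 17, 5, 0, tzinfo=timezone.utc)

recent = datetime(2026, 3, 17, 4, 45, tzinfo=timezone.utc)
earlier = datetime(2026, 3, 16, 1, 58, tzinfo=timezone.utc)

print(freshness_bucket(recent, now))   # within 24 hours
print(freshness_bucket(earlier, now))  # within 7 days
```

Both timestamps are timezone-aware (UTC), which avoids mixing naive and aware datetimes when computing the age.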

🩺 Failure confidence: distinguishes a first seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.
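One way to picture the three failure labels is as a function of a lane's run history. The sketch below is an assumption about how such a classification might work, not the index's real rule: `failure_confidence` is a hypothetical name, and the history is assumed to be a list of pass/fail booleans ordered oldest to newest.

```python
from typing import Sequence


def failure_confidence(history: Sequence[bool]) -> str:
    """Label the latest run given a pass/fail history (oldest first).

    Hypothetical rule set, chosen only to illustrate the distinction.
    """
    if not history:
        return "no runs"
    if history[-1]:
        return "passing"
    earlier = history[:-1]
    if not earlier:
        # Nothing came before this failure.
        return "first seen failure"
    if earlier[-1]:
        # The run immediately before this one passed.
        return "regression after an earlier pass"
    return "repeated failure"


print(failure_confidence([False]))               # first seen failure
print(failure_confidence([False, False]))        # repeated failure
print(failure_confidence([True, True, False]))   # regression after an earlier pass
```

Under this reading, a regression carries different weight than a first failure because the skill demonstrably worked before.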

🧪 Fake-auth behavior: when available, this tells you whether a skill handled clearly fake credentials cleanly, needed real access to continue, or behaved badly around credential-like input.

Results

Showing 24 of 111 skills in the browsable catalog view · evidence: source-scanned · runtime: passed · auth behavior: handled-fake-creds · sort: score

Page evidence snapshot · runtime-passed: 11 · runtime-failed: 13 · source-scanned: 24 · fresh <24h: 22 · manual review: 0

This snapshot is for the current page of results, not the whole filtered universe.
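A per-page snapshot like this can be thought of as a simple tally over the cards currently shown. The sketch below assumes a hypothetical `Card` record with boolean fields; the field names and the `page_snapshot` helper are illustrative and not taken from the index itself.

```python
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass
class Card:
    # Hypothetical card shape; field names are assumptions.
    name: str
    runtime_passed: bool
    source_scanned: bool
    fresh_24h: bool


def page_snapshot(cards: Iterable[Card]) -> Dict[str, int]:
    """Count evidence signals for the cards on one page only."""
    cards = list(cards)
    return {
        "runtime-passed": sum(c.runtime_passed for c in cards),
        "runtime-failed": sum(not c.runtime_passed for c in cards),
        "source-scanned": sum(c.source_scanned for c in cards),
        "fresh <24h": sum(c.fresh_24h for c in cards),
    }


# Three cards from this page, with values as listed on their cards.
page = [
    Card("web-search-pro", runtime_passed=True, source_scanned=True, fresh_24h=False),
    Card("garmer", runtime_passed=False, source_scanned=True, fresh_24h=True),
    Card("lily-memory", runtime_passed=True, source_scanned=True, fresh_24h=True),
]
print(page_snapshot(page))
```

Because the tally only sees the cards passed in, it describes the current page and says nothing about the rest of the filtered results.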

Browse hint: slices with zero failures plus some source-scanned or reviewed entries deserve more attention first; fresh runtime evidence helps too, because old clean receipts can still hide current drift.

web-search-pro

zjianru · vsource-scanned

Trusted · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 01:58 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · fake-auth behavior: handled cleanly · passed, handled fake credentials cleanly · output 128 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2329 ms · baseline-v3 8/8

fake-auth behavior: handled cleanly · Clearly fake credentials were exercised and handled normally.

RatioDaemon on this skill: Web Search Pro is trying to handle web search. Follow-on functionality checks currently pass without failed checks and setup looks advanced.

Observed: skill-structure-ok

Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.

Decision cue: Decent evidence base — source-level signals are available, so inspect the receipts.

garmer

garrza · vsource-scanned

Extract health and fitness data from Garmin Connect including activities, sleep, heart rate, stress, steps, and body composition. Use when the user asks about their Garmin data, fitness metrics, sleep analysis, or health insights.

High Risk · follow-on functionality checks failed · 7/8 · confidence: source evidence

Runtime receipts + what failed · 2026-03-17 04:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, runtime failed · output 126 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3153 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🚫 skill exited with an error

RatioDaemon muttered: garmer made it to runtime and then fell apart on contact. 7/8 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

garmin-health

eversonl · vsource-scanned

Talk to your Garmin data naturally - "what was my fastest speed snowboarding?", "how did I sleep last night?", "what was my heart rate at 3pm?". Access 20+ metrics (sleep stages, Body Battery, HRV, VO2 max, training readiness, body composition, SPO2), download FIT/GPX files for route analysis, query elevation/pace at any point, and generate interactive health dashboards. From casual "show me this week's workouts" to deep "analyze my recovery vs training load".

High Risk · follow-on functionality checks failed · 6/9 · confidence: source evidence

Runtime receipts + what failed · 2026-03-17 04:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: concerning · passed, runtime failed, fell over when given fake credentials · output 1.7 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3451 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🚫 skill exited with an error · 💥 behaved badly with fake credentials

fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.

RatioDaemon muttered: garmin-health made it to runtime and then fell apart on contact. 6/9 functionality-v2 checks passed before the stumble. The meta json identity is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

hz-proactive-agent

lidekahdjdhdhsjjs-lang · vsource-scanned

Transform AI agents from task-followers into proactive partners that anticipate needs and continuously improve. Now with WAL Protocol, Working Buffer, Autonomous Crons, and battle-tested patterns. Part of the Hal Stack 🦞

High Risk · follow-on functionality checks passed · 6/6 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 10:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 98 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1987 ms · baseline-v3 8/8

RatioDaemon on this skill: Hz Proactive Agent is built for hz proactive. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

lily-memory

kevinodell · vsource-scanned

Persistent memory plugin for OpenClaw agents. Hybrid SQLite FTS5 keyword + Ollama vector semantic search with auto-capture, auto-recall, stuck-detection, and memory consolidation.

High Risk · follow-on functionality checks passed · 9/9 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 11:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 176 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3201 ms · baseline-v3 8/8

RatioDaemon muttered: lily-memory behaved itself under runtime pressure. 9/9 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

link-brain

jakes420 · vsource-scanned

Local knowledge base for links. Save URLs with summaries and tags, search later using natural language, build collections, and review your backlog with spaced repetition. Includes a standalone HTML graph view.

High Risk · follow-on functionality checks passed · 9/9 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 08:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 134 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3305 ms · baseline-v3 8/8

RatioDaemon on this skill: Link Brain looks aimed at local knowledge base for links. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

media-news-digest

dinstein · vsource-scanned

Generate media & entertainment industry news digests. Covers Hollywood trades (THR, Deadline, Variety), box office, streaming, awards season, film festivals, and production news. Four-source data collection from RSS feeds, Twitter/X KOLs, Reddit, and web search. Pipeline-based scripts with retry mechanisms and deduplication. Supports Discord and email output with PDF attachments.

High Risk · follow-on functionality checks failed · 9/10 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 09:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: handled cleanly · passed, expectation failed, handled fake credentials cleanly · output 162 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 6822 ms · baseline-v3 8/8

🕵️ expected proof signal was missing

fake-auth behavior: handled cleanly · Clearly fake credentials were exercised and handled normally.

RatioDaemon muttered: media-news-digest talked a big game, then missed its own proof signal. 9/10 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

nicholasrae-review-reply

nicholasrae · vsource-scanned

Automatically monitors your App Store reviews and drafts warm, on-brand replies for 1–3 star reviews — so unhappy users hear back fast. Connects to App Store Connect API, detects repeat complaint patterns as bug signals, and delivers a daily approval queue to Telegram at 8am. You approve, it sends. Supports multiple apps simultaneously.

High Risk · follow-on functionality checks failed · 6/8 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 10:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: concerning · passed, runtime failed, fell over when given fake credentials · output 99 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3669 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🚫 skill exited with an error · 💥 behaved badly with fake credentials

fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.

RatioDaemon muttered: nicholasrae-review-reply made it to runtime and then fell apart on contact, which is not ideal for a skill asking to be trusted. 6/8 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

nofx

tinkle-community · vsource-scanned

NOFX AI Trading OS integration - crypto market data, AI trading signals, strategy management, trader control, and automated reporting. Use when working with NOFX platform (nofxai.com, nofxos.ai) for crypto trading, market analysis, AI500/AI300 signals, fund flow tracking, OI monitoring, strategy creation, trader management, backtesting, or AI debate arena.

High Risk · follow-on functionality checks passed · 6/6 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 11:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 98 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1945 ms · baseline-v3 8/8

RatioDaemon on this skill: Nofx is trying to handle nofx. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

odoo-connector

nullnaveen · vsource-scanned

repository: https://github.com/NullNaveen/openclaw-odoo-skill

High Risk · follow-on functionality checks failed · 8/9 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 11:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, expectation failed · output 154 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3049 ms · baseline-v3 8/8

🕵️ expected proof signal was missing

RatioDaemon on this skill: Odoo Connector looks aimed at odoo connector. Follow-on functionality checks currently show first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

odoo-erp-connector

nullnaveen · vsource-scanned

repository: https://github.com/NullNaveen/openclaw-odoo-skill

High Risk · follow-on functionality checks failed · 8/9 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 08:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, expectation failed · output 154 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3073 ms · baseline-v3 8/8

🕵️ expected proof signal was missing

RatioDaemon on this skill: Odoo Erp Connector is built for odoo erp connector. Follow-on functionality checks currently show first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

pinchbench

olearycrew · vsource-scanned

Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting benchmark results to the leaderboard, or checking how well your OpenClaw setup handles calendar, email, research, coding, and multi-step workflows.

High Risk · follow-on functionality checks failed · 9/12 · confidence: source evidence

Runtime receipts + what failed · 2026-03-17 04:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: concerning · passed, fell over when given fake credentials, runtime failed · output 143 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 5024 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 💥 behaved badly with fake credentials · 🚫 skill exited with an error

fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.

RatioDaemon muttered: pinchbench left receipts, just not the ones it was supposed to, which is not ideal for a skill asking to be trusted. 9/12 functionality-v2 checks passed before the stumble. The shell entrypoint bogus env is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

reprompter

aytuncyildizli · vsource-scanned

High Risk · follow-on functionality checks passed · 6/6 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 09:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 98 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1959 ms · baseline-v3 8/8

RatioDaemon on this skill: Reprompter is trying to handle reprompter. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

silk

silostack · vsource-scanned

Agent banking and payments on Solana. Send and receive stablecoins with cancellable escrow transfers. Optional on-chain accounts with policy-enforced spending limits for human-delegated automation.

High Risk · follow-on functionality checks could not be fully tested · 9/10 · confidence: source evidence

Runtime receipts + what blocked setup · 2026-03-16 09:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, did not make it clear what the test should run · output 2.2 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3110 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🧭 did not make it clear what the test should run

RatioDaemon muttered: silk never made it clear what the test was even supposed to run. 9/10 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

silkyway

silostack · vsource-scanned

Agent banking and payments on Solana. Send and receive stablecoins with cancellable escrow transfers. Optional on-chain accounts with policy-enforced spending limits for human-delegated automation.

High Risk · follow-on functionality checks could not be fully tested · 9/10 · confidence: source evidence

Runtime receipts + what blocked setup · 2026-03-16 12:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, did not make it clear what the test should run · output 2.2 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3154 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🧭 did not make it clear what the test should run

RatioDaemon muttered: The runtime lane gave silkyway a chance to act normal. It declined and never made it clear what the test was even supposed to run. 9/10 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

supermarket

niemesrw · vsource-scanned

Search grocery products, find store locations, add items to cart, and view profile across all Kroger-family stores — Kroger, Ralphs, Fred Meyer, Harris Teeter, King Soopers, Fry's, QFC, Mariano's, Pick 'n Save, Metro Market, and more. Use when user asks about groceries, food shopping, store locations, or wants to manage their grocery cart.

High Risk · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 11:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 129 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2251 ms · baseline-v3 8/8

RatioDaemon on this skill: Supermarket is built for supermarket. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

tech-news-digest

dinstein · vsource-scanned

Generate tech news digests with unified source model, quality scoring, and multi-format output. Six-source data collection from RSS feeds, Twitter/X KOLs, GitHub releases, GitHub Trending, Reddit, and web search. Pipeline-based scripts with retry mechanisms and deduplication. Supports Discord, email, and markdown templates.

High Risk · follow-on functionality checks failed · 9/10 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 12:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: handled cleanly · passed, expectation failed, handled fake credentials cleanly · output 163 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 7249 ms · baseline-v3 8/8

🕵️ expected proof signal was missing

fake-auth behavior: handled cleanly · Clearly fake credentials were exercised and handled normally.

RatioDaemon muttered: tech-news-digest talked a big game, then missed its own proof signal. 9/10 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

vibetrading-code-gen

liuhaonan00 · vsource-scanned

Generate executable Hyperliquid trading strategy code from natural language prompts. Use when a user wants to create automated trading strategies for Hyperliquid exchange based on their trading ideas, technical indicators, or VibeTrading signals. The skill generates complete Python code with proper error handling, logging, and configuration using actual Hyperliquid API wrappers.

High Risk · follow-on functionality checks failed · 5/8 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 10:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · fake-auth behavior: concerning · passed, runtime failed, fell over when given fake credentials · output 450 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3382 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🚫 skill exited with an error · 💥 behaved badly with fake credentials

fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.

RatioDaemon muttered: vibetrading-code-gen made it to runtime and then fell apart on contact. 5/8 functionality-v2 checks passed before the stumble. The python syntax is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

agentguard

manas-io-ai · vsource-scanned

**Version:** 1.0.0

High Risk · follow-on functionality checks passed · 8/8 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 14:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 1.3 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3814 ms · baseline-v3 8/8

RatioDaemon on this skill: Agentguard sits in the agentguard lane. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

agentwallet-sdk

up2itnow · vsource-scanned

Non-custodial wallet SDK for autonomous AI agents. Handles x402 payments, CCTP V2 cross-chain bridge transfers, ERC-8004 agent identity, and Uniswap V3 token swaps — all without holding user keys.

High Risk · follow-on functionality checks passed · 7/7 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 13:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 126 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2203 ms · baseline-v3 8/8

RatioDaemon muttered: agentwallet-sdk behaved itself under runtime pressure. 7/7 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

asia-twitter-api-v1

renning22 · vsource-scanned

Search X (Twitter) in real time, monitor trends, extract posts, and analyze social media data—perfect for social listening and intelligence gathering. Safe read-only operations by default.

High Risk · follow-on functionality checks passed · 9/9 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 14:15 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · fake-auth behavior: handled cleanly · passed, handled fake credentials cleanly · output 161 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 3470 ms · baseline-v3 8/8

fake-auth behavior: handled cleanly · Clearly fake credentials were exercised and handled normally.

RatioDaemon muttered: asia-twitter-api-v1 looked ordinary in the good, boring way. 9/9 functionality-v2 checks passed. Pleasantly boring.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, password.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.

clawchat-p2p

alexrudloff · vsource-scanned

**Encrypted P2P messaging for connecting OpenClaw agents across different machines and networks.**

High Risk · follow-on functionality checks could not be fully tested · 8/9 · confidence: source evidence

Runtime receipts + what blocked setup · 2026-03-16 13:00 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · first failed run seen for this lane · passed, did not make it clear what the test should run · output 2.1 KB · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2892 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🧭 did not make it clear what the test should run

RatioDaemon muttered: The runtime lane gave clawchat-p2p a chance to act normal. It declined and never made it clear what the test was even supposed to run. 8/9 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo, password.

Decision cue: Review first — functionality-v2 already found trouble.

crypto-scam-detector

princedoss77 · vsource-scanned

Real-time cryptocurrency scam detection with database-first architecture. Protects users from phishing, honeypots, rug pulls, and ponzi schemes. No external API calls during checks!

High Risk · follow-on functionality checks failed · 9/12 · confidence: source evidence

Runtime receipts + what failed · 2026-03-16 01:45 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · first failed run seen for this lane · fake-auth behavior: concerning · passed, expectation failed, runtime failed, fell over when given fake credentials · output 171 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 5112 ms · baseline-v3 8/8

🕵️ expected proof signal was missing · 🚫 skill exited with an error · 💥 behaved badly with fake credentials

fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.

RatioDaemon muttered: The runtime lane gave crypto-scam-detector a chance to act normal. It declined and talked a big game, then missed its own proof signal. 9/12 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo, password.

Decision cue: Review first — functionality-v2 already found trouble.

dynamic-ui

theashbhat · vsource-scanned

Render tables, charts, stats, cards, and dashboards as images using HTML templates and wkhtmltoimage.

High Risk · follow-on functionality checks passed · 6/6 · confidence: source evidence

Runtime receipts + what passed · 2026-03-16 13:30 UTC

functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 24 hours · passed · output 98 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1993 ms · baseline-v3 8/8

RatioDaemon on this skill: Dynamic Ui is trying to handle dynamic ui. Follow-on functionality checks currently pass without failed checks, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo.

Decision cue: Proceed carefully — suspicious signals matter more than capability surface alone.
