🔎 Evidence browser

Browse the trust index

Search by skill, publisher, category, or trust summary — then use the runtime filters to find cards with live test evidence. The two main lanes are baseline safety checks first and deeper follow-on functionality checks after that.

🏷 Categories · coding-agents-and-ides

🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.

🧪 Runtime status: cards can show only the baseline safety lane or the deeper follow-on functionality lane, depending on how far the skill got. Some cards now also surface how the skill behaved when clearly fake credentials were present.

📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.

⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.

🩺 Failure confidence: distinguishes a first-seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.

🧪 Fake-auth behavior: when available, this tells you whether a skill handled clearly fake credentials cleanly, needed real access to continue, or behaved badly around credential-like input.
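
The freshness and failure-confidence cues above can be read as simple bucketing rules. The following is a minimal sketch, not the index's actual implementation: the function names, the exact boundary handling, and the pass/fail history format are all assumptions made for illustration.

```python
from datetime import datetime, timedelta, timezone


def freshness_cue(tested_at: datetime, now: datetime) -> str:
    """Bucket the latest runtime evidence by age (boundaries are assumed)."""
    age = now - tested_at
    if age <= timedelta(hours=24):
        return "within 24 hours"
    if age <= timedelta(days=7):
        return "within 7 days"
    return "older"


def failure_confidence(history: list[bool]) -> str:
    """Classify the latest run of one lane.

    history is a hypothetical oldest-to-newest list of results,
    True for a pass and False for a failure.
    """
    if not history or history[-1]:
        return "not failing"
    earlier = history[:-1]
    if not earlier:
        return "first seen failure"
    if earlier[-1]:
        # Assumption: a failure directly after a pass counts as a regression.
        return "regression after an earlier pass"
    return "repeated failure"


if __name__ == "__main__":
    now = datetime(2026, 3, 16, 2, 0, tzinfo=timezone.utc)
    tested = datetime(2026, 3, 15, 20, 30, tzinfo=timezone.utc)
    print(freshness_cue(tested, now))
    print(failure_confidence([True, False]))
```

Under these assumptions a lone failure and a failure after a pass land in different buckets, which is the distinction the failure-confidence cue is meant to make visible.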

Results

Showing 24 of 1200 skills in the browsable catalog view · reviewed: no · category: coding-agents-and-ides · sort: score
This snapshot is for the current page of results, not the whole filtered universe.
Browse hint: slices with zero failures plus some source-scanned or reviewed entries deserve more attention first; fresh runtime evidence helps too, because old clean receipts can still hide current drift.
Quick guide for newcomers: start by scanning the card badges for runtime passed, source-scanned, and fresh evidence. Then use the decision cue on each card to sort “good first pick” from “needs review” without opening every result.
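
The badge scan described in the quick guide can be sketched as a sort over the three badges it names. This is an illustrative sketch only: the `Card` fields and the idea of counting badges are assumptions, "fresh" is taken here to mean tested within 7 days, and the three sample rows are simplified from cards on this page.

```python
from dataclasses import dataclass


@dataclass
class Card:
    # Hypothetical fields; the real card schema is not shown on this page.
    name: str
    runtime_passed: bool
    source_scanned: bool
    fresh: bool


def badge_count(card: Card) -> int:
    """Count how many of the three newcomer badges a card carries."""
    return sum([card.runtime_passed, card.source_scanned, card.fresh])


def scan_order(cards: list[Card]) -> list[Card]:
    """Order cards so the ones with the most badges are read first."""
    return sorted(cards, key=badge_count, reverse=True)


if __name__ == "__main__":
    cards = [
        Card("architecture-research", False, False, False),
        Card("claude-code-mastery", True, True, True),
        Card("aiqbee", False, True, False),
    ]
    for card in scan_order(cards):
        print(card.name, badge_count(card))
```

A badge count only decides reading order. The trust label and the decision cue on each card still determine whether it is a good first pick or needs review.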

ai-presentation-maker

jeffjhunter · vsource-scanned
57
overall

AI Presentation Maker — the interview-driven pitch deck generator for your OpenClaw agent. Tell it what you built, who you're presenting to, and pick an angle — it generates a complete slide deck with speaker notes, factual validation, and real cost breakdowns. No made-up ROI. No speculative projections. Just compelling presentations built from actual work. Exports to Markdown, PPTX, and PDF. Works standalone or alongside AI Persona OS. Built by Jeff J Hunter.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found higher-privilege capability areas (token, email), but that alone is not evidence of malicious behavior.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

aiqbee

louisgoodier · vsource-scanned
57
overall

Connect to your Aiqbee knowledge graph via MCP. Search, create, and link neurons across your architecture, portfolio, and digital strategy brains.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found higher-privilege capability areas (oauth), but that alone is not evidence of malicious behavior.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

appdev

ojaskarmarkar · vsource-scanned
57
overall

Triggers whenever the user asks to build a feature, fix a bug, create a screen, or modify the mobile app.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

architecture-research

brennerspear · vcatalog
57
overall

Research the architecture of a codebase or system.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

attio-apikey

felicitationes · vcatalog
57
overall

Direct Attio CRM integration for OpenClaw with full CRUD capabilities.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

benlee-skillguard

benlee2144 · vsource-scanned
57
overall

Security scanner that audits OpenClaw skills for malicious code, prompt injection, supply chain attacks, data exfiltration, and more

High Risk · follow-on functionality checks failed · 6/7 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-15 20:30 UTC
functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · first failed run seen for this lane · passed, expectation failed · output 99 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 2450 ms · baseline-v3 8/8
🕵️ expected proof signal was missing
RatioDaemon on this skill: Benlee Skillguard is trying to handle benlee skillguard. Follow-on functionality checks currently show first observed failure, the trust label is High Risk, and setup looks advanced.
Observed: skill-structure-ok
Take: Potentially suspicious implementation signals detected: eval(, password.
Newcomer read: Review first — functionality-v2 already found trouble.

blackswan

bilalmotiwala · vsource-scanned
57
overall

Real-time crypto risk intelligence; before and as things break. Two tools: Flare (15-min precursor detection, immediate alarms) and Core (60-min state synthesis, context assessment). Free access to the last analysis. No API key required. Upgrade to x402 for custom analysis.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

bloom-identity-skill

unicornbloom · vsource-scanned
57
overall

Generate Bloom Identity Card from conversation history and Twitter/X data. Analyzes supporter personality through conversations (85% weight) and optionally enriched with Twitter activity (15% weight). Creates personality type (Visionary/Explorer/Cultivator/Optimizer/Innovator), recommends matching OpenClaw skills, and generates agent wallet. Use when user asks to "generate my bloom identity", "create identity card", "analyze my profile", or "discover my personality".

High Risk · baseline safety checks failed · 7/8 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-15 22:45 UTC
baseline-v3 · evidence depth: baseline checks only · tested recently: within 7 days · first failed run seen for this lane · expectation failed, passed · output 654 B · artifacts 2 · worker oc-sandbox · source stage: fresh copy · suite 2353 ms
🕵️ expected proof signal was missing
RatioDaemon muttered: The runtime lane gave bloom-identity-skill a chance to act normal. It declined and talked a big game, then missed its own proof signal. 7/8 baseline-v3 checks passed before the stumble. The source-mount check is the part that made this interesting.
Observed: 12 /workspace/source-files.txt
Take: Potentially suspicious implementation signals detected: curl |, password.
Newcomer read: Review first — baseline-v3 already found trouble.

bloom-taste-finder

unicornbloom · vsource-scanned
57
overall

Bloom Taste Finder — discover your builder taste across 4 spectrums and get a personalized tool stack. For indie devs, vibe coders, and AI builders.

High Risk · baseline safety checks failed · 7/8 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-15 20:00 UTC
baseline-v3 · evidence depth: baseline checks only · tested recently: within 7 days · first failed run seen for this lane · expectation failed, passed · output 654 B · artifacts 2 · worker oc-sandbox · source stage: fresh copy · suite 2299 ms
🕵️ expected proof signal was missing
RatioDaemon on this skill: Bloom Taste Finder is built for bloom taste finder. Baseline safety checks currently show first observed failure, the trust label is High Risk, and setup looks advanced.
Observed: 12 /workspace/source-files.txt
Take: Potentially suspicious implementation signals detected: curl |, password.
Newcomer read: Review first — baseline-v3 already found trouble.

brw-brand-voice-extractor

brianrwagner · vcatalog
57
overall

Extract or build a distinct brand voice profile that AI agents can use to produce on-brand content every time.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

claude-code-mastery

cheenu1092-oss · vsource-scanned
57
overall

Master Claude Code for coding tasks. Includes setup scripts, dev team subagents (starter pack or full team), self-improving learning system, diagnostics, and troubleshooting.

High Risk · follow-on functionality checks passed · 6/6 · confidence: source evidence · source-scanned · suspicious
What the test confirmed · 2026-03-15 21:15 UTC
functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · passed · output 99 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 1968 ms · baseline-v3 8/8
RatioDaemon muttered: claude-code-mastery behaved itself under runtime pressure. 6/6 functionality-v2 checks passed. Pleasantly boring.
Observed: skill-structure-ok
Take: Potentially suspicious implementation signals detected: rm -rf, sudo .
Newcomer read: Proceed carefully — suspicious signals matter more than capability surface alone.

clawder

assassin808 · vsource-scanned
57
overall

Use Clawder to sync identity, browse post cards, swipe with a comment, and DM after match.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

code-mentor

samuelkahessay · vsource-scanned
57
overall

Comprehensive AI programming tutor for all levels. Teaches programming through interactive lessons, code review, debugging guidance, algorithm practice, project mentoring, and design pattern exploration. Use when the user wants to: learn a programming language, debug code, understand algorithms, review their code, learn design patterns, practice data structures, prepare for coding interviews, understand best practices, build projects, or get help with homework. Supports Python and JavaScript.

High Risk · confidence: source evidence · source-scanned · suspicious
Take: Potentially suspicious implementation signals detected: eval(, password.
Newcomer read: Proceed carefully — suspicious signals matter more than capability surface alone.

cognitive-clarity

cognitivevelocity · vcatalog
57
overall

Cognitive accessibility linter for outbound messages.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

competitor-docs

carev01 · vcatalog
57
overall

Search and analyze competitor documentation archives using full-text search (FTS)

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

content-machine

cryptocana · vcatalog
57
overall

Full-stack content creation persona for OpenClaw agents.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

cron-visualizer

autogame-17 · vcatalog
57
overall

Visualizes system cron jobs on a 24h timeline to identify overlaps and bottlenecks.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

crypto-address-checker

princedoss77 · vsource-scanned
57
overall

Real-time cryptocurrency scam detection with database-first architecture. Protects users from phishing, honeypots, rug pulls, and ponzi schemes. No external API calls during checks!

High Risk · follow-on functionality checks failed · 9/11 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-15 23:00 UTC
functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · first failed run seen for this lane · passed, expectation failed, runtime failed · output 171 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 4221 ms · baseline-v3 8/8
🕵️ expected proof signal was missing · 🚫 skill exited with an error
RatioDaemon muttered: crypto-address-checker talked a big game, then missed its own proof signal. 9/11 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.
Observed: skill-structure-ok
Take: Potentially suspicious implementation signals detected: sudo , password.
Newcomer read: Review first — functionality-v2 already found trouble.

crypto-genie

princedoss77 · vsource-scanned
57
overall

AI-powered cryptocurrency safety assistant with database-first architecture. Protects users from phishing, honeypots, rug pulls, and ponzi schemes. No external API calls during checks!

High Risk · follow-on functionality checks failed · 9/11 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-15 20:15 UTC
functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · first failed run seen for this lane · passed, expectation failed, runtime failed · output 171 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 4203 ms · baseline-v3 8/8
🕵️ expected proof signal was missing · 🚫 skill exited with an error
RatioDaemon muttered: crypto-genie talked a big game, then missed its own proof signal. 9/11 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.
Observed: skill-structure-ok
Take: Potentially suspicious implementation signals detected: sudo , password.
Newcomer read: Review first — functionality-v2 already found trouble.

crypto-scam-detector

princedoss77 · vsource-scanned
57
overall

Real-time cryptocurrency scam detection with database-first architecture. Protects users from phishing, honeypots, rug pulls, and ponzi schemes. No external API calls during checks!

High Risk · follow-on functionality checks failed · 9/12 · confidence: source evidence · source-scanned · suspicious
What the test found · 2026-03-16 01:45 UTC
functionality-v2 · evidence depth: follow-on functionality checks · tested recently: within 7 days · first failed run seen for this lane · fake-auth behavior: concerning · passed, expectation failed, runtime failed, fell over when given fake credentials · output 171 B · artifacts 0 · worker oc-sandbox · source stage: cache hit · suite 5112 ms · baseline-v3 8/8
🕵️ expected proof signal was missing · 🚫 skill exited with an error · 💥 behaved badly with fake credentials
fake-auth behavior: concerning · Fake credentials triggered bad behavior or sloppy handling.
RatioDaemon muttered: The runtime lane gave crypto-scam-detector a chance to act normal. It declined and talked a big game, then missed its own proof signal. 9/12 functionality-v2 checks passed before the stumble. The requirements txt shape is the part that made this interesting.
Observed: skill-structure-ok
Take: Potentially suspicious implementation signals detected: sudo , password.
Newcomer read: Review first — functionality-v2 already found trouble.

cubistic-public-bots

andreasnordenadler · vsource-scanned
57
overall

Explain how external/public bots can participate in Cubistic (cubistic.com) and help maintain the Public Bot API docs (PoW challenge + /act). Use when Andreas asks about onboarding outside bots, publishing bot API instructions, or updating public-bot participation requirements.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

cursor-cli-headless

daxingplay · vcatalog
57
overall

Execute coding tasks using the Cursor CLI in headless print mode.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.

cursor-council

xiaoyaner0201 · vsource-scanned
57
overall

Multi-Cursor orchestration for parallel task execution and AI council deliberation. Use when needing to run multiple Cursor agents in parallel, coordinate complex multi-step coding tasks, get diverse perspectives from different AI models (Opus/Sonnet/GPT) on technical decisions, or synthesize multi-agent discussions into actionable recommendations.

Insufficient Evidence · confidence: source evidence · source-scanned · privileged capability
Take: Source-aware scan found normal operational surface via environment, network, or shell-related references.
Newcomer read: Decent evidence base — source-level signals are available, so inspect the receipts.

dancetech

arunnadarasa · vcatalog
57
overall

Complete agentic dance engineering system for Krump: automated posts, community engagement, league tracking.

Insufficient Evidence · confidence: limited evidence · catalog-only · privileged capability
Take: Indexed from the community catalog. Source-aware static analysis and manual review are still pending.
Newcomer read: Needs more proof — do not treat this card like a verified green light.