🔎 Evidence browser

Search the skill radar

Search by skill, publisher, category, or trust summary — then use the runtime filters to find cards with live test evidence. The two main lanes are baseline safety checks first and deeper follow-on functionality checks after that.

Security GitHub Weather Trusted only Higher-confidence picks Lower-friction local candidates Review-first installs Sandbox-tested Fresh runtime Stale runtime Hall of Shame Stronger evidence imports Needs review Clear filters

All categories awesome-index · 5367 catalog-only · 5367 coding-agents-and-ides · 1200 web-and-frontend-development · 924 devops-and-cloud · 392 search-and-research · 352 browser-and-automation · 320 productivity-and-tasks · 204 ai-and-llms · 184 cli-utilities · 180

✨ Quick picks

Security GitHub Weather Trusted only Higher-confidence picks Lower-friction local candidates Review-first installs Sandbox-tested Fresh runtime Stale runtime Hall of Shame Stronger evidence imports Needs review Clear filters

🏷 Categories

All categories awesome-index · 5367 catalog-only · 5367 coding-agents-and-ides · 1200 web-and-frontend-development · 924 devops-and-cloud · 392 search-and-research · 352 browser-and-automation · 320 productivity-and-tasks · 204 ai-and-llms · 184 cli-utilities · 180

🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.

🧪 Runtime status: cards can show only the baseline safety lane or the deeper follow-on functionality lane, depending on how far the skill got.

📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.

⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.

🩺 Failure confidence: distinguishes a first seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.

Results

Showing 6 of 6 results for “te” · runtime: failed · freshness: fresh · sort: relevance

page evidence snapshotruntime-passed: 0 runtime-failed: 6 source-scanned: 6 fresh <24h: 6 manual review: 0

This snapshot is for the current page of results, not the whole filtered universe.

Browse hint: slices with zero failures plus some source-scanned or reviewed entries deserve more attention first; fresh runtime evidence helps too, because old clean receipts can still hide current drift.

casual-cron

gostlightai · vsource-scanned

Create Clawdbot cron jobs from natural language with strict run-guard rules. Use when: users ask to schedule reminders or messages (recurring or one-shot), especially via Telegram, or when they use /at or /every. Examples: 'Create a daily reminder at 8am', 'Remind me in 20 minutes', 'Send me a Telegram message at 3pm', '/every 2h'.

High Riskfollow-on functionality checks failed · 4/6confidence: source evidence

Runtime receipts + what failed2026-03-15 18:15 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 3.0 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 2086 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave casual-cron a chance to act normal. It declined and made it to runtime and then fell apart on contact.4/6 functionality-v2 checks passed before the stumble. The meta json shape is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf.

Decision cue: Review first — functionality-v2 already found trouble.

glitch-dashboard

chris6970barbarian-hue · vsource-scanned

Unified web terminal for task management, queue processing, and system monitoring.

High Riskfollow-on functionality checks failed · 8/9confidence: source evidence

Runtime receipts + what failed2026-03-15 09:45 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 2.5 KBartifacts 0worker oc-sandboxsource stage: cache hitsuite 2875 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: The runtime lane gave glitch-dashboard a chance to act normal. It declined and made it to runtime and then fell apart on contact.8/9 functionality-v2 checks passed before the stumble. The package json entrypoints is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: rm -rf, sudo .

Decision cue: Review first — functionality-v2 already found trouble.

aliyun-mail

jixsonwang · vsource-scanned

A skill to send emails via Aliyun enterprise email service with support for markdown, HTML text, attachments, and syntax highlighting for code blocks.

Use Cautionfollow-on functionality checks failed · 8/9confidence: source evidence

Runtime receipts + what failed2026-03-15 14:00 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 153 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3370 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon on this skillAliyun Mail is trying to handle aliyun mail. Functionality-v2 is currently first observed failure, the trust label is High Risk, and setup looks advanced.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: password.

Decision cue: Review first — functionality-v2 already found trouble.

sys-updater

spiceman161 · vsource-scanned

Production-safe Ubuntu maintenance orchestrator: runs daily apt security updates, tracks non-security updates across apt/npm/pnpm/brew with quarantine + auto-review, applies only approved updates, rotates logs/state, and generates clear 09:00 MSK Telegram reports (including what was actually installed).

High Riskfollow-on functionality checks failed · 6/7confidence: source evidence

Runtime receipts + what failed2026-03-15 21:30 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfirst failed run seen for this lanepassed, runtime_failedoutput 99 Bartifacts 0worker oc-sandboxsource stage: cache hitsuite 3162 msbaseline-v3 8/8

🕵️ expected proof signal was missing🚫 skill exited with an error

RatioDaemon muttered: sys-updater made it to runtime and then fell apart on contact, which is not ideal for a skill asking to be trusted.6/7 functionality-v2 checks passed before the stumble. The python help is the part that made this interesting.

Observed: skill-structure-ok

Take: Potentially suspicious implementation signals detected: sudo , password.

Decision cue: Review first — functionality-v2 already found trouble.

veille

romain-grosos · vsource-scanned

RSS feed aggregator, deduplication engine, LLM scoring, and output dispatcher for OpenClaw agents. Use when: fetching recent articles from configured sources, filtering already-seen URLs, deduplicating by topic, scoring with LLM, dispatching digests to Telegram/email/Nextcloud/file. Enhanced by mail-client (email output) and nextcloud-files (cloud storage).

High Riskfollow-on functionality checks failed · 0/1confidence: source evidence

Runtime receipts + what failed2026-03-15 21:31 UTC

functionality-v2evidence depth: follow-on functionality checkstested recently: within 24 hoursfailure repeated in more than one runregression after earlier passblocked_on_external_serviceoutput 375 Bartifacts 1worker oc-sandboxsource stage: cache hitsuite 5621 msbaseline-v3 8/8

RatioDaemon muttered: veille left receipts, just not the ones it was supposed to.0/1 functionality-v2 checks passed before the stumble. The forced external check is the part that made this interesting.

Take: Potentially suspicious implementation signals detected: eval(, rm -rf, password.

Decision cue: Review first — functionality-v2 already found trouble.

tide-watch

chrisagiddings · vsource-scanned

Proactive session capacity monitoring and management for OpenClaw. Prevents context window lockups by warning at configurable thresholds (75%, 85%, 90%, 95%), automatically backing up sessions before resets, and managing session resumption prompts. Use when working on long-running projects, managing multiple conversation channels (Discord, Telegram, webchat), or preventing lost work from full context windows. Includes CLI tools for capacity checks, cross-session dashboards, archive management, and session resumption. Supports any model or provider.

High Riskbaseline safety checks failed · 7/8confidence: source evidence

Runtime receipts + what failed2026-03-16 00:45 UTC

baseline-v3evidence depth: baseline checks onlytested recently: within 24 hoursfirst failed run seen for this laneexpectation_failed, passedoutput 591 Bartifacts 2worker oc-sandboxsource stage: fresh copysuite 2351 ms

🕵️ expected proof signal was missing

RatioDaemon muttered: tide-watch talked a big game, then missed its own proof signal, which is not ideal for a skill asking to be trusted.7/8 baseline-v3 checks passed before the stumble. The source-mount check is the part that made this interesting.

Observed: 12 /workspace/source-files.txt

Take: Potentially suspicious implementation signals detected: rm -rf, sudo , password.

Decision cue: Review first — baseline-v3 already found trouble.