Browse the skill radar
Search by skill, publisher, category, or trust summary, then use the runtime filters to find cards with live test evidence. The two main lanes run in order: baseline safety checks first, then deeper follow-on functionality checks.
✨ Quick picks
🏷 Categories · devops-and-cloud
🧾 Evidence level: source-scanned means local source evidence; catalog-only means thinner metadata-first coverage.
🧪 Runtime status: cards show either the baseline safety lane alone or the deeper follow-on functionality lane as well, depending on how far the skill got.
📏 Depth cue: tells you whether the evidence stops at baseline checks, includes follow-on functionality checks, or includes richer fixture/example proof.
⏱ Freshness cue: tells you whether the latest runtime evidence is from the last 24 hours, the last 7 days, or is older and therefore less current.
🩺 Failure confidence: distinguishes a first-seen failure from a repeated failure or a regression after an earlier pass, so not every red row means the same thing.
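As a rough illustration of how the freshness and failure-confidence cues above could be derived, here is a minimal Python sketch. The function names, the label strings, and the exact rules (in particular, treating a failure that directly follows a pass as a regression) are assumptions made for this example, not the catalog's actual implementation.

```python
from datetime import datetime, timedelta, timezone
from typing import Sequence


def freshness_cue(last_run: datetime, now: datetime) -> str:
    """Bucket the age of the latest runtime evidence (hypothetical labels)."""
    age = now - last_run
    if age <= timedelta(hours=24):
        return "last 24 hours"
    if age <= timedelta(days=7):
        return "last 7 days"
    return "older"


def failure_confidence(history: Sequence[bool]) -> str:
    """Classify the latest result from an oldest-first run history.

    True means the run passed. The rules below are an assumed reading
    of the legend, not a documented algorithm.
    """
    if not history or history[-1]:
        return "not failing"
    if len(history) == 1:
        return "first-seen failure"   # no earlier runs to compare against
    if history[-2]:
        return "regression"           # the previous run passed
    return "repeated failure"         # the previous run also failed


now = datetime(2024, 1, 10, tzinfo=timezone.utc)
print(freshness_cue(now - timedelta(hours=3), now))
print(failure_confidence([True, False]))
```

Under these assumptions a single red row with no history is read as a first-seen failure, while a red row that follows a green one is flagged as a regression.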
Results
ai-media
Full-stack AI media generation powered by a GPU server (RTX 3090/3080/2070S).
birdweather
Query BirdWeather station data — species detections, trends, and comparisons from BirdNET-Pi and PUC bird song detection stations. Use when asked about bird sightings, bird song detections, local bird activity, BirdWeather stations, or wildlife audio monitoring. No API key required.
email-resend
joplin-api
Manage Joplin notes via REST API. Use for creating, reading, updating, deleting, or searching Joplin notes programmatically.
trading-upbit-skill
Upbit automated trading (aggressive breakout) with cron-friendly run-once commands, TopVolume monitoring, and percent-based budget splitting.
visa-business-planner
This skill helps users create comprehensive business plans for obtaining a Japanese Business Manager Visa. It includes templates, examples, resources, and tools for market analysis, revenue projections, and legal requirements. Tailored for backgrounds in tech, media, and consulting.
azure-ai-projects-py
Build AI applications using the Azure AI Projects Python SDK (azure-ai-projects). Use when working with Foundry project clients, creating versioned agents with PromptAgentDefinition, running evaluations, managing connections/deployments/datasets/indexes, or using OpenAI-compatible clients. This is the high-level Foundry SDK - for low-level agent operations, use azure-ai-agents-python skill.
construction-pm
Construction project management toolkit for AI agents. Use when: (1) Tracking construction jobs through the full lifecycle from lead to paid, (2) Generating daily PM briefings on job status and pipeline health, (3) Checking for stale permits or aging jobs that need attention, (4) Parsing emails from project managers for job updates, (5) Monitoring revenue pipeline across all job statuses.
content-watcher
AI-powered content monitoring and summarization tool. Monitor RSS feeds, blogs, and news sources with automatic AI summarization and daily digest generation.
depguard
Dependency audit, vulnerability scanning, and license compliance. Free vuln check + paid continuous monitoring via git hooks.
lraivisto-researchvault
High-velocity research orchestration engine. Manages persistent state, synthesis, and autonomous verification for agents.
researchvault-brain
Local research orchestration and state management. Use when starting projects, logging progress, or exporting findings.
vault-research
High-velocity research orchestration engine. Manages persistent state, synthesis, and autonomous verification for agents.
vault-research-brain
High-velocity research orchestration engine. Manages persistent state, synthesis, and autonomous verification for agents.
vaultresearch
High-velocity research orchestration engine. Manages persistent state, synthesis, and autonomous verification for agents.
arxivkb
Local arXiv paper manager with semantic search. Crawls arXiv categories, downloads PDFs, chunks content, and indexes with FAISS + Ollama embeddings. No cloud API keys required — everything runs locally.
claudemem
agent-self-governance
Self-governance protocol for autonomous agents: WAL (Write-Ahead Log), VBR (Verify Before Reporting), ADL (Anti-Divergence Limit), VFM (Value-For-Money), and IKL (Infrastructure Knowledge Logging). Use when: (1) receiving a user correction — log it before responding, (2) making an important decision or analysis — log it before continuing, (3) pre-compaction memory flush — flush the working buffer to WAL, (4) session start — replay unapplied WAL entries to restore lost context, (5) any time you want to ensure something survives compaction, (6) before claiming a task is done — verify it, (7) periodic self-check — am I drifting from my persona? (8) cost tracking — was that expensive operation worth it? (9) discovering infrastructure — log hardware/service specs immediately.
devvit-publishing-auditor
A specialized auditor for Reddit Devvit developers to verify app readiness before uploading to the Reddit servers. It ensures compliance with Devvit CLI v0.12.x and Reddit’s publishing standards.
sovereign-api-mock-generator
Generates mock API servers from OpenAPI specs or examples. Realistic fake data, configurable delays, error simulation.
the-arena
token-usage-optimizer
Maximize your Claude Code subscription value with smart usage monitoring and burn rate optimization. Track 5-hour session and 7-day weekly quotas, get one-time alerts, and daily reports showing if you're under/over-using your $20-200/month plan. Ultra-lightweight (10min cache, minimal API calls). Perfect for Pro, Max 100, and Max 200 subscribers who want to get every dollar's worth.
0x0-messenger
Send and receive P2P messages using disposable numbers and PINs. No servers, no accounts. Use for human notifications, approval flows, and agent-to-agent communication.
agent-evaluation
Testing and benchmarking LLM agents, including behavioral testing, capability assessment, reliability metrics, and production monitoring, where even top agents achieve less than 50% on real-world benchmarks. Use when: agent testing, agent evaluation, benchmark agents, agent reliability, test agent.