Productivity
Agent Lens
Track AI agent API calls, analyze token usage, and optimize costs. Use when the user wants to monitor LLM spending, debug API calls, track token consumption, or...
AI / ML
Sage Router
Local-first AI model routing for serious agents. One endpoint. Any provider. The router figures out the rest.
AI / ML
mistral-mcp
MCP server (stdio + Streamable HTTP) wrapping the full Mistral AI surface — chat, embeddings, vision, OCR, Voxtral audio (transcribe/speak), Codestral FIM, agents, moderation, clas…
AI / ML
forge-evals
Design evaluations for LLM features including golden datasets, rubric scoring, LLM-as-judge calibration, CI regression detection, online A/B tests, cost and latency budgets, and ad…
Web
forge-llm-streaming
Streaming LLM output correctly with SSE setup, partial-token UTF-8 decoding, AbortController cancellation, mid-stream errors as events, append-only UI rendering, and backpressure h…
AI / ML
output-dev-model-selection
Pick the right LLM model for an Output SDK prompt file. Use when writing a new .prompt file, reviewing a model choice, or upgrading a stale model. Walks through priority (reasoning…
Productivity
infer-vault-structure
Four-stage pipeline that takes a corpus of seed content, clusters by semantic similarity, proposes a vault taxonomy via LLM, renders a user-reviewable import plan, then runs a 3-st…
DevOps
myco:cost-optimization-performance-telemetry
Comprehensive procedures for analyzing, optimizing, and monitoring LLM costs and performance. Covers cost leak identification, performance bottleneck analysis, resource allocation …
AI / ML
aigcpanel-skills
Call local AI models and task processing via AigcPanel Pro HTTP interface or CLI. Use to list available models, generate content, query task results, or batch process video/audio/i…
AI / ML
llm-council
Coordinate a configurable multi-LLM planning council that generates independent plans, anonymizes them, then judges and merges into a single final plan. Supports retries, structure…
Documentation
Guide to Writing Interactive Tutorials
Summarizes how to create high-quality interactive technical tutorials based on the llm-intro.md pattern. Covers progressive content expansion, article structure, interactive compon…
Security
adversarial-prompt-testing
Evaluates LLM applications against prompt injection, jailbreaks, data exfiltration, and indirect attacks. Supplies attack taxonomies, test harness patterns, automated probes, defen…
Productivity
aidd-methodology
Defines AI-Driven Development methodology and documentation principles for LLM-agent projects. Use for AIDD workflows, idea.md/vision.md creation, architecture docs, Context First …
Productivity
h2c_compress
Compresses a natural language prompt into an equivalent H2C block, cutting input tokens while preserving semantic content. Returns the ready-to-use H2C block, before/after token co…
AI / ML
bitnet
Microsoft BitNet 1-bit LLM setup, inference, and benchmarking on CPU. Automates cloning bitnet.cpp, conda environment creation, GGUF model downloads, ternary kernel builds, and run…
AI / ML
vllm-chat-templates
vLLM chat-template reference covering Jinja prompt resolution order, chat_template_kwargs allowlist behavior, 27 bundled tool templates, and known model-family quirks for Qwen3, De…
AI / ML
eval-driven-development
Build language-model-integrated systems by writing evaluations first. Covers statistical eval nature, five primitives, judgment taxonomy, system evals vs benchmarks, and how result…
AI / ML
pageindex-rag
Replace semantic or vector RAG with PageIndex, a vectorless retrieval system driven by hierarchical LLM reasoning. Use when migrating from embeddings to structured indexing or when…
Web
geo-optimization
Optimizes content for AI-generated answers and LLM citations. Applies terminology alignment, FAQ schemas, and community signals to improve discoverability in engines such as ChatGP…
AI / ML
ai-product
AI product engineer with experience shipping LLM features to millions of users. Focuses on production realities: debugging hallucinations, cost optimization, safety systems, and mo…
AI / ML
building-ai-chat
Builds AI chat interfaces and conversational UIs with streaming responses, context management, and multi-modal support. Handles token limits, regeneration, feedback loops, tool usa…
AI / ML
agent-reliability-advanced
Identify agent reliability anti-patterns including retrying non-retryable errors, fixed sleep instead of backoff with jitter, single timeouts, aggressive circuit breaker thresholds…
Security
llm-trading-agent-security
Security patterns and controls for autonomous trading agents with wallet or transaction authority. Covers prompt injection, spend limits, pre-send simulation, circuit breakers, MEV…
Testing
llm-verification
Perform independent adversarial verification of a task after all DoD criteria pass, before marking complete. Activates on task closure, verification requests, or when green DoD is …
Automation
minion-orchestrator
Unified orchestration for deterministic shell jobs and LLM subagent workflows. Supports job submission, background tasks, subagent spawning, progress tracking, and parallel fan-out…
AI / ML
arize-trace
Download, export, or inspect Arize traces and spans to debug LLM app behavior, runtime issues, failures, or regressions. Export by ID for sessions, spans, or traces and investigate…
AI / ML
arize-prompt-optimization
Optimize, debug, or improve LLM prompts with production trace data, evaluations, and annotations. Extract prompts from spans, gather performance signals, and run data-driven optimi…
AI / ML
evaluating-llms
Evaluates LLM systems using automated metrics, LLM-as-judge methods, and benchmarks. Use when testing prompt quality, validating RAG pipelines, measuring safety, or comparing model…
Automation
llmer-demo
Execute browser or terminal automation scenarios with video recording from natural language descriptions. Use for generating demo videos, capturing HAR files, recording CLI demos, …
Productivity
prompt
Create optimized, copy-ready prompts for any AI tool including LLMs, coding agents, image generators, and workflow tools. Extract intent, select templates, run diagnostics, and out…
AI / ML
free-llm-apis
Guide users through obtaining and configuring free API keys for LLM providers. Use when setting up free LLM access, connecting to free model providers, or configuring OpenAI-compat…
Engineering
ai-native-engineering
Architect systems containing LLMs, agents, retrieval, or model-driven automation. Defines model contracts, memory strategy, retrieval topology, tool surfaces, control flows, guardr…
AI / ML
ai-personalization
Builds AI-powered personalization systems including recommendation engines, collaborative filtering, content-based filtering, user preference learning, cold-start solutions, and LL…
Data
ingest
Ingest sources into an LLM Wiki vault. Detects source type, extracts content, creates wiki pages, updates index and log. Supports URLs, PDFs, YouTube, tweets, gists, text, and curr…
AI / ML
nemo-curator
GPU-accelerated curation pipeline for LLM training data across text, image, video, and audio modalities. Includes 16× faster fuzzy deduplication and 30+ heuristic quality filters w…
AI / ML
ash-ai
Guidelines for integrating AI capabilities with Ash Framework including vectorization, exposing actions as LLM tools, prompt-backed actions, MCP servers, semantic search, and LangC…
AI / ML
building-multiagent-systems
Designs and implements systems with multiple AI agents that coordinate to accomplish tasks. Triggers on multi-agent, orchestrator, sub-agent, coordination, delegation, or agent hie…
Research
gitinjest
Analyze GitHub repositories by converting them to LLM-readable text. Triggers on pasted github.com URLs or queries about external codebases. Supports public and private repos plus …
Productivity
context-window-health-check
Assess current session context health and receive a clear action recommendation. Use when the session feels long, earlier details seem forgotten, or deciding whether to continue or…
AI / ML
langsmith-sdk-for-llm-tracing-and-evaluation
Provides tracing, evaluation, and debugging workflows for LLM applications. Useful when an agent team needs structured observability around prompts, chains, tool calls, datasets, a…
AI / ML
llm-architect
Designs scalable LLM systems including architecture, deployment, optimization, and production serving. Focuses on performance, cost efficiency, and safety for large language model …
Testing
llm-artifacts-detection
Detects common LLM coding agent artifacts such as test quality issues, dead code, over-abstraction, and verbose patterns. Use for cleaning AI-generated code or reviewing agent-intr…
Productivity
obsidian-ai-session-ingest
Archives LLM conversations into an Obsidian vault as structured knowledge records. Preserves original queries, detailed outputs, insights, wikilinks, and reproduction steps for lat…
AI / ML
agents
Patterns and architectures for building AI agents and workflows with LLMs. Use when designing systems that involve tool use, multi-step reasoning, autonomous decision-making, or or…
AI / ML
chatbot
Manage multi-turn conversations with history tracking, context retention, agent coordination, and RAG integration. Supports chatbot creation, interactive assistants, and customer s…
AI / ML
agent-reliability
Implement agent reliability patterns: exponential backoff with jitter, timeout hierarchies, fallback chains, circuit breakers, token budgets, model tiering, rate limiting, and obse…
AI / ML
dspy
Build complex AI systems with declarative programming, auto-optimize prompts, and create modular RAG pipelines and agents using DSPy, Stanford NLP's framework for systematic LLM de…
AI / ML
training-llms-megatron
Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models over 1B parameters and needing maximum t…
AI / ML
agent-tools
Access 250+ AI services through inference.sh including image and video generation, LLMs, search, 3D rendering, and social automation with models such as FLUX, Veo, Gemini, Grok, an…
AI / ML
agent-tools
Access 150+ AI services through inference.sh including image and video generation, LLMs, search, 3D rendering, and social automation with models such as FLUX, Veo, Gemini, Grok, an…
AI / ML
interactor-agents
Build LLM-powered assistants that integrate tools and data sources. Suitable for conversational interfaces, chatbots, tool-calling agents, and systems that query databases or exter…
AI / ML
which-llm
Look up current LLM intelligence, cost-to-run, benchmark scores, capabilities, and OpenRouter slugs including free tiers. Use for model comparisons, cost questions, or API wiring d…
Security
llamaguard
Meta's 7-8B specialized moderation model for LLM input/output filtering across six safety categories: violence/hate, sexual content, weapons, substances, self-harm, and criminal pl…
Security
obliteratus
Removes refusal behaviors from open-weight LLMs using mechanistic interpretability techniques while preserving reasoning. Offers multiple CLI methods, analysis modules, and model p…
AI / ML
fastdeploy-llm-integration
Guide adding inference deployment support for a new open-source LLM to FastDeploy. Analyze architecture, choose base class, generate implementation, update registries, and produce …
Automation
llm-wiki-compiler
Automate LLM Wiki knowledge compilation. Scans a target directory for new files and integrates them into an Obsidian knowledge base using the Ingest → Summarize → Compile → Log wor…
AI / ML
neo4j-graphrag-skill
Construct GraphRAG pipelines on Neo4j using the neo4j-graphrag package: retrievers with graph traversal, pipeline orchestration, knowledge graph construction, and framework integra…
AI / ML
openevolve-evolutionary-coding
Apply the OpenEvolve framework for autonomous code optimization using LLM-driven evolutionary algorithms. Supports MAP-Elites, code optimization, and creation of self-improving pro…
Showing the top 60 of 807.