Testing
block-testing
Automated testing guidance for custom AEM Edge Delivery Services blocks. Analyzes block JavaScript and CSS for common issues including missing null checks, unscoped CSS selectors, …
AI / ML
forge-evals
Design evaluations for LLM features including golden datasets, rubric scoring, LLM-as-judge calibration, CI regression detection, online A/B tests, cost and latency budgets, and ad…
Testing
playwright-e2e-builder
Plans and builds comprehensive Playwright E2E test suites using Page Object Model, authentication state persistence, custom fixtures, visual regression, and CI integration. Conduct…
Testing
playwright-e2e-execution-run
Executes an existing Playwright end-to-end suite against a confirmed non-production target and returns a structured run report including pass/fail counts, flaky tests, durations, a…
Testing
unit-test-bean-validation
Provides patterns for unit testing Jakarta Bean Validation (JSR-380) constraints including @Valid, @NotNull, @Min, @Max, and @Email with Hibernate Validator. Generates custom valid…
Testing
pev-test-design
Analyzes Acceptance Criteria using six classical QA techniques: equivalence partitioning, boundary value analysis, decision tables, state transitions, error guessing, and checklist…
Testing
designing-distributed-system-tests
Designs claim-driven test plans for distributed or stateful systems involving persistence, replication, consensus, or partial failure. Investigates product guarantees then creates …
Testing
qa-testing-routing
Routes QA and testing tasks across visual QA, certification, performance, API, accessibility, tools, and workflow optimization. Handles evidence-based bug reporting with screenshot…
Testing
qa-tester
Ensures 100% scenario coverage for any project feature. Builds a mandatory scenario matrix before writing tests, covering success paths, errors, missing fields, partial combination…
Testing
unit-test-service-layer
Provides patterns for unit testing service layers with Mockito. Creates isolated tests that mock repository calls, verify method invocations, test exception scenarios, and stub ext…
Testing
accessibility-test
Automated WCAG 2.1 AA accessibility testing with axe-core and Lighthouse CI. Auto-detects frameworks, discovers routes, installs Playwright for page scanning and jest-axe for compo…
Testing
accessibility-selenium-testing
Accessibility testing toolkit using Selenium WebDriver 4+ with Java 21+ and axe-core. Validates WCAG 2.1/2.2 compliance, scans pages for a11y violations, tests keyboard navigation,…
Design
design-ship
End-to-end design handoff to pull request: imports a handoff bundle, generates Storybook stories and Playwright tests, runs diff-aware browser verification, and opens a PR with bun…
Testing
unit-test-utility-methods
Provides patterns for testing utility classes, static methods, and helper functions. Validates pure functions, null handling, edge cases, and boundary conditions. Generates asserti…
Testing
snapshot-testing
Apply snapshot testing for structural regression checks on complex outputs: capture serialized data, rendered DOM, or pixel visuals, then compare against approved baselines while m…
Testing
Advanced Playwright E2E Framework
Enterprise-grade Playwright test automation framework using 8-layer architecture with Page Object Model, Module Pattern, custom fixtures, API testing layer, structured logging, dat…
Testing
mcp-server-evaluations
Test MCP servers for quality and reliability. Verify tool functionality, test error handling, generate tests, and assess response quality using only curl. Apply when validating imp…
Testing
qa-run-light
Executes QA-*.md checklists via MCP Playwright for rapid lightweight reporting. Derives 7-category insights from a single QA file plus execution evidence. Excludes deep AI visual j…
Testing
batch-bug-shepherd
Moves batches of suspected bugs from raw issue lists to mergeable PRs. Spawns triage subagents per issue, cross-references against open PRs, shepherds community PRs or opens fixes …
Testing
unit-test-boundary-conditions
Provides edge case, corner case, and boundary condition testing patterns for Java unit tests. Validates minimum/maximum values, null cases, empty collections, numeric overflow, flo…
Testing
unit-test-scheduled-async
Provides patterns for unit testing Spring @Scheduled and @Async methods using JUnit 5, CompletableFuture, Awaitility, and Mockito. Covers mocking task execution, verifying executio…
Testing
mobile-flows-maestro
Creates and runs Maestro flows for iOS and Android apps, manages the Maestro MCP server, diagnoses flaky UI tests, and supports mobile E2E testing onboarding. Use for Maestro, mobi…
Testing
dotnet-testing-advanced-xunit-upgrade-guide
Complete guide to upgrading xUnit from 2.9.x to 3.x, covering breaking changes, package updates, async void fixes, and IAsyncLifetime adjustments. Introduces new features including…
Engineering
completion-rule
Automatic self-check and correction rules for LLM code generation tasks. Enforces functional self-tests, unit test design, and linter checks after output. Auto-fixes until all pass…
Testing
mobile-test
Creates full mobile test coverage across unit, widget, integration, snapshot, and accessibility layers. Auto-detects Flutter, React Native, or native iOS/Android stacks and emits t…
Testing
unit-test-caching
Provides patterns for unit testing Spring Cache annotations including @Cacheable, @CachePut, and @CacheEvict. Generates test code that mocks cache managers, verifies hit/miss behav…
Testing
paper-autoraters
Execute the four paper-quality autoraters: Citation F1, Literature Review Quality (6-axis), SxS Overall Paper Quality, and SxS Literature Review Quality. Triggers on requests to sc…
General
acc-create-psr17-http-factory
Generates PSR-17 HTTP Factory classes for PHP 8.5 including RequestFactoryInterface, ResponseFactoryInterface, StreamFactoryInterface, UriFactoryInterface, ServerRequestFactoryInte…
Testing
beaver-build
Creates robust tests that catch issues before production. Surveys code, selects test targets, builds coverage with care, reinforces weak areas, and ships with confidence. Use for w…
Testing
microbenchmarking
Handles BenchmarkDotNet usage for .NET microbenchmarking tasks including creation, execution, configuration, and review. Covers design, project setup, efficient running, and side-b…
Testing
launching-activities-with-activityscenario
Launches, drives, and tears down Activities from instrumentation tests using ActivityScenario and ActivityScenarioRule. Covers launch methods, state transitions, recreation, onActi…
Security
owasp-api-security
OWASP API Security Top 10 testing patterns, injection payloads, auth bypass vectors, and security test generation for REST APIs. Use when writing security tests, reviewing API endp…
Engineering
engineering
Orchestrates engineering execution from a planned JIRA ticket using TDD, linting, type checking, tests, and behavioral verification. Supports first-cut implementation and PR revisi…
Testing
loom-test-strategy
Provides test strategy guidance including test pyramid design, coverage goals, categorization, flaky test diagnosis, infrastructure architecture, and risk-based prioritization acro…
Testing
spec-driven-qa
Validates code quality through six-phase spec-driven QA with Execute-Verify-Gate enforcement at every step. Enforces coverage thresholds, detects anti-patterns, validates spec comp…
Data
syntherklaas
Generates synthetic test data through interactive dialog covering columns, types, foreign keys, and constraints. Renders ASCII UML, applies volume and distributions, then outputs c…
Testing
unit-test-config-properties
Provides patterns for unit testing @ConfigurationProperties classes. Validates property binding, tests validation constraints, verifies default values, checks type conversions, and…
Testing
browser-validator
Automatically validate implementations in real browsers after code is written. Tests responsive breakpoints, WCAG AA accessibility compliance, keyboard navigation, ARIA attributes,…
Testing
loom-testing
Implements tests across unit, integration, e2e, security, infrastructure, data pipeline, and ML domains using pytest, jest, vitest, mocha, junit, or testify. Supports TDD/BDD workf…
Testing
openclaw-parallels-smoke
Execute end-to-end Parallels smoke tests, upgrades, and reruns for OpenClaw across macOS, Windows, and Linux guests. Covers VM install validation, gateway checks, release-to-main u…
Testing
probe-define
Generate probe-skeleton YAML files from uncovered test-matrix cells in feature-flows.yaml. Each skeleton includes feature_id, test_class, cross-references, and a TODO-marked arrang…
Testing
injecting-mouse-and-keyboard
Drive Jetpack Compose UI tests with non-touch input using performMouseInput for clicks, drags, scrolls, and hovers, performKeyInput for key events and modifier state checks, and pe…
Testing
launching-fragments-with-fragmentscenario
Tests Fragments in isolation using FragmentScenario and launch methods. Covers fragment-testing artifacts, EmptyFragmentActivity host, theme overrides, FragmentFactory handling, co…
Testing
nightwatchjs-integrated-browser-mobile-testing-framework
Source-backed skill for Nightwatch.js, the Node.js framework for E2E, component, API, accessibility, and mobile testing via the W3C WebDriver stack. Supports repeatable browser con…
Testing
unit-test-philosophy
Risk-based unit testing and Allure-readable behavioral spec style for open agreements. Applies when adding or updating tests, expanding coverage, or reviewing test quality across s…
Testing
vlmkit
Frontend toolkit for VLM-driven visual regression testing, screenshot analysis, markup synthesis, and automated CSS repair loops. Detects visual changes after code edits and suppor…
Testing
evaluation-methodology
Defines PluginEval quality methodology including dimensions, rubrics, statistical methods, and scoring formulas. Supports interpreting scores, improving triggering accuracy, calibr…
Testing
ring:dev-k6-load-testing
Generates k6 load tests following Lerian platform conventions for Palantir Self-Service Testing. Produces product directories, scenario YAMLs, helper clients, and bundleable script…
Testing
choosing-test-rule-vs-runtest
Selects the correct Compose UI test entry point among createComposeRule, createAndroidComposeRule, createEmptyComposeRule, runComposeUiTest, and runAndroidComposeUiTest, plus v1/v2…
Data
tooluniverse-crispr-screen-analysis
Analyzes pooled or arrayed CRISPR screens to identify essential genes, synthetic lethal interactions, and drug targets via sgRNA processing, MAGeCK/BAGEL scoring, quality control, …
Testing
unit-test-wiremock-rest-api
Provides patterns for unit testing external REST APIs using WireMock. Stubs API responses, verifies request details, simulates failures (timeouts, 4xx/5xx errors), and validates HT…
Testing
bmad-qa-automate
Detect the project's test framework and generate API plus E2E tests for existing features, then run them and produce a coverage summary. Covers happy paths and critical errors. Use…
Testing
configuring-junit4-on-android
Sets up a JUnit 4 JVM test suite for an Android module using the standard Gradle dependencies, AndroidJUnit4 runner, InstrumentationRegistry, ApplicationProvider, test size annotat…
Testing
emulate-seed
Create stateful emulation configs for APIs including GitHub, Vercel, Google, Slack, Apple, Microsoft, AWS, Okta, Resend, Stripe, and MongoDB. Provides full state machines rather th…
Security
offensive-fuzzing
Offensive fuzzing methodology covering target selection, fuzzer choice, harness development, corpus curation, mutation strategies, coverage measurement, and crash triage against fi…
Web
web-design-builder
Create and refactor HTML5/JavaScript web designs from specs or descriptions. Produces complete, accessible, responsive designs using modern frameworks and verifies output via autom…
Testing
unit-test-mapper-converter
Provides patterns for unit testing mappers, converters, and bean mappings. Validates entity-to-DTO and model transformation logic in isolation. Generates executable mapping tests w…
Testing
edge-cases
Systematically identifies error states, boundary conditions, and recovery paths using the BOUNDARY method. Surfaces gaps missed by specs. Use after PRDs, before sprint planning, or…
Testing
evaluation
Applies when evaluating agent performance, building test frameworks, measuring agent quality, or creating evaluation rubrics. Covers LLM-as-judge methods, multi-dimensional evaluat…
AI / ML
eval-driven-development
Build language-model-integrated systems by writing evaluations first. Covers statistical eval nature, five primitives, judgment taxonomy, system evals vs benchmarks, and how result…
Showing the top 60 of 2,439.