Pre-built, composable AI skills for every cloud operation. Search by name, connection, or use case — or browse by domain below.
Datadog observability platform for metrics, logs, traces, APM, monitors, dashboards, SLOs, and incident management. Covers infrastructure monitoring, application performance, log analytics, and synthetic monitoring via the Datadog API.
MongoDB database analysis, performance tuning, query optimization, and health monitoring. Covers collection analysis, index recommendations, aggregation pipelines, replica set health, slow query investigation, and Atlas cluster management. Enforces two-phase execution and anti-hallucination rules.
MySQL and MariaDB database analysis, performance tuning, query optimization, and health monitoring. Covers slow query analysis, index effectiveness, table statistics, replication health, InnoDB status, and schema inspection. Enforces two-phase execution and read-only safety rules.
Redis in-memory data store analysis, performance monitoring, memory optimization, and key management. Covers keyspace inspection, slow log analysis, eviction policy review, replication health, cluster status, and connection management. Enforces SCAN-first patterns and safety rules.
PagerDuty incident management, alerting, escalation policies, on-call scheduling, and analytics. Covers active incident overview, on-call rotation queries, service management, MTTR analysis, escalation policy audit, incident creation, and postmortem tracking.
Prometheus metrics monitoring, PromQL query building, alerting rule analysis, and scrape target health. Covers infrastructure metrics, HTTP/APM metrics, TSDB storage analysis, recording rules, Alertmanager integration, and cardinality analysis. Enforces discovery-first metric validation.
ArgoCD GitOps continuous delivery management for Kubernetes. Covers application sync status, deployment health, rollback operations, repository management, cluster registration, RBAC analysis, and application diff review via ArgoCD REST API and CLI.
Sentry error tracking, performance monitoring, release health, and issue management. Covers error investigation, stack trace analysis, release comparison, transaction performance monitoring, alert rule management, and issue triage via Sentry API.
HashiCorp Vault secrets management for reading secrets, auditing access policies, checking seal status, managing leases, reviewing audit logs, and inspecting auth methods. Covers KV secrets engine, PKI, and Vault Enterprise namespaces. Enforces read-only safety and value masking.
Linear project management for engineering teams. Covers issue tracking, sprint cycles, project roadmaps, team velocity, issue search, and workflow automation via the Linear GraphQL API. Use when managing engineering backlogs, analyzing sprint progress, or creating issues.
Snowflake data warehouse analysis, query performance tuning, credit cost optimization, warehouse management, and schema inspection. Covers QUERY_HISTORY analysis, credit consumption, warehouse utilization, storage analysis, and rightsizing opportunities. Enforces two-phase execution and anti-hallucination rules.
New Relic observability platform for APM, infrastructure monitoring, log management, synthetic monitoring, and alerting. Covers NRQL query building, application performance analysis, error investigation, alert management, and service dependency analysis via NerdGraph GraphQL API.
Run a structured pre-deployment and post-deployment safety checklist for any service release. Guides you through readiness gates, migration safety, rollback planning, deployment execution, post-deploy health validation, and stabilization monitoring.
Structured incident response workflow covering detection, triage, investigation, mitigation, resolution, and post-mortem. Includes severity matrix, triage questions, investigation guidance, mitigation decision framework, and post-mortem template for SEV1/SEV2/SEV3 incidents.
Generate a comprehensive cloud cost optimization report by analyzing spending patterns, identifying waste, and recommending savings opportunities across AWS, GCP, or Azure. Covers idle resource detection, rightsizing, reserved instance analysis, spot opportunities, and prioritized action plan.
Create a PagerDuty incident from an RCA finding, monitoring alert, or escalation. Automatically validates service and escalation policy, formats incident details, creates the incident via PagerDuty API, and adds initial context notes for responders.
Post structured deployment notifications to Slack channels. Formats release details into clean Slack block messages for started, completed, failed, and rollback events — including service, version, environment, deployer, key changes, and rollback instructions.
Enrich code reviews with business context by finding the linked issue/ticket from branch name or PR/MR description. Validates that code changes satisfy acceptance criteria and business requirements before reviewing.
Create Jira tickets from incident RCA findings. Automatically generates structured tickets with root cause analysis, severity classification, and remediation steps.
Create structured Confluence incident report pages from RCA findings. Generates comprehensive post-incident documentation with timeline, impact analysis, root cause, and action items.
Post concise incident RCA summaries to Slack channels. Formats findings into a readable Slack message with severity, root cause, impact, and action items.
Create GitLab issues from code review findings. Converts review comments into trackable issues with severity labels, code references, and remediation guidance.
Post structured review checklists as comments on GitHub pull requests. Summarizes code review findings with categorized checks, severity indicators, and actionable feedback.
Cloudflare GraphQL Analytics for zone traffic, firewall events, Workers metrics, and schema exploration. Use when querying Cloudflare analytics data or exploring the GraphQL API.
PostgreSQL database analysis, performance tuning, and health monitoring. You MUST read this entire skill document before executing any PostgreSQL operations — it contains mandatory workflows, safety constraints, and two-phase execution rules that prevent common errors like hallucinated column names and unsafe queries.
SonarQube code quality and security analysis. Use when working with code quality metrics, security hotspots, quality gates, or issue tracking in SonarQube Cloud or Server.
MANDATORY parallel execution patterns (30x speedup), CloudWatch statistics syntax, Cost Explorer aggregation, output token limits, and common pitfalls.
Analyze, break down, and report AWS costs and bills. Covers cost breakdown by service, account, or usage type; monthly/daily billing trends; cost anomaly detection; RI/SP utilization; cost forecasting; credit/discount analysis; and multi-account cost comparison. Uses anti-hallucination rules, mandatory currency/credit detection workflow, and reusable Cost Explorer functions.
Detect unused and idle AWS resources that incur cost without providing value. Covers detached EBS volumes, idle load balancers, unused Elastic IPs, stopped EC2 instances, idle NAT Gateways, old snapshots, and unused ENIs. Includes estimated monthly waste per resource and anti-hallucination rules for safe detection.
AWS pricing helper for cost queries. ALWAYS use the get_aws_cost script for pricing questions.
Analyze EC2, RDS, EBS, and Lambda resource utilization to identify right-sizing opportunities. Uses CloudWatch metrics with anti-hallucination rules for burstable instances, memory metrics, peak vs average analysis, and estimated monthly savings calculations.
MANDATORY parallel execution patterns (30x speedup), monitor metrics aggregation, --output tsv formatting, and common pitfalls.
REQUIRED helper functions (never raw curl), workspace/repo_slug patterns, and suggestion block syntax for code review.
Cost anti-hallucination rules, MANDATORY parallel execution patterns (30x speedup), monitoring aligners, reusable billing/pricing scripts, VAT/tax handling, and filtering/pagination.
Detect unused and idle GCP resources that incur cost without providing value.
Analyze Compute Engine VM, Cloud SQL, Persistent Disk, and serverless (Cloud Functions/Cloud Run) utilization to identify right-sizing opportunities. Uses Cloud Monitoring metrics with anti-hallucination rules for E2 shared-core instances, sole-tenant nodes, preemptible/spot VMs, SUD eligibility, peak vs average analysis, CUD-aware savings, and estimated monthly savings calculations.
REQUIRED --repo flag for all commands, helper functions for code review comments, and suggestion block syntax rules.
REQUIRED helper functions (never raw curl), project_id patterns, and suggestion block syntax with offset notation.
MANDATORY parallel execution patterns, cluster overview script, -o json with jq filtering, batch operations, and common pitfalls.
HubSpot CRM management: search and retrieve contacts, companies, deals, tickets, and other CRM objects. Use when working with HubSpot data, analyzing sales pipelines, or querying customer information.
Notion workspace management: pages, databases, blocks, and content. Use when searching, creating, updating, or organizing Notion content.
Supabase project management: databases, Edge Functions, branches, and migrations. Use when working with Supabase infrastructure, executing SQL, or deploying serverless functions.
Dynatrace observability, problem management, and DQL queries. Use when working with Dynatrace problems, vulnerabilities, entities, logs, metrics, spans, or events.
Elasticsearch cluster monitoring, log analytics, and search optimization. Use when working with Elasticsearch indices, mappings, DSL queries, ES|QL analytics, or shard health.
Grafana monitoring, visualization, and alerting. Use when working with Grafana datasources, Prometheus metrics, Loki logs, dashboards, or alerts.
Confluence page management, space administration, and content collaboration. Use when working with Confluence pages, spaces, comments, or Atlassian connections.
Jira issue tracking, project management, and workflow automation. Use when working with Jira issues, projects, sprints, or Atlassian connections.
Zabbix monitoring platform CLI. Use when working with Zabbix hosts, templates, triggers, events, logs, maintenance, dashboards, or configuration export/import.
Google AlloyDB instance analysis, query insights, columnar engine optimization, maintenance windows, and cluster health. You MUST read this skill before executing any AlloyDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Aqua Security platform analysis. Covers container runtime protection, image assurance policies, compliance frameworks, vulnerability management, workload protection, and registry scanning. Use when analyzing container security posture, reviewing image compliance, investigating runtime alerts, or ...
Google BigQuery job analysis, slot utilization, cost analysis, dataset management, and query optimization. You MUST read this skill before executing any BigQuery operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Apache Cassandra keyspace analysis, compaction strategies, repair status, nodetool operations, and cluster health monitoring. You MUST read this skill before executing any Cassandra operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Checkov infrastructure-as-code security scanning. Covers Terraform, CloudFormation, Kubernetes, and Dockerfile scanning, policy management, custom checks, compliance frameworks, and suppression management. Use when scanning IaC for security misconfigurations, evaluating compliance, managing custo...
ClickHouse table analysis, MergeTree optimization, query performance tuning, parts management, and cluster health. You MUST read this skill before executing any ClickHouse operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
CockroachDB cluster health, range distribution, SQL statistics, schema change monitoring, and query optimization. You MUST read this skill before executing any CockroachDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Couchbase bucket analysis, index advisor, N1QL query performance, XDCR status, and cluster health monitoring. You MUST read this skill before executing any Couchbase operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Dependabot and Renovate dependency update management. Covers vulnerability alerts, dependency update PRs, auto-merge configuration, version pinning, update scheduling, and security advisory tracking. Use when managing dependency updates, reviewing vulnerability alerts, configuring auto-merge poli...
Apache Druid datasource analysis, ingestion task management, supervisor status, segment management, and query performance. You MUST read this skill before executing any Druid operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Amazon DynamoDB table analysis, capacity mode evaluation, GSI/LSI usage, item access patterns, and cost optimization. You MUST read this skill before executing any DynamoDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Google Cloud Firestore collection analysis, index management, security rules review, usage metrics, and query optimization. You MUST read this skill before executing any Firestore operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
InfluxDB bucket management, Flux query analysis, task management, retention policies, and performance monitoring. You MUST read this skill before executing any InfluxDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Kubescape Kubernetes security posture analysis. Covers NSA/CISA framework assessment, MITRE ATT&CK mapping, CIS Kubernetes benchmarks, workload scanning, RBAC analysis, and network policy evaluation. Use when assessing Kubernetes cluster security, evaluating compliance frameworks, or analyzing wo...
MariaDB Galera cluster status, MaxScale routing, ColumnStore engine analysis, performance schema tuning, and replication health. You MUST read this skill before executing any MariaDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Neo4j graph database analysis, index management, Cypher query optimization, APOC utilities, and cluster health monitoring. You MUST read this skill before executing any Neo4j operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
PlanetScale branch management, deploy requests, schema analysis, query insights, and database health. You MUST read this skill before executing any PlanetScale operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Prowler cloud security assessment tool. Covers AWS, Azure, and GCP security posture assessment, CIS benchmark evaluation, compliance frameworks (PCI-DSS, HIPAA, SOC2), multi-account scanning, and remediation guidance. Use when assessing cloud security posture, running compliance audits, or invest...
Amazon Redshift query performance, WLM configuration, table design analysis, vacuum/analyze status, and cluster health. You MUST read this skill before executing any Redshift operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Semgrep static application security testing (SAST). Covers code scanning, custom rule creation, CI integration, autofix capabilities, multi-language support, and vulnerability triage. Use when scanning source code for security vulnerabilities, creating custom detection rules, or integrating SAST ...
SingleStore (MemSQL) workspace management, pipeline status, query tuning, memory analysis, and cluster health. You MUST read this skill before executing any SingleStore operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Snyk security scanning and vulnerability analysis. Covers dependency vulnerability scanning, container image scanning, IaC security scanning, code analysis, license compliance, and fix recommendations. Use when scanning for vulnerabilities, analyzing dependency risks, reviewing container security...
tfsec Terraform security scanning. Covers static analysis of Terraform code, custom rule creation, CI integration, severity filtering, module scanning, and remediation guidance. Use when scanning Terraform configurations for security issues, creating custom security rules, or integrating security...
TiDB cluster topology, slow query analysis, hot region analysis, dashboard monitoring, and SQL optimization. You MUST read this skill before executing any TiDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
TimescaleDB hypertable analysis, chunk management, continuous aggregates, compression policies, and time-series optimization. You MUST read this skill before executing any TimescaleDB operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Trino (Presto) query analysis, catalog management, worker health, memory allocation, and query optimization. You MUST read this skill before executing any Trino operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Trivy comprehensive security scanner. Covers container image scanning, filesystem scanning, Kubernetes cluster scanning, SBOM generation, secret detection, license scanning, and misconfiguration detection. Use when scanning containers for vulnerabilities, generating SBOMs, scanning K8s clusters, ...
Vitess tablet health, VSchema management, resharding status, VReplication workflows, and cluster topology. You MUST read this skill before executing any Vitess operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Wiz cloud security posture management. Covers cloud security posture analysis, vulnerability prioritization, attack path analysis, compliance assessment, container security, and resource inventory. Use when assessing cloud security posture, investigating attack paths, prioritizing vulnerabilities...
AWS API Gateway REST and HTTP API management, stage analysis, usage plans, throttling configuration, and deployment tracking. Covers API inventory, endpoint metrics, latency analysis, authorization configuration, and integration health.
AWS Athena query execution analysis, workgroup management, Data Catalog integration, and cost-per-query tracking. Covers query history, performance optimization, workgroup quota management, saved queries, and data scanned analysis.
AWS Batch compute environment management, job queue analysis, scheduling policy review, and array job monitoring. Covers compute resource utilization, job status tracking, failure investigation, Fargate vs EC2 comparison, and queue priority analysis.
AWS CloudFront distribution analysis, cache hit ratio monitoring, origin health checks, invalidation management, and performance optimization. Covers distribution inventory, behavior configuration, SSL certificate status, geo-restriction, and real-time metrics.
AWS CloudTrail event analysis, trail management, insight event investigation, and organization trail configuration. Covers API activity analysis, security event investigation, resource change tracking, unauthorized access detection, and event history querying.
AWS CloudWatch Logs group management, Logs Insights query execution, metric filter analysis, retention policy review, and subscription filter management. Covers log group inventory, storage cost optimization, query patterns, and cross-account log aggregation.
AWS Cognito user pool analysis, identity pool management, authentication flow analysis, and MFA status tracking. Covers user statistics, app client configuration, password policy review, Lambda trigger inspection, and federation setup.
AWS Config compliance dashboard, rule evaluation status, conformance pack management, and resource timeline analysis. Covers configuration recorder status, compliance summary, non-compliant resource investigation, remediation tracking, and aggregator management.
AWS ECS cluster health, service status, task analysis, capacity provider management, and deployment tracking. Covers service stability, task failure investigation, container insights, resource utilization, and scaling analysis.
AWS EKS cluster management, nodegroup status, addon management, IRSA configuration, and cluster health analysis. Covers Kubernetes version tracking, nodegroup scaling, addon compatibility, IAM role mapping, and control plane logging.
AWS ElastiCache deep analysis for Redis and Memcached clusters, replication health, failover analysis, and performance metrics. Covers node-level metrics, memory utilization, cache hit rates, eviction tracking, connection analysis, and engine-specific diagnostics.
AWS EventBridge event bus management, rule analysis, target health monitoring, and schema registry exploration. Covers event pattern matching, rule invocation metrics, dead-letter queue analysis, cross-account event routing, and event replay.
AWS Glue crawler management, ETL job run analysis, Data Catalog exploration, and schema registry management. Covers crawler schedules, job performance metrics, database and table inventory, partition analysis, and connection health.
AWS GuardDuty finding analysis, detector management, suppression rule review, and member account oversight. Covers threat detection summary, finding severity distribution, finding type breakdown, trusted IP lists, and organization-wide security posture.
AWS Lambda function analysis, invocation metrics, cold start analysis, layer management, and concurrency tracking. Covers function configuration review, error rate analysis, duration percentiles, provisioned concurrency utilization, and deployment package optimization.
AWS Organizations account management, OU structure analysis, SCP policy review, and service access control. Covers organizational hierarchy, account inventory, policy inheritance analysis, delegated administrator status, and service control boundary assessment.
AWS RDS deep analysis covering Performance Insights, event subscriptions, proxy management, and global database status. Covers PI counter metrics, wait event analysis, top SQL queries, read replica lag, storage autoscaling, and Aurora-specific diagnostics.
AWS S3 bucket analysis, storage class distribution, lifecycle policies, access patterns, and cost optimization. Covers bucket inventory, object metrics, versioning status, encryption audit, public access analysis, and S3 Intelligent-Tiering evaluation.
AWS Secrets Manager secret rotation status, access analysis, cost tracking, and lifecycle management. Covers secret inventory, rotation configuration audit, last access tracking, resource policy review, and version management.
AWS SES sending statistics, bounce and complaint rate analysis, identity management, configuration set monitoring, and deliverability tracking. Covers sending quota utilization, suppression list management, DKIM/SPF status, and reputation dashboard metrics.
AWS Step Functions execution analysis, state machine management, error tracking, and performance optimization. Covers execution history, failure investigation, state transition metrics, Express vs Standard comparison, and workflow visualization.
AWS Systems Manager Parameter Store management, session management, patch compliance tracking, and Run Command analysis. Covers parameter inventory, managed instance status, automation execution, maintenance windows, and inventory collection.
AWS Transit Gateway attachment analysis, route table management, peering configuration, and network topology review. Covers TGW inventory, attachment health, route propagation, VPN connection status, and bandwidth utilization.
AWS WAF web ACL management, rule analysis, traffic metrics, and IP set management. Covers WAF rule group inspection, rate-based rule configuration, managed rule group analysis, logging status, and blocked request investigation.
AWS X-Ray trace analysis, service map generation, fault and error analysis, sampling rule management, and latency investigation. Covers trace summaries, segment analysis, annotation-based filtering, and group configuration.
Azure Kubernetes Service (AKS) cluster management, node pool status, addon management, upgrade planning, and health diagnostics via Azure CLI.
Application Insights request performance, dependency tracking, availability tests, smart detection, and failure analysis via Azure CLI and Log Analytics queries.
Azure App Service web app health monitoring, deployment slot management, scaling configuration, and application settings analysis via Azure CLI.
Azure Blob Storage container analysis, access tier management, lifecycle policies, replication configuration, and storage account health via Azure CLI.
Azure Container Apps revision management, scaling rules, Dapr integration, ingress configuration, and environment health via Azure CLI.
Azure Cosmos DB throughput analysis, partition key distribution, consistency levels, metrics monitoring, and database management via Azure CLI.
Azure Cost Management cost analysis, budget tracking, Advisor cost recommendations, export management, and spending trend analysis via Azure CLI.
Azure DevOps Boards work item tracking, sprint analysis, burndown metrics, team velocity, and backlog management via Azure DevOps CLI extension.
Azure Front Door routing rules, WAF policy management, health probe configuration, cache management, and CDN analytics via Azure CLI.
Azure Functions app analysis, execution metrics, consumption plan monitoring, scaling configuration, and deployment management via Azure CLI.
Azure Key Vault secret, key, and certificate management, access policy auditing, rotation status, and security configuration via Azure CLI.
Azure Logic Apps workflow run analysis, trigger history, connector management, error diagnostics, and workflow definition inspection via Azure CLI.
Azure Monitor metrics querying, Log Analytics workspace management, alert rule configuration, action groups, and diagnostic settings via Azure CLI.
Azure Policy compliance assessment, policy and initiative assignment management, remediation task tracking, and definition analysis via Azure CLI.
Azure SQL Database DTU/vCore analysis, query performance insights, elastic pool management, geo-replication status, and database health monitoring via Azure CLI.
Google AlloyDB cluster management, instance analysis, query insights, maintenance window configuration, and performance diagnostics via gcloud CLI.
Google Artifact Registry repository management, vulnerability scanning analysis, cleanup policy configuration, and artifact lifecycle management via gcloud CLI.
Google BigQuery job analysis, slot utilization, storage optimization, materialized views, BI Engine capacity, and query performance diagnostics via bq CLI and gcloud.
Google Cloud Armor security policy management, WAF rule configuration, adaptive protection analysis, and edge security policy management via gcloud CLI.
Google Cloud Build trigger management, build history analysis, worker pool configuration, artifact management, and CI/CD pipeline diagnostics via gcloud CLI.
Google Cloud Deploy delivery pipeline management, release tracking, rollout status monitoring, approval workflows, and deployment diagnostics via gcloud CLI.
Google Cloud Functions management, execution analysis, scaling configuration, event trigger inspection, and runtime diagnostics via gcloud CLI.
Google Cloud Logging log analysis, log-based metrics, sink management, exclusion filters, and log routing configuration via gcloud CLI.
Google Cloud Run service management, revision traffic splitting, scaling configuration, concurrency tuning, and container diagnostics via gcloud CLI.
Google Cloud Spanner instance management, query statistics analysis, hot spot detection, schema analysis, and performance diagnostics via gcloud CLI.
Google Cloud Storage bucket analysis, lifecycle rule management, access control configuration, transfer service operations, and storage class optimization via gsutil and gcloud CLI.
Google Cloud Trace latency analysis, trace exploration, sampling configuration, and span diagnostics via gcloud CLI.
Google Cloud Firestore document and collection analysis, index management, security rules review, usage monitoring, and performance diagnostics via gcloud CLI.
Google Kubernetes Engine cluster operations, node pool management, workload analysis, Autopilot configuration, and upgrade planning via gcloud CLI.
Google Secret Manager secret lifecycle management, version control, rotation configuration, IAM policy auditing, and access diagnostics via gcloud CLI.
ActiveMQ broker health, destination management, advisory topic monitoring, network connector status, and message flow analysis. You MUST read this skill before executing any ActiveMQ operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Ansible automation and configuration management. Covers playbook execution, inventory management, role management, vault secrets, ad-hoc commands, fact gathering, and task debugging. Use when running playbooks, managing inventories, debugging task failures, or auditing Ansible configurations.
Argo Workflows management for Kubernetes-native workflow orchestration. Covers workflow templates, cron workflow scheduling, artifact management, workflow execution, parameter handling, and resource monitoring. Use when checking workflow status, investigating step failures, managing templates, or...
Atlantis pull request automation for Terraform. Covers plan/apply via PR comments, workspace management, project configuration, server status, lock management, and repository configuration. Use when managing Atlantis deployments, debugging PR plan/apply issues, or configuring project workflows.
AWS CDK infrastructure-as-code management. Covers synth, diff, deploy, bootstrap, context values, construct inspection, and stack dependencies. Use when managing CDK applications, reviewing synthesized templates, comparing deployments, or debugging construct issues.
AWS Kinesis stream management, shard analysis, consumer lag monitoring, enhanced fan-out configuration, and throughput analysis. You MUST read this skill before executing any Kinesis operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
AWS SQS queue metrics, dead-letter queue management, SNS subscription management, message attributes, and queue health monitoring. You MUST read this skill before executing any SQS/SNS operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Azure DevOps comprehensive management. Covers pipelines, boards, repositories, artifacts, test plans, work item tracking, and release management. Use when checking pipeline status, managing work items, investigating build failures, or auditing Azure DevOps project configurations.
Azure Event Hubs partition analysis, checkpoint management, capture status, consumer group monitoring, and throughput analysis. You MUST read this skill before executing any Event Hubs operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Better Stack (formerly Logtail) for log search, uptime monitoring, incident management, on-call scheduling, and status page management. Covers log querying, monitor configuration, heartbeat checks, escalation policies, and team management. Use when searching logs, managing uptime monitors, handli...
Buildkite CI/CD pipeline and agent management. Covers build pipeline status, agent pool health, artifact management, build annotations, and cluster queue monitoring. Use when checking build status, investigating failures, managing agents, or reviewing pipeline configurations.
Calico networking and security management for Kubernetes. Covers network policy enforcement, BGP configuration, IPAM management, Felix agent status, Typha health, WireGuard encryption, and global network sets. Use when managing Calico network policies, debugging routing issues, configuring BGP pe...
Celery worker management, task monitoring, queue routing, Flower dashboard analysis, and task result tracking. You MUST read this skill before executing any Celery operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
cert-manager certificate lifecycle management. Covers certificate issuance, issuer configuration, ACME challenges, certificate renewal, troubleshooting failed issuance, and trust management. Use when managing TLS certificates in Kubernetes, debugging issuance failures, configuring ACME providers,...
Chef infrastructure automation and configuration management. Covers cookbook management, node configuration, compliance profiles, data bag management, role/environment administration, and Chef InSpec auditing. Use when managing Chef infrastructure, debugging convergence failures, inspecting node ...
Cilium eBPF-based networking and security management for Kubernetes. Covers network policy enforcement, eBPF map inspection, Hubble flow visibility, cluster mesh, endpoint health, service load balancing, and identity management. Use when managing Cilium network policies, debugging connectivity, m...
CircleCI pipeline and workflow management. Covers pipeline status, workflow analysis, job logs, credit usage monitoring, orb management, and project configuration. Use when checking CI status, investigating build failures, analyzing credit consumption, or managing CircleCI orbs.
AWS CloudFormation stack management. Covers stack lifecycle, change sets, drift detection, template validation, nested stacks, stack sets, and event troubleshooting. Use when managing CloudFormation stacks, investigating deployment failures, detecting drift, or validating templates.
HashiCorp Consul service discovery and mesh management. Covers service registration, health checks, KV store operations, intentions (service-to-service access rules), Connect service mesh, datacenter federation, and prepared queries. Use when managing service discovery, debugging health checks, configuring service mesh int...
Crossplane Kubernetes-native infrastructure management. Covers managed resources, compositions, provider configurations, composite resource definitions (XRDs), claims, and provider health. Use when managing Crossplane resources, debugging provisioning failures, inspecting compositions, or auditin...
Docker container lifecycle and image management. Covers container operations, image builds, volume/network inspection, Docker Compose orchestration, resource usage monitoring, and registry management. Use when managing containers, debugging container issues, inspecting images, or orchestrating mu...
Drone CI pipeline execution and management. Covers pipeline status, build logs, secret management, repository activation, cron scheduling, and runner monitoring. Use when checking build status, investigating pipeline failures, managing secrets, or configuring Drone CI repositories.
Amazon ElastiCache cluster health, node status, replication group management, parameter group analysis, and performance monitoring. You MUST read this skill before executing any ElastiCache operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
env0 environment management platform. Covers environment lifecycle, template deployment, cost tracking, policy management, variable configuration, and approval workflows. Use when managing env0 environments, tracking infrastructure costs, reviewing deployment history, or configuring deployment te...
Envoy Proxy configuration and health management. Covers listener configuration, cluster health checks, route table inspection, admin interface operations, stats monitoring, and config dump analysis. Use when debugging Envoy proxy configuration, inspecting upstream cluster health, analyzing route ...
External Secrets Operator management. Covers secret syncing from external providers, SecretStore configuration, ExternalSecret resource management, push secrets, ClusterSecretStore setup, and provider troubleshooting. Use when managing secret synchronization between external vaults and Kubernetes...
Fluentd log collector management with plugin status, buffer monitoring, routing rule analysis, match/filter configuration, and input health. Covers pipeline analysis, buffer overflow detection, retry monitoring, and configuration validation. Use when managing Fluentd plugins, analyzing buffer hea...
Flux CD GitOps management for Kubernetes. Covers source reconciliation, Kustomization status, HelmRelease management, image automation, notification configuration, and drift detection. Use when checking GitOps sync status, investigating reconciliation failures, managing Flux sources, or auditing ...
GitHub Actions workflow and runner management. Covers workflow run status, job analysis, usage billing, secret management, runner administration, and artifact retrieval. Use when checking CI status, investigating workflow failures, managing self-hosted runners, or auditing Actions usage costs.
GitLab CI/CD pipeline and runner management. Covers pipeline status, job logs, runner administration, artifact management, environment deployments, and merge request pipelines. Use when checking CI status, investigating job failures, managing runners, or auditing deployment history.
Google Pub/Sub topic management, subscription health, dead letter policies, schema management, and message flow analysis. You MUST read this skill before executing any Pub/Sub operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Harness continuous delivery platform management. Covers deployment pipelines, service management, environment configuration, connector health, infrastructure definitions, and execution history. Use when checking deployment status, investigating pipeline failures, managing services, or auditing Ha...
Helm chart management for Kubernetes deployments. Covers chart installation, upgrade, rollback, release history, repository management, template rendering, dependency resolution, and values inspection. Use when managing Helm releases, debugging chart issues, or reviewing deployment configurations.
Istio service mesh management for Kubernetes. Covers virtual services, destination rules, gateway configuration, traffic management, mTLS policies, sidecar injection, telemetry, and fault injection. Use when managing service mesh traffic routing, debugging connectivity issues, reviewing mTLS stat...
Jenkins CI/CD pipeline management and monitoring. Covers pipeline execution, build analysis, agent health monitoring, plugin management, queue inspection, and credential auditing. Use when checking build status, investigating failures, managing Jenkins agents, or auditing plugin configurations.
Apache Kafka topic management, consumer group monitoring, partition analysis, broker health, and lag monitoring. You MUST read this skill before executing any Kafka operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Kustomize Kubernetes manifest management. Covers overlay composition, base management, resource generation, patch strategies, variable substitution, and build validation. Use when managing Kustomize overlays, debugging manifest generation, reviewing patch strategies, or validating Kustomize builds.
Kyverno Kubernetes policy management. Covers policy creation, validation rules, mutation rules, generation rules, policy reports, exception management, and compliance auditing. Use when managing Kubernetes admission policies, debugging policy violations, reviewing policy reports, or configuring r...
Linkerd service mesh management for Kubernetes. Covers proxy injection, traffic splitting, service profiles, tap/top monitoring, mTLS verification, health checks, and dashboard access. Use when managing Linkerd mesh, debugging service communication, configuring traffic splits, or monitoring mesh ...
Logstash pipeline management with input/filter/output analysis, pipeline statistics, event processing metrics, and queue monitoring. Covers pipeline health, plugin performance, JVM stats, hot threads analysis, and configuration review. Use when managing Logstash pipelines, analyzing throughput, r...
NATS subject management, JetStream stream configuration, consumer status, cluster health, and message flow analysis. You MUST read this skill before executing any NATS operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
HashiCorp Nomad workload orchestrator management. Covers job submission, allocation status, deployment health, client/server status, namespace management, and resource utilization. Use when managing Nomad jobs, debugging allocation failures, monitoring deployments, or inspecting cluster health.
Octopus Deploy management for deployment automation. Covers deployment projects, environments, variable sets, tenant configuration, release management, and runbook execution. Use when checking deployment status, investigating release failures, managing environments, or auditing Octopus Deploy con...
OPA/Gatekeeper policy engine management. Covers Rego policy authoring, constraint templates, constraint management, audit results, data queries, and policy testing. Use when managing Kubernetes admission policies with Gatekeeper, writing Rego rules, debugging policy violations, or querying OPA data.
Red Hat OpenShift container platform management. Covers project management, build configurations, routes, deployment configs, operators, image streams, and cluster administration. Use when managing OpenShift projects, debugging builds, configuring routes, or working with operators and image streams.
OpsGenie alert management, on-call schedule queries, escalation policy configuration, integration management, and incident response. Covers alert lifecycle, team management, routing rules, maintenance windows, and notification policies. Use when managing alerts, reviewing on-call schedules, confi...
Packer machine image building. Covers image builds, template validation, variable management, post-processor configuration, build debugging, and multi-builder pipelines. Use when building machine images, validating templates, debugging build failures, or managing image pipelines.
PagerDuty Events API v2 for event routing, change events, alert grouping, and incident triggering. Covers trigger/acknowledge/resolve events, change tracking, alert deduplication, custom event transformations, and integration key management. Use when sending events to PagerDuty, managing alert gr...
Podman rootless container and pod management. Covers container lifecycle, pod orchestration, image building, systemd integration, rootless networking, and container migration from Docker. Use when managing Podman containers, creating pods, generating systemd units, or building images without a da...
Apache Pulsar tenant management, namespace policies, topic statistics, subscription lag, and cluster health. You MUST read this skill before executing any Pulsar operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Pulumi infrastructure-as-code management. Covers stack management, preview/update workflows, configuration and secrets, state inspection, resource history, and policy packs. Use when managing Pulumi stacks, investigating deployment failures, managing secrets, or auditing infrastructure resources.
Puppet configuration management. Covers catalog compilation, node management, module installation, report analysis, fact inspection, environment management, and resource auditing. Use when managing Puppet infrastructure, debugging catalog failures, inspecting node states, or auditing configuratio...
RabbitMQ queue management, exchange bindings, connection health, shovel and federation status, and cluster monitoring. You MUST read this skill before executing any RabbitMQ operations — it contains mandatory two-phase execution, anti-hallucination rules, and safety constraints.
Rancher multi-cluster Kubernetes management. Covers cluster provisioning, project/namespace management, catalog apps, workload monitoring, RBAC, and fleet management. Use when managing Rancher clusters, deploying catalog applications, configuring projects, or monitoring multi-cluster workloads.
SaltStack configuration management and remote execution. Covers state management, grain data, pillar data, job management, targeting, event system, and orchestration. Use when managing Salt infrastructure, executing remote commands, debugging state failures, or inspecting minion data.
Skaffold continuous development workflow management for Kubernetes. Covers dev loop configuration, build/deploy profiles, debug mode, file sync, render pipeline, and multi-module projects. Use when managing Skaffold development workflows, debugging build failures, configuring deploy profiles, or ...
Spacelift IaC management platform. Covers stack management, run history, policy evaluation, drift detection, module registry, and worker pool monitoring. Use when managing Spacelift stacks, investigating run failures, reviewing policies, or auditing infrastructure deployments.
Spinnaker continuous delivery and infrastructure management. Covers application deployment, pipeline management, infrastructure views, server group operations, load balancer configuration, and canary analysis. Use when checking deployment status, managing pipelines, investigating rollback scenari...
Statuspage.io component management, incident lifecycle, scheduled maintenance, metric publishing, and subscriber notifications. Covers component status updates, incident creation and resolution, uptime metrics, and page configuration. Use when managing status page components, creating incidents, ...
TeamCity CI/CD server management. Covers build configurations, agent pools, VCS roots, build queue monitoring, project hierarchy, and build chain analysis. Use when checking build status, investigating failures, managing agents, or auditing TeamCity project configurations.
Tekton Pipelines management for Kubernetes-native CI/CD. Covers pipeline runs, task management, trigger bindings, event listeners, and workspace configuration. Use when checking pipeline execution, investigating task failures, managing triggers, or auditing Tekton resources in a Kubernetes cluster.
Terraform infrastructure-as-code management. Covers state management, plan/apply workflows, module inspection, drift detection, workspace management, import operations, and provider debugging. Use when managing infrastructure state, investigating plan failures, detecting configuration drift, or a...
Terragrunt Terraform wrapper management. Covers run-all operations, dependency management, configuration generation, input/output passing, remote state configuration, and multi-environment workflows. Use when managing Terragrunt configurations, running cross-module operations, debugging dependenc...
Traefik reverse proxy and load balancer management. Covers entrypoint configuration, router rules, middleware chains, provider discovery, TLS management, dashboard access, and metrics monitoring. Use when managing Traefik routing, debugging middleware chains, configuring TLS, or monitoring proxy ...
Travis CI build management and monitoring. Covers build history, repository settings, cron job management, cache administration, and environment variable configuration. Use when checking build status, investigating failures, managing cron schedules, or auditing Travis CI repository settings.
Vagrant development environment management. Covers box management, VM lifecycle, provisioner execution, multi-machine environments, snapshot management, and networking configuration. Use when managing Vagrant VMs, debugging provisioning failures, or inspecting development environment configurations.
AppDynamics application performance monitoring with flow maps, business transaction analysis, tier health, analytics queries, and baseline management. Covers application topology, node health, error analysis, metric browsing, and health rule violations. Use when analyzing application performance,...
Axiom observability platform with dataset management, APL (Axiom Processing Language) queries, monitors, dashboards, and virtual fields. Covers log and event ingestion analysis, dataset statistics, alerting rules, annotation management, and data retention. Use when querying datasets with APL, man...
Falco runtime threat detection and monitoring. Covers rule management, alert analysis, system call monitoring, Kubernetes audit log analysis, custom rule creation, and output channel configuration. Use when investigating runtime security alerts, managing detection rules, analyzing suspicious acti...
Instana application performance monitoring with infrastructure discovery, service endpoint analysis, incident management, smart alert configuration, and dependency mapping. Covers automatic topology, call analysis, trace grouping, SLI/SLO tracking, and infrastructure health. Use when analyzing se...
Jaeger distributed tracing for trace search, service dependency analysis, span analysis, sampling configuration, and performance monitoring. Covers trace lookup, service topology, operation latency analysis, and comparison workflows. Use when searching traces, analyzing service dependencies, inve...
Kibana visualization platform with space management, saved objects, index patterns, dashboard export, and lens analysis. Covers dashboard management, visualization listing, data view configuration, alerting rules, and reporting. Use when managing Kibana spaces, exporting dashboards, reviewing ind...
Grafana Loki log aggregation with LogQL queries, label management, tenant analysis, ingestion health, and ruler configuration. Covers log stream queries, metric queries from logs, alerting rules, and storage analysis. Use when querying logs via LogQL, analyzing label cardinality, managing alert r...
OpenTelemetry collector management, pipeline configuration, exporter health, instrumentation analysis, and receiver status. Covers collector metrics, pipeline topology, processor performance, batch/queue monitoring, and SDK configuration review. Use when managing OTel collectors, analyzing pipeli...
Pingdom uptime monitoring with check management, transaction checks, page speed analysis, alerting configuration, and performance reporting. Covers HTTP/TCP/UDP checks, real user monitoring, response time analysis, outage history, and contact management. Use when managing uptime checks, analyzing...
Splunk platform for log analytics, SPL queries, index management, saved searches, alerts, and dashboard analysis. Covers search job management, data model acceleration, KV store lookups, and forwarder health. Use when running SPL queries, investigating alerts, analyzing indexes, or managing Splun...
Sumo Logic cloud-native log analytics, metrics queries, monitors, dashboards, and content management. Covers log search, metrics exploration, alerting rules, folder/content management, and data ingestion health. Use when searching logs, querying metrics, managing monitors, or analyzing Sumo Logic...
Grafana Tempo distributed tracing with TraceQL queries, service graphs, span metrics, and trace analysis. Covers trace search, span attribute filtering, service topology, trace-to-metrics correlation, and backend health monitoring. Use when querying traces via TraceQL, analyzing service graphs, i...
Thanos long-term Prometheus storage with store gateway health, compactor status, query frontend analysis, sidecar management, and ruler evaluation. Covers PromQL queries via Thanos Query, block management, deduplication, downsampling status, and multi-cluster federation. Use when querying Thanos ...
Uptime Kuma self-hosted monitoring with monitor management, notification channels, status pages, heartbeat analysis, and maintenance windows. Covers HTTP/TCP/DNS/ping monitors, alert configuration, response time tracking, and uptime statistics. Use when managing monitors, analyzing uptime, config...
VictoriaMetrics time-series database with MetricsQL queries, vmstorage health, vmselect performance, retention management, and cluster monitoring. Covers metric ingestion, cardinality analysis, query optimization, and multi-tenant operations. Use when querying metrics via MetricsQL, analyzing sto...
Architecture Decision Record (ADR) template covering context, decision drivers, considered options, decision outcome, consequences, and status tracking. Based on the MADR format. Use for documenting significant technical decisions with full rationale.
Blameless postmortem template covering incident timeline, contributing factors analysis, impact assessment, what went well, what could be improved, and concrete action items. Focuses on systemic improvements over individual blame. Use after any SEV1/SEV2 incident.
Blue-green deployment template covering environment preparation, parallel stack deployment, traffic switching, validation gates, and cleanup. Use for zero-downtime releases with instant rollback capability.
Canary deployment template covering traffic percentage ramp-up, metrics gates at each stage, automated promotion criteria, and rollback triggers. Use for gradual, risk-controlled production releases with real traffic validation.
Capacity planning workflow covering traffic forecasting, resource utilization analysis, scaling strategy, cost projections, and bottleneck identification. Use for quarterly capacity reviews, pre-launch planning, or scaling readiness assessments.
Chaos engineering experiment template covering hypothesis definition, blast radius containment, experiment execution, observation, and learning. Supports Chaos Monkey, Litmus, Gremlin, and manual fault injection. Use for resilience validation and failure mode discovery.
CIS Benchmark assessment for AWS covering identity and access management, logging, monitoring, networking, and storage controls. Based on CIS AWS Foundations Benchmark v3.0. Use for security baseline validation or hardening projects.
CIS Benchmark assessment for Kubernetes covering control plane configuration, worker node security, RBAC policies, pod security, network policies, and secrets management. Based on CIS Kubernetes Benchmark v1.8. Use for cluster hardening or compliance validation.
Cloud migration readiness assessment covering application portfolio analysis, 6R migration strategy classification, dependency mapping, wave planning, risk assessment, and cost modeling. Use for planning data center exits, cloud-first transformations, or workload repatriation decisions.
Database migration safety checklist covering pre-migration validation, backup verification, compatibility checks, migration execution, data validation, rollback procedures, and post-migration monitoring. Use for schema changes, engine migrations, or data platform transitions.
Disaster recovery planning template covering RTO/RPO definitions, failover procedures, communication plans, testing schedules, and recovery validation. Use for establishing DR strategy, documenting runbooks, or preparing for DR audits.
DNS cutover procedure covering pre-checks, TTL reduction, cutover execution, validation, and rollback. Use for domain migrations, CDN changes, load balancer swaps, or any DNS-dependent infrastructure transition.
FinOps maturity review covering crawl/walk/run phases across cost visibility, optimization, governance, and organizational alignment. Based on the FinOps Foundation framework. Use for establishing FinOps practice, benchmarking maturity, or planning capability improvements.
GDPR data audit covering personal data mapping, lawful basis assessment, consent management, data retention policies, DSAR processes, and cross-border transfer compliance. Use for annual data audits, new system assessments, or regulatory preparation.
HIPAA compliance review covering PHI handling, encryption requirements, access controls, audit logging, Business Associate Agreements, and breach notification readiness. Use for healthcare application assessments, vendor onboarding, or annual compliance reviews.
Kubernetes cluster upgrade runbook covering pre-upgrade checks, API deprecation review, node upgrade procedure, validation tests, and rollback steps. Supports EKS, GKE, AKS, and self-managed clusters. Use for minor/major version upgrades.
Load testing plan template covering test scenario design, baseline capture, execution configuration, results analysis, and reporting. Supports k6, Locust, Gatling, and JMeter approaches. Use for capacity validation, performance regression testing, or pre-launch load testing.
Multi-region deployment template covering region selection, data replication strategy, traffic routing, failover configuration, consistency models, and operational procedures. Use for expanding to new regions, designing active-active architectures, or implementing global redundancy.
On-call health assessment covering page volume analysis, MTTA/MTTR measurement, toil identification, alert quality review, burnout indicators, and improvement recommendations. Use for quarterly on-call reviews or team health retrospectives.
PCI DSS v4.0 compliance assessment covering cardholder data protection, network segmentation, vulnerability management, access control, and monitoring. Use for self-assessment questionnaires (SAQ), audit preparation, or continuous compliance validation.
Production readiness checklist covering scalability, reliability, observability, security, operational maturity, and documentation. Use before promoting a service to production or during periodic production-readiness audits.
Run a structured security audit across IAM, network, encryption, logging, and vulnerability management. Covers identity hygiene, network segmentation, data protection, and audit trail completeness. Use for periodic security reviews or pre-compliance preparation.
SLO definition workflow covering SLI selection, target setting, error budget policy, alerting strategy, and stakeholder alignment. Use when establishing SLOs for a new service, revising existing targets, or implementing an SRE practice.
Assess SOC 2 Type II compliance readiness across the Trust Services Criteria: security, availability, processing integrity, confidentiality, and privacy. Covers access controls, change management, monitoring, incident response, and vendor management. Use for audit preparation or continuous compliance m...
SSL/TLS certificate rotation workflow covering certificate discovery, renewal planning, deployment procedures, and validation. Supports ACM, Let's Encrypt, and manual CA certificates. Use for scheduled rotations, expiring certificate remediation, or certificate automation setup.
Great Expectations data quality framework analysis. Covers expectation suite management, validation result review, data docs generation, checkpoint execution, datasource configuration, and batch analy
AWS-native cost optimization using Compute Optimizer, Trusted Advisor cost checks, Savings Plan analysis, Cost Explorer, and resource rightsizing. Covers EC2 rightsizing, idle resource detection, Savi
Airbyte data integration platform management. Covers connection status monitoring, sync job tracking, source and destination health, schema change detection, workspace management, and connector upgrad
Apache Airflow workflow orchestration management. Covers DAG management, task instance monitoring, executor health, pool status, variable management, connection audit, and scheduler diagnostics. Use w
Apigee API platform management - API proxy deployment, environment configuration, analytics and traffic analysis, developer and app management. Use when managing Apigee-based API infrastructure, deplo
Apptio and Cloudability IT financial management and cloud cost optimization. Covers IT cost allocation, cloud cost analysis, benchmarking, budget forecasting, showback/chargeback reports, and technolo
Auth0 identity platform management for tenant configuration, application setup, connection analysis, rule and action review, user management, and log inspection. Covers universal login, social connect
AWS IAM deep analysis for policy evaluation, role trust relationship review, access advisor data, credential reports, permission boundary inspection, and identity-based vs resource-based policy analys
Microsoft Entra ID (formerly Azure AD) management for user lifecycle, app registrations, service principals, conditional access policies, sign-in log analysis, and directory role assignments. Covers B2B g
BentoML model serving and packaging management. Covers service management, model packaging, deployment status, API testing, Bento building, and runner configuration. Use when packaging ML models for s
Caddy web server management, automatic TLS certificate handling, reverse proxy configuration, site management, and plugin inspection. Covers Caddyfile analysis, admin API usage, access logs, and upstr
CAST AI Kubernetes cluster optimization and cost management. Covers cluster cost optimization, spot instance management, cost reports, node rebalancing, security posture, and workload right-sizing. Us
CloudHealth by VMware multi-cloud cost management and governance. Covers cost reporting, rightsizing recommendations, governance policies, budget tracking, reserved instance management, and compliance
CoreDNS server management, zone configuration, plugin chain analysis, query logging, and cache statistics. Covers Corefile inspection, health endpoint monitoring, metrics collection, and forwarding co
CyberArk privileged access management for safe management, privileged account discovery, session monitoring, credential rotation status, and security audit. Covers Vault, PVWA, PSM, and CyberArk Privi
Dagster data orchestration platform management. Covers asset management, pipeline runs, sensor and schedule status, IO manager configuration, partition management, and resource health. Use when checki
dbt (data build tool) project management and monitoring. Covers model runs, test results, source freshness, documentation generation, manifest analysis, and dbt Cloud API integration. Use when checkin
Feast feature store management. Covers feature store configuration, entity management, feature views, materialization, online serving, and data source inspection. Use when managing ML feature pipeline
Finout cloud cost management and MegaBill analysis platform. Covers cost allocation, MegaBill analysis, virtual tagging, showback reports, cost anomaly detection, and cross-provider spend optimization
Fivetran data integration connector management. Covers connector status monitoring, sync operations, schema drift detection, usage metrics, destination management, and transformation scheduling. Use w
FusionAuth identity platform management for tenant configuration, application setup, theme customization, webhook configuration, user management, and audit log review. Covers login flows, passwordless
Google Workspace administration for user provisioning, group management, organizational unit structure, security settings review, and audit log analysis. Covers Google Admin SDK Directory API, Reports
GraphQL API management - schema introspection, query complexity analysis, resolver performance monitoring, and subscription management. Use when analyzing GraphQL schemas, optimizing query performance
gRPC service management - service discovery via reflection, health checking, load balancing analysis, channelz diagnostics, and proto schema inspection. Use when debugging gRPC services, inspecting av
HAProxy load balancer management, backend health monitoring, frontend statistics, server status tracking, and ACL management. Covers session analysis, connection metrics, SSL termination status, and s
Hasura GraphQL Engine management - metadata inspection, remote schema configuration, event trigger management, action definitions, and permission analysis. Use when managing Hasura-based APIs, configu
Hugging Face platform management. Covers model hub, datasets, spaces, inference endpoints, tokenizer inspection, and model card analysis. Use when searching for models, managing datasets, deploying in
Infracost Terraform cost estimation and FinOps policy enforcement. Covers cost breakdown by resource, diff analysis between plan changes, CI/CD integration checks, policy validation, and cost forecast
JumpCloud directory platform management for user and system management, SSO application configuration, device policy enforcement, group management, and audit event review. Covers LDAP, RADIUS, SSO app
Keycloak identity and access management for realm administration, client configuration, user federation, role mapping, session analysis, and authentication flow review. Covers OpenID Connect, SAML, LD
Kong Gateway management - service and route configuration, plugin management, upstream health monitoring, consumer and credential management. Use when managing API gateway infrastructure, configuring
Kubecost Kubernetes cost monitoring and optimization. Covers namespace cost allocation, workload cost breakdown, efficiency scoring, savings recommendations, cluster cost trends, and budget alerting.
Kubeflow ML platform management on Kubernetes. Covers pipeline management, experiment tracking, notebook servers, KFServing/KServe inference, Katib hyperparameter tuning, and training operators. Use w
Label Studio data labeling platform management. Covers project management, annotation tasks, model predictions, data export, user management, and labeling configuration. Use when managing annotation p
Looker BI platform management. Covers explore analysis, dashboard review, LookML project management, PDT status monitoring, query performance, user activity, and content validation. Use when analyzing
Metabase BI platform management. Covers question and dashboard analysis, database connection management, pulse/subscription monitoring, collection organization, query performance, and user activity tr
MLflow experiment tracking and model registry management. Covers run comparison, artifact management, model serving, and metric analysis. Use when managing ML expe
Neptune.ai experiment tracking and model registry management. Covers experiment comparison, model registry, dashboard monitoring, run metadata analysis, and artifact management. Use when managing ML e
Nginx web server and reverse proxy management, upstream health monitoring, virtual host analysis, access log analytics, and rate limiting configuration. Covers server block inspection, SSL certificate
nOps AWS cloud optimization and cost management platform. Covers ShareSave automated savings, commitment management, well-architected reviews, idle resource detection, and AWS cost optimization. Use w
Okta identity and access management for user lifecycle management, application assignments, MFA status review, group and policy administration, and system log analysis. Covers user provisioning, SSO a
Ping Identity platform management for SSO configuration, MFA policy review, directory bridge monitoring, session management, and environment health checks. Covers PingOne, PingFederate, and PingAccess
Postman workspace management - collection and request management, environment variable configuration, monitor run analysis, and API documentation generation. Use when managing API testing workflows, a
Prefect workflow orchestration platform management. Covers flow run monitoring, deployment management, work pool status, automation rules, block configuration, and agent health. Use when checking flow
Ray distributed computing platform management. Covers cluster management, job submission, Ray Serve deployment, dashboard monitoring, actor management, and resource utilization. Use when managing dist
AWS SageMaker ML platform management. Covers training jobs, model endpoints, model registry, pipelines, feature store, experiments, and hyperparameter tuning. Use when managing ML training workflows,
Seldon Core model deployment and inference management on Kubernetes. Covers model deployment, inference graphs, A/B testing, canary rollouts, monitoring, and explainability. Use when deploying ML mode
Spot.io and Ocean cluster cost optimization for Kubernetes and cloud workloads. Covers workload right-sizing, savings analysis, spot instance management, and infrastructure
Apache Superset BI platform management. Covers chart and dashboard management, dataset analysis, query history review, database connections, saved query management, and alert/report scheduling. Use wh
Swagger and OpenAPI specification management - spec validation, code generation, Swagger UI management, and schema analysis. Use when validating API specifications, generating client/server code from
Tailscale mesh VPN management, device status monitoring, ACL policy analysis, DNS configuration, and exit node management. Covers network topology inspection, key expiry tracking, MagicDNS status, and
Tyk API gateway management - API definitions, policy configuration, analytics and usage data, key management, and dashboard operations. Use when managing Tyk-based API infrastructure, configuring acce
Vantage cloud cost management and observability platform. Covers cost reports across providers, anomaly detection, budget alerts, resource-level cost tracking, and provider integrations. Use when anal
Google Vertex AI platform management. Covers model management, training pipelines, prediction endpoints, feature store, experiments, datasets, and custom jobs. Use when managing ML models on GCP, depl
Weights & Biases experiment tracking and ML ops management. Covers run tracking, sweep management, artifact versioning, report analysis, model registry, and team collaboration. Use when managing ML ex
WireGuard VPN tunnel management, peer configuration, handshake timing analysis, transfer statistics, and interface monitoring. Covers tunnel status inspection, key management, endpoint tracking, and r
Monte Carlo data observability platform monitoring. Covers data freshness monitors, volume anomalies, schema changes, lineage analysis, incident management, and custom monitors. Use when investigating
Cache flush and warm procedure covering impact assessment, flush execution, and warming strategy. Use when cache data is stale, after schema changes, or when cache corruption is suspected.
Database failover procedure covering pre-checks, failover execution, validation, and DNS update. Use when performing planned database failover, responding to primary database failure, or testing failo
Dependency upgrade workflow covering compatibility check, testing, deployment, and monitoring. Use for library upgrades, framework version bumps, runtime updates, or security patch application.
Load balancer failover procedure covering health checks, traffic shift, backend validation, and rollback. Use for planned LB maintenance, active-passive failover, or responding to LB degradation.
Log rotation and cleanup procedure covering disk usage check, rotation configuration, archival, and cleanup. Use when disk usage is high due to logs, configuring new log rotation policies, or performi
Network partition recovery procedure covering diagnosis, route fix, connectivity validation, and root cause analysis. Use when services experience network segmentation, intermittent connectivity, or c
OS patching window procedure covering pre-patch backup, patch application, reboot sequence, and validation. Use for scheduled maintenance windows, security patch deployments, or kernel upgrades.
Application scaling procedure covering trigger assessment, scale execution, and monitoring. Use for manual scaling in response to traffic spikes, planned events, or autoscaling failures.
Secret and credential rotation procedure covering discovery, rotation execution, deployment, and validation. Use for scheduled credential rotation, responding to leaked secrets, or compliance-driven k
Storage expansion procedure covering capacity check, volume resize, filesystem extension, and validation. Use when disk usage approaches thresholds, before large data ingestion, or for planned storage
Alibaba Cloud infrastructure management via the aliyun CLI. Covers ECS instances, RDS databases, VPCs, SLB load balancers, OSS storage, and billing. Use when managing Alibaba Cloud resources or checki
Allma incident management, collaboration workflows, post-incident learning, and channel orchestration. Covers incident creation via Slack, timeline tracking, stakeholder communication, automated workf
Ambassador (Emissary-Ingress) API gateway management covering mappings, hosts, TLS contexts, authentication filters, rate limiting, and Envoy proxy configuration. Use when managing Ambassador/Emissary
Anecdotes compliance automation platform for managing security programs, controls, evidence, and multi-framework compliance. Covers control monitoring, evidence automation, plugin integrations, gap an
Apache Airflow deep management covering DAG inventory, task instance monitoring, scheduler health, executor status, pool and variable management, connection auditing, and SLA miss tracking. Use when i
Appsmith low-code platform management covering application inventory, workspace organization, datasource health, page and widget analysis, and user access auditing. Use when reviewing internal tool co
Atlassian Statuspage management, component monitoring, incident communication, scheduled maintenance, and subscriber notifications. Covers page configuration, component group management, incident life
AWS Amplify application management and deployment analysis. Covers Amplify apps, branches, backend environments, build status, domain associations, and webhook configurations. Use when inspecting Ampl
AWS App Runner service management and health analysis. Covers service inventory, deployment status, auto-scaling configurations, custom domain associations, VPC connector settings, and observability c
AWS AppConfig configuration management and deployment analysis. Covers applications, environments, configuration profiles, deployment strategies, hosted configurations, and deployment status monitorin
AWS Backup vault, plan, and job management. Covers backup vaults, backup plans, backup selections, recovery points, backup jobs, restore jobs, and compliance frameworks. Use when auditing backup cover
Advanced AWS CloudFront management covering distribution lifecycle, behavior configuration, Lambda@Edge functions, origin groups for failover, real-time logs, field-level encryption, and cache policy
AWS CodeBuild project management and build analysis. Covers build project inventory, build history, build phase details, environment configurations, source credentials, report groups, and build metric
AWS CodeCommit repository management and analysis. Covers repository inventory, branch details, pull request status, commit history, approval rules, and trigger configurations. Use when inspecting rep
AWS CodeDeploy application and deployment management. Covers applications, deployment groups, deployment history, deployment configurations, instance health, target revisions, and rollback settings. U
AWS CodePipeline CI/CD pipeline management and execution analysis. Covers pipeline inventory, stage and action status, execution history, pipeline triggers, artifact stores, and action type configurat
AWS Control Tower landing zone and account management. Covers landing zone status, enrolled accounts, guardrails (controls), organizational units, baseline configurations, and drift detection. Use whe
AWS Data Pipeline workflow management and execution analysis. Covers pipeline inventory, pipeline definitions, execution status, object status, task runner health, and pipeline scheduling. Use when in
AWS DocumentDB cluster management and health analysis. Covers cluster inventory, instance status, parameter groups, subnet groups, snapshots, event subscriptions, and performance metrics. Use when ins
AWS DynamoDB deep-dive management covering table configurations, capacity analysis, GSI/LSI health, auto-scaling policies, streams, backups, contributor insights, and TTL settings. Use when performing
AWS EFS file system management and performance analysis. Covers file system inventory, mount targets, access points, throughput modes, lifecycle policies, replication configurations, and storage metri
AWS FSx file system management and health analysis. Covers FSx for Lustre, Windows File Server, NetApp ONTAP, and OpenZFS file systems, volumes, backups, data repository associations, and storage capa
AWS Inspector vulnerability management and finding analysis. Covers Inspector v2 coverage, vulnerability findings, finding summaries, scan configurations, suppression rules, and coverage statistics. U
Azure CDN management covering profiles, endpoints, custom domains, caching rules, origin groups, and performance analytics. Supports Microsoft, Verizon, and Akamai CDN providers within Azure. Use when
Azure DNS management covering public and private DNS zones, record sets, virtual network links, DNSSEC, and alias records. Use when managing Azure DNS zones, configuring DNS records, setting up privat
Backblaze B2 cloud storage management via the b2 CLI. Covers buckets, files, lifecycle rules, keys, and usage statistics. Use when managing B2 storage or reviewing bucket configurations and costs.
Baserow open-source database platform management covering workspace organization, database and table inventory, field configuration analysis, view management, webhook monitoring, and user access audit
Bazel build system management. Covers BUILD file analysis, target querying, dependency graphs, remote execution, caching configuration, and build performance profiling. Use when managing Bazel workspa
Better Uptime monitoring, status pages, incident management, on-call scheduling, and heartbeat tracking. Covers monitor configuration, response time analysis, incident lifecycle, on-call rotations, an
BIND9 DNS server management covering zone files, named configuration, DNSSEC, RNDC controls, query logging, and server statistics. Use when managing BIND9 DNS servers, auditing zone configurations, tr
Bitbucket deep platform management covering repository inventory, Pipelines CI/CD monitoring, pull request analysis, branch permission auditing, deployment environment tracking, and workspace member m
Synopsys Black Duck software composition analysis for open-source risk management, vulnerability detection, and license compliance. Covers project scanning, BOM component analysis, vulnerability track
Blameless incident management, SLO tracking, retrospectives, and reliability insights. Covers incident lifecycle, blameless retrospective facilitation, follow-up tracking, reliability scorecards, and
Budibase low-code platform management covering application inventory, table and datasource health, screen layout analysis, automation monitoring, and user role auditing. Use when reviewing internal ap
Bunny.net CDN management covering pull zones, storage zones, edge rules, cache configuration, and bandwidth analytics. Use when managing Bunny CDN pull zones, analyzing traffic patterns, configuring e
PortSwigger Burp Suite Enterprise Edition for automated dynamic application security testing (DAST). Covers scan management, vulnerability findings, site configuration, scan scheduling, and issue repo
Cachet open-source status page management, component tracking, incident reporting, and metric visualization. Covers component group management, incident lifecycle, scheduled maintenance, metric points
Cargo Rust build and package management. Covers build configuration, workspace management, feature flags, build profiles, cross-compilation, and build script analysis. Use when managing Cargo workspac
Chamber secrets management using AWS Systems Manager Parameter Store, service-scoped secret organization, environment variable export, and parameter auditing. Covers reading and writing secrets by ser
Changesets version management covering changeset file inventory, pending version bump analysis, package release readiness, changelog generation status, and monorepo release coordination. Use when audi
Google Chronicle SIEM security operations, threat detection, asset investigation, and log analysis. Covers UDM search queries, detection rule management, asset and IOC lookups, reference list manageme
Cloudflare CDN management including zone configuration, cache purging, page rules, firewall rules, SSL/TLS settings, and performance optimization. Covers cache analytics, bandwidth usage, threat detec
Advanced Cloudflare DNS management covering zone records, DNSSEC configuration, DNS analytics, load balancing pools, health checks, and DNS firewall. Use for deep DNS record management, DNSSEC trouble
Cloudflare R2 object storage management via the wrangler CLI and Cloudflare API. Covers buckets, objects, usage metrics, CORS policies, and lifecycle rules. Use when managing R2 storage or reviewing b
CockroachDB Cloud (Serverless & Dedicated) management via the ccloud CLI and CockroachDB Cloud API. Covers clusters, databases, SQL users, networking, backups, and metrics. Use when managing Cockroach
CocoaPods dependency management for iOS/macOS. Covers Podfile configuration, pod publishing, dependency resolution, spec repositories, and pod analysis. Use when managing CocoaPods dependencies, publi
Codacy code quality management. Covers automated code reviews, pattern configuration, coverage tracking, security analysis, and repository settings. Use when managing Codacy projects, reviewing code p
Code Climate quality management. Covers maintainability analysis, test coverage tracking, issue detection, GPA scoring, and repository configuration. Use when managing Code Climate projects, reviewing
Codecov coverage tracking management. Covers coverage reporting, PR comments, flag management, component analysis, and coverage trend tracking. Use when managing Codecov configuration, analyzing cover
CODEOWNERS file management covering ownership mapping analysis, coverage gap detection, team assignment validation, orphaned path identification, and review requirement auditing. Use when auditing cod
ConfigCat feature flag management, remote configuration, A/B testing, targeting rules, and percentage-based rollouts. Covers flag listing, targeting rule management, environment overrides, audit log r
HashiCorp Consul Connect service mesh management covering service discovery, intentions, sidecar proxies, mesh gateways, ingress/terminating gateways, certificate management, and health checks. Use wh
Contentful headless CMS management covering space and environment inventory, content type analysis, entry and asset monitoring, locale configuration, webhook tracking, API key auditing, and usage quot
Contour ingress controller management covering HTTPProxy resources, TLS delegation, rate limiting, request policies, inclusion/delegation, and Envoy proxy configuration. Use when managing Contour HTTP
Coveralls coverage tracking management. Covers coverage reporting, PR status checks, repository configuration, coverage history, and badge management. Use when managing Coveralls coverage reports, con
crates.io Rust package registry management. Covers crate publishing, version management, dependency resolution, feature flags, build configurations, and crate metadata analysis. Use when managing Rust
CrowdStrike Falcon endpoint detection and response, threat intelligence, host management, and incident investigation. Covers detection queries, host inventory, IOC searches, real-time response session
Dagger CI/CD pipeline management. Covers pipeline-as-code configuration, module management, function execution, caching layers, container builds, and pipeline debugging. Use when managing Dagger pipel
Dagster deep data orchestration management covering job and asset inventory, run monitoring, schedule and sensor health, partition status, resource configuration, and daemon health checks. Use when in
Danger JS automated code review management. Covers Dangerfile configuration, PR rule enforcement, plugin management, CI integration, and custom rule development. Use when managing Danger JS rules, con
DeepSource code analysis management. Covers automated code reviews, issue detection, antipattern analysis, coverage tracking, and analyzer configuration. Use when managing DeepSource projects, reviewi
DevCycle feature flag management, targeting rules, environments, variable management, and usage metrics. Covers feature creation, variation configuration, targeting rules, audience segments, and API u
DigitalOcean infrastructure management via the doctl CLI. Covers Droplets, databases, load balancers, firewalls, domains, and account billing. Use when managing DigitalOcean resources, checking Drople
DigitalOcean App Platform management via the doctl CLI. Covers apps, components, deployments, logs, domains, alerts, and billing. Use when managing App Platform applications or checking deployment hea
DigitalOcean Kubernetes (DOKS) management via the doctl CLI. Covers clusters, node pools, upgrades, kubeconfig, and cluster health. Use when managing DOKS clusters or checking Kubernetes infrastructur
DigitalOcean Spaces object storage management via the AWS CLI (S3-compatible) and doctl CLI. Covers Spaces buckets, objects, CDN endpoints, CORS, and lifecycle policies. Use when managing DigitalOcean
Directus headless CMS and data platform management covering collection inventory, field configuration, role and permission auditing, flow automation monitoring, webhook management, and file storage an
DNSimple DNS and domain management covering domains, DNS records, zone files, certificates, contacts, and domain registration. Use when managing DNSimple hosted zones, configuring DNS records, managin
Doppler secrets management, environment configuration, project organization, and access control. Covers secret syncing, config comparison across environments, activity logs, service token management,
Dotenv Vault encrypted environment variable management, environment syncing, team collaboration, and version tracking. Covers vault creation, environment pushing and pulling, key listing, version hist
Drata compliance automation platform for SOC 2, ISO 27001, HIPAA, PCI DSS, and GDPR compliance monitoring. Covers control monitoring, evidence collection, personnel management, asset tracking, and aud
Earthly build system management. Covers Earthfile configuration, target analysis, artifact management, caching strategies, satellite builds, and CI integration. Use when managing Earthly builds, debug
Elastic Security SIEM detection management, alert triage, rule configuration, and threat hunting. Covers security alerts, detection rules, timeline investigations, endpoint events, and case management
Eppo feature flag management, experiment assignment, statistical analysis, and metric definition. Covers flag configuration, experiment design, Bayesian and frequentist analysis, metric pipelines, and
Equinix Metal bare metal infrastructure management via the metal CLI and Equinix Metal API. Covers devices, projects, VLANs, IPs, BGP sessions, and capacity. Use when managing Equinix Metal bare metal
Fauna database management via the fauna CLI and Fauna API. Covers databases, collections, indexes, functions, keys, and query execution. Use when managing Fauna databases or reviewing document-relatio
FireHydrant incident management, runbooks, service catalog, status pages, and retrospectives. Covers incident lifecycle, severity tracking, team assignments, change events, and reliability analytics.
Flagsmith feature flag and remote config management, user segments, A/B testing, and environment management. Covers flag listing, identity overrides, segment rules, change requests, and audit logging.
Flipt feature flag management, segment-based targeting, rule configuration, and namespace organization. Covers flag and variant management, segment constraints, distribution rules, rollout percentages
Forgejo self-hosted Git forge management covering repository inventory, organization structure, user administration, webhook monitoring, Actions runner status, and federation configuration. Use when a
Micro Focus Fortify application security testing for static analysis (SAST), dynamic analysis (DAST), and software security assurance. Covers SSC project management, scan result analysis, issue tracki
Google Cloud CDN management covering backend services, URL maps, cache configuration, SSL policies, Cloud Armor integration, and CDN metrics. Use when managing GCP Cloud CDN backends, analyzing cache
Gitea self-hosted Git platform management covering repository inventory, organization structure, user administration, webhook monitoring, issue and pull request tracking, and CI/CD integration status.
GitHub deep platform management covering repository inventory, branch protection analysis, Actions workflow monitoring, security alerts, dependency graph, code scanning results, organization member au
GitLab deep platform management covering project inventory, CI/CD pipeline monitoring, merge request analysis, container registry health, security dashboard, runner status, group and member auditing,
Gloo Edge API gateway management covering virtual services, upstreams, route tables, authentication policies, rate limiting, WAF, and transformation filters. Use when managing Gloo Edge gateway routin
Go modules package management. Covers module initialization, dependency resolution, version management, module proxying, vendoring, and vulnerability scanning. Use when managing Go modules, resolving
GoDaddy DNS management covering domain DNS records, zone configuration, domain availability, registration details, and forwarding rules. Use when managing GoDaddy-hosted DNS records, checking domain r
Google Cloud DNS management covering managed zones, record sets, DNSSEC configuration, DNS policies, and peering. Use when managing GCP Cloud DNS zones, configuring DNS records, enabling DNSSEC, setti
Gradle build system management. Covers build script analysis, task execution, dependency resolution, build scans, plugin management, and multi-project builds. Use when managing Gradle projects, debugg
GrowthBook feature flag management, A/B testing, experiment analysis, and data-driven decisions. Covers feature definitions, experiment configuration, metric tracking, visual editor experiments, and S
Anchore Grype vulnerability scanner for container images, filesystems, and SBOMs. Covers image scanning, SBOM-based vulnerability matching, severity filtering, fix tracking, and database management. U
Hetzner Cloud and dedicated server management via the hcloud CLI and Hetzner API. Covers servers, volumes, networks, firewalls, load balancers, and snapshots. Use when managing Hetzner infrastructure
Hetzner Cloud deep-dive management via the hcloud CLI. Covers server types, pricing, placement groups, certificates, primary IPs, and detailed server metrics. Use for detailed Hetzner Cloud analysis b
Hex.pm Elixir/Erlang package registry management. Covers package publishing, dependency resolution, Mix configuration, Hex organization management, and package metadata. Use when managing Hex packages
Hyperproof compliance operations platform for managing controls, evidence, risks, and audit workflows across multiple frameworks. Covers control testing, evidence automation, task management, risk reg
IBM Cloud infrastructure management via the ibmcloud CLI. Covers VPC instances, Kubernetes clusters, Cloud Foundry apps, databases, object storage, and billing. Use when managing IBM Cloud resources o
IFTTT automation platform management covering applet inventory, service connection health, activity log monitoring, and usage tracking. Use when auditing applet configurations, investigating trigger o
Imperva (Incapsula) CDN and application security management covering site configuration, caching rules, WAF policies, DDoS protection, SSL settings, and performance analytics. Use when managing Imperv
incident.io incident lifecycle management, severity tracking, custom fields, status pages, post-incident reviews, and role assignments. Covers incident creation, escalation, follow-up tracking, and an
Infisical secrets management, environment configuration, access control, secret versioning, and audit logging. Covers project and workspace management, secret syncing across environments, access polic
Infoblox DDI (DNS, DHCP, IPAM) management covering DNS zones, records, networks, IP address allocation, DHCP scopes, and grid health. Use when managing Infoblox DNS/DHCP infrastructure, auditing IP ad
Instatus status page management, component monitoring, incident communication, and subscriber notifications. Covers status page creation, component health tracking, incident updates, scheduled mainten
Advanced Istio service mesh management covering Envoy proxy debugging, Wasm extensions, multi-cluster mesh federation, ambient mesh, advanced traffic policies, rate limiting, circuit breaking, and obs
Jeli incident analysis, narrative-based post-incident reviews, opportunity identification, and learning extraction. Covers incident ingestion from Slack, timeline reconstruction, contributing factor a
Just command runner management. Covers justfile analysis, recipe discovery, variable management, recipe dependencies, and cross-platform configuration. Use when managing justfiles, discovering availab
ArgoCD deep-dive management for advanced GitOps operations. Covers ApplicationSet controllers, multi-cluster sync strategies, notification configurations, resource hook analysis, sync wave ordering, a
Cert-manager deep-dive management for Kubernetes TLS certificate lifecycle. Covers certificate inventory, issuer health, certificate requests, challenges, orders, ACME configuration, and renewal statu
Crossplane deep-dive management for Kubernetes-native infrastructure provisioning. Covers provider health diagnostics, composition debugging, XRD schema validation, claim lifecycle analysis, managed r
ExternalDNS management for Kubernetes DNS record automation. Covers ExternalDNS deployment health, DNS record synchronization status, source and provider configurations, domain filters, ownership trac
Flux CD deep-dive management for Kubernetes GitOps delivery. Covers GitRepository sources, Kustomization reconciliation, HelmRelease status, HelmRepository health, ImagePolicy automation, notification
Grafana Operator management for Kubernetes dashboard and datasource automation. Covers Grafana instances, GrafanaDashboard CRDs, GrafanaDatasource configurations, GrafanaFolder management, and operato
Kubernetes HPA and VPA autoscaling management. Covers HorizontalPodAutoscaler configurations, scaling history, target utilization, current vs desired replicas, VerticalPodAutoscaler recommendations, u
Loki stack management for Kubernetes log aggregation. Covers Loki deployment health, Promtail/log agent status, log stream ingestion, storage backend configuration, retention policies, and query perfo
Kubernetes Metrics Server management and resource metrics analysis. Covers Metrics Server deployment health, node and pod resource utilization, top consumers, API availability, and metrics accuracy. U
Kubernetes NetworkPolicy management and network segmentation analysis. Covers NetworkPolicy inventory, ingress and egress rules, pod selector coverage, namespace isolation, default deny policies, and
Kubernetes PodDisruptionBudget management and disruption analysis. Covers PDB inventory, allowed disruptions, eviction status, node drain impact, workload protection coverage, and PDB misconfiguration
Prometheus Operator management for Kubernetes monitoring stack. Covers Prometheus instances, ServiceMonitors, PodMonitors, PrometheusRules, Alertmanager configurations, scrape targets, and recording r
Kubernetes ResourceQuota and LimitRange management. Covers quota inventory, usage vs limits, LimitRange defaults, namespace resource consumption, quota violations, and capacity planning. Use when audi
Sealed Secrets management for Kubernetes secret encryption. Covers SealedSecret resources, controller health, certificate rotation, key management, decryption status, and secret synchronization. Use w
Velero deep-dive management for Kubernetes backup and disaster recovery. Covers backup schedules, backup status, restore operations, backup storage locations, volume snapshot locations, backup item ac
KeyCDN management covering zones, zone aliases, cache settings, SSL certificates, and usage analytics. Use when managing KeyCDN zones, analyzing bandwidth and request metrics, configuring cache behavi
Kong Mesh service mesh management covering meshes, dataplanes, traffic policies, mesh gateways, mTLS configuration, and observability. Built on Kuma with enterprise features. Use when managing Kong Me
Kuma service mesh management covering meshes, dataplanes, traffic policies, mesh gateways, fault injection, rate limiting, and observability configuration. Supports both Kubernetes and Universal (VM)
Lacework cloud security platform for threat detection, compliance assessment, vulnerability management, and cloud workload protection. Covers alert investigation, compliance report analysis, container
Laika compliance platform for SOC 2, ISO 27001, HIPAA, and other security framework management. Covers control monitoring, policy management, evidence workflows, vendor assessment, and employee compli
LaunchDarkly feature flag management, targeting rules, environments, experimentation, and audit logging. Covers flag lifecycle, user targeting, percentage rollouts, prerequisite flags, flag scheduling
Limelight Networks (Edgio) CDN management covering delivery configurations, origin settings, cache policies, SSL certificates, and real-time analytics. Use when managing Limelight/Edgio CDN properties
Advanced Linkerd service mesh management covering proxy diagnostics, multi-cluster linking, policy resources, HTTPRoute configuration, server authorization, retry budgets, and advanced observability.
Linode (Akamai Cloud) infrastructure management via the linode-cli. Covers Linodes, NodeBalancers, volumes, domains, databases, and Kubernetes (LKE). Use when managing Linode/Akamai resources or check
GNU Make build system management. Covers Makefile analysis, target discovery, dependency graphs, variable inspection, and build debugging. Use when managing Makefiles, understanding target dependencie
Make (formerly Integromat) scenario management covering scenario inventory, execution history, connection health, webhook monitoring, and data store analysis. Use when auditing automation scenarios, i
Maven build system management. Covers POM analysis, dependency management, plugin configuration, multi-module builds, repository settings, and lifecycle phases. Use when managing Maven projects, resol
MegaLinter multi-language linting management. Covers linter configuration, flavor selection, reporter setup, CI integration, and fix mode management. Use when managing MegaLinter configurations, enabl
Neon serverless Postgres management via the neonctl CLI and Neon API. Covers projects, branches, databases, roles, endpoints, and compute scaling. Use when managing Neon databases or reviewing branch
NGINX Ingress Controller management covering ingress resources, backend services, TLS configuration, annotations, rate limiting, custom error pages, and controller health. Use when managing NGINX Ingr
NocoDB spreadsheet-database platform management covering base inventory, table structure analysis, view configurations, webhook monitoring, shared view auditing, and API token management. Use when rev
npm registry and package management. Covers package publishing, version management, registry configuration, access controls, audit scanning, and dependency analysis. Use when managing npm packages, co
NS1 managed DNS platform covering zones, records, filter chains, monitoring jobs, DNSSEC, and traffic steering. Use when managing NS1 DNS zones, configuring intelligent traffic routing via filter chai
ProjectDiscovery Nuclei vulnerability scanner for template-based scanning of web applications, networks, and cloud services. Covers scan execution, template management, result analysis, severity track
NuGet package registry management. Covers package publishing, version management, dependency resolution, package source configuration, vulnerability scanning, and .NET project analysis. Use when manag
Nx monorepo build system management. Covers workspace configuration, project graph analysis, affected commands, computation caching, task executors, and distributed task execution. Use when managing N
Open Service Mesh (OSM) management covering mesh configuration, sidecar injection, traffic policies, SMI resources, ingress configuration, and observability. Use when managing OSM mesh instances, conf
Optimizely feature experimentation, flag management, audience targeting, event tracking, and results analysis. Covers feature flag configuration, experiment design, rollout management, audience rules,
Oracle Cloud Infrastructure management via the OCI CLI. Covers compute instances, networking, block storage, databases, and IAM. Use when managing Oracle Cloud resources, checking instance health, or
Oracle OCI Compute deep-dive management via the OCI CLI. Covers instances, shapes, images, boot volumes, console connections, instance pools, and autoscaling. Use for detailed OCI compute analysis.
Oracle OCI Database deep-dive management via the OCI CLI. Covers DB Systems, Autonomous Databases, backups, Data Guard, patching, and performance metrics. Use for detailed OCI database analysis.
Orca Security cloud-native application protection platform for agentless vulnerability scanning, compliance monitoring, and threat detection across cloud environments. Covers alert management, asset i
Google OSV-Scanner for open-source vulnerability detection using the OSV database. Covers dependency scanning, SBOM analysis, vulnerability lookup, license checking, and guided remediation. Use when s
OVHcloud infrastructure management via the ovh CLI and OVHcloud API. Covers dedicated servers, VPS, Public Cloud instances, databases, domains, and billing. Use when managing OVHcloud resources or rev
Packagist PHP package registry management. Covers Composer package publishing, version management, autoloading configuration, dependency resolution, and package metadata. Use when managing PHP package
Pants build system management. Covers BUILD file analysis, target querying, dependency inference, remote execution, cache configuration, and build performance. Use when managing Pants workspaces, debu
Payload CMS management covering collection inventory, global configuration, access control analysis, media library monitoring, version and draft tracking, and webhook status. Use when reviewing conten
Pi-hole DNS sinkhole management covering ad-blocking statistics, query logs, blocklist management, client activity, gravity database, and network-wide DNS filtering. Use when managing Pi-hole instance
Pipedream serverless workflow management covering workflow inventory, event source monitoring, connected account health, execution history, and credit usage tracking. Use when auditing workflow config
PlanetScale managed database platform management via the pscale CLI. Covers databases, branches, deploy requests, connection strings, and schema management. Use when managing PlanetScale databases or
Microsoft Power Automate flow management covering flow inventory, run history, connection health, connector usage, and environment monitoring. Use when auditing cloud or desktop flows, investigating r
PowerDNS authoritative and recursor management covering zones, records, server statistics, DNSSEC, TSIG keys, and zone metadata. Use when managing PowerDNS servers, configuring DNS zones and records,
pre-commit hook framework management. Covers hook configuration, repository management, hook execution, CI integration, and autoupdate management. Use when managing pre-commit hooks, adding or removin
Prefect deep workflow orchestration management covering flow inventory, flow run monitoring, deployment health, work pool status, block configuration, automation rules, and agent/worker health checks.
Palo Alto Prisma Cloud security platform for cloud security posture management, workload protection, and compliance monitoring. Covers alert management, policy configuration, asset inventory, vulnerab
PyPI package registry management. Covers package publishing, version management, distribution builds, dependency resolution, index configuration, and package metadata. Use when managing Python package
Qualys vulnerability management, asset inventory, compliance scanning, and security posture assessment. Covers vulnerability scan results, host detection queries, VMDR dashboard metrics, patch priorit
Railway platform management via the railway CLI and Railway API. Covers projects, services, deployments, databases, volumes, and environment variables. Use when managing Railway deployments or checkin
Rapid7 InsightVM vulnerability management, asset discovery, risk scoring, and remediation tracking. Covers vulnerability scanning, asset inventory, risk prioritization, scan scheduling, and compliance
Render platform management via the Render API. Covers web services, static sites, background workers, cron jobs, databases, and environment groups. Use when managing Render deployments or checking ser
Renovate dependency update bot management covering configuration analysis, update PR tracking, dependency dashboard monitoring, package rule auditing, merge confidence review, and schedule optimizatio
Reviewdog automated code review management. Covers linter integration, PR comment configuration, CI setup, reporter configuration, and multi-tool orchestration. Use when managing Reviewdog CI integrat
Rootly incident management, on-call scheduling, workflows, retrospectives, and service catalog. Covers incident creation and lifecycle, automated workflows, alert routing, status pages, and post-incid
RubyGems package registry management. Covers gem publishing, version management, Bundler dependency resolution, gemspec configuration, and security scanning. Use when managing Ruby gems, publishing to
Sanity.io headless CMS management covering dataset inventory, document type analysis, GROQ query monitoring, asset management, webhook tracking, and project member auditing. Use when reviewing content
Scaleway cloud infrastructure management via the scw CLI. Covers instances, Kubernetes (Kapsule), managed databases, object storage, serverless containers, and billing. Use when managing Scaleway reso
Scrut Automation compliance platform for SOC 2, ISO 27001, HIPAA, GDPR, and other security frameworks. Covers risk management, control monitoring, evidence automation, vendor risk assessment, and cont
Bitnami Sealed Secrets management for Kubernetes, certificate rotation, secret encryption, and controller health. Covers encrypting secrets for Git storage, certificate lifecycle management, namespace
Section.io edge compute and CDN management covering applications, environments, proxy stack configuration, Varnish cache settings, and real-time monitoring. Use when managing Section.io edge applicati
Secureframe compliance automation platform for SOC 2, ISO 27001, HIPAA, PCI DSS, and GDPR compliance. Covers control monitoring, test automation, personnel management, vendor tracking, and evidence co
Semantic-release automated versioning management covering release configuration analysis, release history tracking, commit convention compliance, plugin chain auditing, branch strategy review, and cha
Microsoft Sentinel SIEM incident management, threat hunting, analytics rules, and security operations. Covers incident triage, KQL-based threat hunting, alert rule configuration, connector health, and
Sleuth deployment tracking, DORA metrics, change failure rate analysis, and deploy health monitoring. Covers deployment frequency, lead time, MTTR, change failure rate, feature flag impact, and code c
SonarQube code quality management. Covers project analysis, quality gate status, issue tracking, code coverage metrics, technical debt analysis, and quality profile configuration. Use when managing So
Sonatype Nexus IQ software composition analysis for open-source vulnerability detection, license compliance, and policy management. Covers application scanning, component analysis, policy violations,
Mozilla SOPS encrypted secrets management, key rotation, multi-provider encryption, and file-based secret operations. Covers encrypting and decrypting files, managing encryption keys (AWS KMS, GCP KMS
Sourcegraph code intelligence platform management covering repository indexing status, code search capabilities, batch change tracking, code insights monitoring, precise code intelligence status, and
Split.io feature flag management, targeting rules, traffic allocation, experimentation, and metric tracking. Covers split definitions, treatment configurations, segment management, impression tracking
Sprinto compliance automation platform for SOC 2, ISO 27001, HIPAA, GDPR, and PCI DSS. Covers automated check monitoring, entity management, policy tracking, training compliance, and audit readiness.
Squadcast incident management, on-call scheduling, escalation policies, SLO tracking, and runbooks. Covers incident lifecycle, alert deduplication, tagging, routing rules, and postmortem generation. U
StackPath CDN management covering sites, CDN scopes, edge rules, WAF configuration, SSL certificates, and analytics. Use when managing StackPath CDN delivery, analyzing cache performance and bandwidth
Statsig feature gate management, dynamic configs, experiment analysis, and metric tracking. Covers gate configuration, rule-based targeting, holdout groups, A/B test results, pulse metrics, and layer
StatusPal status page management, service monitoring, incident communication, and maintenance scheduling. Covers status page configuration, component health tracking, subscriber notifications, inciden
Strapi headless CMS management covering content type inventory, entry counts, media library analysis, role and permission auditing, webhook monitoring, and plugin status. Use when reviewing content mo
Tenable.io vulnerability management, asset discovery, scan orchestration, and risk-based prioritization. Covers vulnerability exports, asset inventory, VPR scoring, plugin analysis, scan scheduling, a
Thoropass (formerly Laika) compliance automation and audit management platform for SOC 2, ISO 27001, HIPAA, and PCI DSS. Covers control monitoring, evidence management, audit workflows, policy trackin
Tigris globally distributed object storage management via the AWS CLI (S3-compatible) and Tigris dashboard API. Covers buckets, objects, regions, caching, and shadow buckets. Use when managing Tigris
ToolJet low-code platform management covering application inventory, datasource health, workspace organization, user management, and environment configuration. Use when reviewing internal tool setups,
Tray.io integration platform management covering workflow inventory, execution monitoring, connector health, authentication management, and solution instance tracking. Use when auditing integration wo
Tugboat Logic (now part of OneTrust) compliance automation for SOC 2, ISO 27001, and other security frameworks. Covers policy management, control monitoring, evidence collection, risk assessment, and
Turborepo monorepo build system management. Covers pipeline configuration, caching strategies, task dependencies, remote caching, package filtering, and build performance analysis. Use when managing T
Turso edge database management via the turso CLI. Covers databases, groups, locations, replicas, tokens, and usage statistics. Use when managing Turso/libSQL databases or reviewing edge replication.
Unleash feature toggle management, activation strategies, environment configuration, and usage metrics. Covers toggle lifecycle, gradual rollout strategies, constraint-based targeting, project managem
UpCloud infrastructure management via the upctl CLI and UpCloud API. Covers servers, storage, networks, load balancers, managed databases, and Kubernetes. Use when managing UpCloud resources or checki
Vanta compliance automation platform for SOC 2, ISO 27001, HIPAA, PCI DSS, and GDPR compliance monitoring. Covers test monitoring, evidence collection, vulnerability tracking, personnel compliance, an
Veracode application security testing platform for SAST, DAST, SCA, and manual penetration testing. Covers application profiles, scan results, flaw management, policy compliance, and sandbox testing.
Vultr cloud infrastructure management via the vultr-cli and Vultr API. Covers instances, bare metal, block storage, load balancers, Kubernetes, databases, and billing. Use when managing Vultr resource
Wasabi cloud storage management via the AWS CLI (S3-compatible). Covers buckets, objects, versioning, lifecycle policies, and access control. Use when managing Wasabi storage or reviewing bucket confi
Mend (formerly WhiteSource) software composition analysis for open-source security, license compliance, and automated remediation. Covers vulnerability detection, library inventory, policy violations,
Workato enterprise automation platform management covering recipe inventory, job execution history, connection health, lookup table monitoring, and workspace management. Use when auditing integration
Xata serverless database management via the xata CLI and Xata API. Covers databases, branches, tables, schema management, migrations, and search. Use when managing Xata databases or reviewing schema w
OWASP ZAP (Zed Attack Proxy) for automated dynamic application security testing and web vulnerability scanning. Covers active/passive scanning, spider crawling, alert management, scan policies, and au
Zapier workflow automation management covering Zap inventory, task usage tracking, connection health, error monitoring, and folder organization. Use when auditing Zap configurations, investigating fai
Comprehensive accessibility audit template following WCAG 2.1 guidelines. Covers automated scanning, manual testing, keyboard navigation, screen reader compatibility, color contrast analysis, and reme
Structured review template for evaluating API designs before implementation. Covers RESTful conventions, naming consistency, error handling patterns, pagination strategy, versioning, security consider
Template for managing API sunset notifications and deprecation communications. Covers consumer identification, multi-channel notification strategy, deprecation header implementation, migration trackin
Template for defining and implementing API versioning strategies. Covers versioning scheme selection, backward compatibility analysis, migration path design, client communication planning, and depreca
Template for systematically verifying backup integrity and recoverability. Covers backup inventory, restore testing in isolated environments, data integrity validation, RTO/RPO measurement, and gap an
Template for developing and validating business continuity plans for critical services. Covers business impact analysis, recovery strategy definition, team roles and communication plans, failover proc
Runbook for TLS/SSL certificate renewal across infrastructure. Covers certificate inventory, expiration tracking, CSR generation, certificate issuance, deployment to load balancers and CDNs, chain val
Template for managing infrastructure and application changes through a structured change management process. Covers change classification, risk assessment, approval workflows, implementation planning,
Structured code review template covering correctness, security, performance, maintainability, and testing. Provides a consistent review checklist, severity classification, and feedback framework to en
Template for handling customer data export requests (data portability). Covers request validation, data source identification, extraction and assembly, format standardization, secure delivery, verific
Template for assessing data pipeline quality, reliability, and data integrity. Covers schema validation, data freshness monitoring, completeness checks, anomaly detection, lineage tracking, and SLA co
Template for defining and implementing data retention policies across systems. Covers data classification, regulatory requirements mapping, retention period definition, automated lifecycle management,
Step-by-step runbook for executing controlled and emergency database failovers. Covers pre-failover health checks, replication lag verification, connection draining, promotion of standby, application
Template for planning and executing dependency updates across projects. Covers vulnerability scanning, compatibility analysis, update sequencing, testing strategy, rollback planning, and automation se
Template for reviewing event-driven architecture designs and implementations. Covers event schema validation, producer/consumer mapping, ordering guarantees, idempotency patterns, dead letter handling
Template for systematically identifying and removing stale feature flags from codebases. Covers flag inventory, staleness analysis, dependency mapping, safe removal workflow, and verification to reduc
Template for structured incident communications across all stages of an incident lifecycle. Covers initial notification, status updates, resolution announcement, and post-incident summary with audienc
Runbook for safely draining Kubernetes nodes for maintenance, upgrades, or decommissioning. Covers pod disruption budget validation, workload rescheduling, persistent volume handling, cordon and drain
Template for configuring and validating log rotation policies across services and infrastructure. Covers log volume assessment, rotation strategy selection, retention policy configuration, compression
Template for analyzing monolithic applications and planning decomposition into microservices. Covers bounded context identification, domain-driven design analysis, service boundary definition, data ow
Step-by-step guide for migrating from multiple repositories to a monorepo structure. Covers repository consolidation strategy, history preservation, build system selection, CI/CD reconfiguration, code
Runbook for diagnosing and recovering from network partition events across distributed systems. Covers partition detection, impact assessment, split-brain resolution, data reconciliation, connectivity
Template for defining, measuring, and enforcing performance budgets across web applications and services. Covers Core Web Vitals targets, bundle size limits, API latency thresholds, resource loading b
Template for conducting Privacy Impact Assessments (PIA) on new projects or system changes involving personal data. Covers data flow mapping, purpose limitation analysis, data minimization review, con
Comprehensive checklist for evaluating release readiness before deploying to production. Covers code completeness, testing validation, documentation updates, monitoring readiness, rollback procedures,
Template for planning and executing the deprecation and decommissioning of services. Covers consumer impact analysis, migration path definition, communication timeline, traffic monitoring, graceful sh
Structured plan for migrating data between storage systems, volumes, or tiers. Covers capacity planning, data transfer strategy, performance benchmarking, cutover coordination, and validation to ensur
Template for systematically identifying, categorizing, and prioritizing technical debt across a codebase or system. Covers code quality metrics, architecture smells, dependency risks, test coverage ga
Template for evaluating third-party vendor risk before onboarding or during periodic reviews. Covers security posture assessment, data handling practices, compliance certifications, business continuit
Structured runbook for executing zero-downtime data and service migrations. Covers pre-migration validation, dual-write setup, incremental data sync, cutover orchestration, and rollback procedures to
Aerospike cluster management, namespace utilization, set analysis, and secondary index monitoring. Covers node health, storage engine statistics, migration tracking, UDF management, and XDR cross-data
Algolia index management, search relevance tuning, analytics review, and configuration optimization. Covers index settings, ranking criteria, faceting configuration, synonyms, rules, API key managemen
Allure TestOps test management and analytics platform monitoring. Covers project dashboards, launch tracking, test case management, defect clustering, environment matrix analysis, and CI/CD integratio
Amplitude analytics management — monitor event ingestion, user activity, chart dashboards, cohorts, and data taxonomy. Use when reviewing event volumes, inspecting user funnels, auditing taxonomy heal
Apache Druid deep management — monitor cluster health, datasources, ingestion tasks, segments, query performance, and coordinator/overlord status. Use when debugging ingestion failures, inspecting seg
Apache Geode distributed cache management, region inspection, member health monitoring, and WAN gateway analysis. Covers locator status, server groups, partition distribution, persistent disk store he
Apache HBase cluster management, region server health monitoring, table analysis, and compaction management. Covers RegionServer load balancing, HDFS integration health, WAL status, namespace inspecti
Apache Kudu cluster management, tablet server health monitoring, table partition analysis, and replication diagnostics. Covers master health, tablet distribution, column encoding efficiency, compactio
Apache Pinot management — monitor cluster health, tables, segments, ingestion jobs, query performance, and controller/broker/server status. Use when debugging ingestion issues, inspecting segment dist
Appium mobile testing management and server monitoring. Covers Appium server health, session management, desired capabilities review, device farm integration, test result analysis, and driver configur
AWS CDK application and stack management. Covers stack synthesis, CloudFormation template inspection, diff analysis, deployment orchestration, context values, asset management, and construct library u
AWS Config deep rule and compliance management. Covers Config rules, conformance packs, aggregators, advanced queries, remediation actions, configuration history, and multi-account compliance dashboar
AWS Fargate task and service analysis covering ECS cluster inventory, Fargate task status, CPU and memory utilization, task definition review, networking configuration, service auto-scaling policies,
Deep AWS Lambda analysis covering function inventory, cold start profiling, memory optimization, concurrency patterns, layer dependency auditing, dead letter queue health, event source mappings, and c
AWS Organizations deep management. Covers organizational unit hierarchy, account management, service control policies (SCPs), tag policies, backup policies, delegated administrators, and organizationa
AWS Proton service and environment template management. Covers environment templates, service templates, service instances, pipeline management, component inspection, and template sync configuration.
AWS Serverless Application Model (SAM) management. Covers template validation, local testing, build and packaging, deployment orchestration, stack inspection, log tailing, and API Gateway debugging. U
AWS Service Catalog portfolio and product management. Covers portfolio administration, product versioning, provisioned product inspection, launch constraints, tag options, and sharing across accounts.
AWS IAM Identity Center (SSO) management. Covers SSO instance configuration, permission sets, account assignments, user and group provisioning, session policies, and access auditing. Use when managing
Deep Azure Boards management covering work items, sprints, backlogs, queries, and team velocity analytics. Use when performing deep audits of Azure DevOps Boards, analyzing sprint health, reviewing ba
Azure Container Instances management covering container group inventory, container status and restart counts, CPU and memory utilization, networking configuration, log retrieval, and GPU allocation tr
Deep Azure Functions analysis covering function app inventory, trigger types, execution metrics, scaling behavior, slot deployments, Application Insights integration, and consumption plan cost trackin
Azure Lighthouse delegated resource management. Covers delegation assignments, managed service offers, customer tenant access, cross-tenant resource visibility, role assignments, and delegation auditi
Azure Management Groups hierarchy and governance management. Covers management group tree structure, subscription placement, policy assignments, RBAC inheritance, compliance status, and governance aud
Azure Policy deep compliance and governance management. Covers policy definitions, initiatives, assignments, compliance evaluation, exemptions, remediation tasks, custom policies, and regulatory compl
Azure Resource Manager (ARM) template and deployment management. Covers template validation, deployment operations, resource group inspection, what-if analysis, deployment history, and template spec m
Basecamp project management covering projects, to-do lists, messages, schedules, and team activity. Use when auditing Basecamp usage, managing to-dos and message boards, analyzing project activity, or
Baselime serverless observability platform for AWS Lambda, Cloudflare Workers, and Vercel functions. Covers log and trace querying, alert management, service discovery, and performance analysis of ser
BrowserStack cloud testing platform monitoring and analysis. Covers Automate session management, live testing status, App Automate tracking, build and session analysis, device/browser usage metrics, a
CapRover self-hosted PaaS management covering app inventory, container status, custom domain mappings, persistent directory mounts, environment variable auditing, cluster node status, Docker registry
CDK for Terraform (CDKTF) project and stack management. Covers project synthesis, provider generation, stack deployment, output inspection, and diff analysis using TypeScript, Python, Go, Java, or C#.
Census reverse ETL management — monitor syncs, destinations, models, source connections, and sync run health. Use when debugging sync failures, inspecting record delivery, auditing destination mapping
Checkmk infrastructure and application monitoring platform for hosts, services, network devices, and cloud resources. Covers host/service status, event console, rule management, agent deployment, and
Chroma vector database management, collection inspection, embedding analysis, and query performance monitoring. Covers collection metadata, document counts, distance metrics, index health, and tenant/
Chronosphere cloud-native observability platform for metrics, tracing, and alerting at scale. Covers PromQL-based metric queries, trace analysis, monitor management, dashboards, and control plane conf
Deep analysis of Cloudflare Workers deployments including script inventory, route mappings, KV namespace usage, Durable Objects, cron triggers, CPU time metrics, and error rate tracking. Use for compr
AWS CloudFormation deep stack management. Covers stack operations, change sets, drift detection, nested stacks, stack sets, resource imports, template analysis, and rollback troubleshooting. Use when
Coda document management covering docs, pages, tables, formulas, and workspace analytics. Use when auditing Coda workspace usage, managing documents and tables, querying structured data, or analyzing
Codeium (Windsurf) AI coding assistant management and analytics. Covers team seat management, usage tracking, language analytics, editor adoption metrics, security policy configuration, and enterprise
Deep Confluence management covering spaces, pages, content analytics, permissions, and workspace health. Use when performing deep audits of Confluence usage, analyzing content freshness, reviewing spa
Consul KV store management, key namespace analysis, session/lock inspection, and watch configuration. Covers KV tree structure, key count by prefix, session health, prepared queries, and transaction s
Coolify self-hosted PaaS management covering application inventory, service status, deployment history, server resource utilization, database instances, S3 storage configurations, webhook settings, an
Coralogix observability platform for log analytics, metrics, tracing, security, and alerting. Covers log querying with DataPrime, metric exploration, alert management, parsing rules, and data usage in
Cube.dev semantic layer management — monitor deployments, data models, pre-aggregations, query performance, and API health. Use when inspecting cube schemas, debugging slow queries, reviewing pre-aggr
Cucumber BDD testing management and analysis. Covers feature file organization, step definition mapping, scenario tagging, Gherkin syntax validation, test report parsing, and step reuse analysis. Use
Cursor AI-powered editor management and team analytics. Covers team seat management, usage tracking, model configuration, rules file analysis, project settings review, and workspace configuration. Use
Cypress end-to-end testing management and analysis. Covers test execution monitoring, dashboard integration, screenshot and video analysis, test report parsing, flaky test detection, and configuration
dbt Cloud management — monitor projects, environments, job runs, model status, and data freshness. Use when reviewing job run health, debugging failed models, inspecting run artifacts, or auditing pro
Deep Deno Deploy analysis covering project inventory, deployment history, KV database usage, cron job schedules, custom domain mappings, analytics metrics, and environment variable auditing. Use for c
Detox React Native testing management and analysis. Covers test configuration review, device and simulator management, build configuration analysis, test result parsing, artifact collection, and test
Deep Discord server management covering guilds, channels, members, roles, and message analytics. Use when auditing Discord server health, analyzing community engagement, managing channels and roles, o
Dokku self-hosted PaaS management covering app inventory, container status, domain configuration, plugin listing, persistent storage mounts, environment variable auditing, proxy settings, and deployme
Dragonfly in-memory datastore management, multi-threaded performance analysis, memory efficiency monitoring, and compatibility checks. Covers thread utilization, snapshot operations, memory usage per
DuckDB database management, query performance analysis, extension management, and storage optimization. Covers database file inspection, table statistics, memory configuration, Parquet/CSV import heal
Dynatrace software intelligence platform for full-stack APM, infrastructure monitoring, AIOps, log management, and digital experience monitoring. Covers entity discovery, metric querying with DQL, pro
Elastic APM application performance monitoring for distributed tracing, error tracking, metrics collection, and service map visualization within the Elastic Stack. Covers service discovery, transactio
Deep Elasticsearch cluster management including shard allocation, index lifecycle policies, snapshot/restore operations, mapping optimization, and advanced query tuning. Covers node roles, circuit bre
Encore cloud application platform management. Covers service architecture, API inspection, infrastructure provisioning, environment management, deployment history, and local development. Use when buil
etcd cluster management, member health monitoring, key-space analysis, and Raft consensus diagnostics. Covers endpoint health, leader election status, alarm states, compaction/defragmentation, snapsho
Fastly Compute@Edge service management covering service inventory, package deployments, backend health, domain mappings, VCL/Wasm configuration, real-time analytics, cache hit ratios, and edge diction
Deep Fly.io analysis covering app inventory, machine status, volume management, autoscaling configuration, health check results, certificate status, secrets auditing, and region distribution. Use for
FoundationDB cluster management, layer inspection, transaction rate monitoring, and storage engine analysis. Covers cluster health via fdbcli status, process roles, coordination state, backup/restore
Microsoft Garnet cache-store management, RESP protocol compatibility analysis, performance benchmarking, and cluster configuration. Covers server health, memory usage, storage tier inspection, checkpo
Deep Google Cloud Functions analysis covering function inventory across generations (1st and 2nd gen), trigger configurations, execution metrics, memory and CPU allocation, VPC connector usage, secret
Deep Google Cloud Run analysis covering service inventory, revision management, traffic splitting, autoscaling configuration, request latency metrics, container resource limits, domain mappings, and V
GCP Deployment Manager template and deployment management. Covers deployment creation, template validation, resource inspection, manifest review, deployment updates, and type provider management. Use
GCP Organization Policy Service management. Covers org policy constraints, policy evaluation, custom constraints, dry-run policies, exception management, and compliance auditing. Use when managing GCP
GCP Policy Intelligence and IAM recommender management. Covers IAM recommender insights, policy analyzer, policy troubleshooter, access approval settings, and security health analytics. Use when analy
GCP Resource Manager project, folder, and organization management. Covers organization hierarchy, folder structure, project management, IAM policy bindings, org-level constraints, and resource labels.
GitHub Copilot management and usage analytics. Covers organization seat management, usage metrics tracking, policy configuration, content exclusion rules, suggestion acceptance analysis, and billing r
Deep GitHub Issues management covering issue tracking, label analytics, milestone progress, assignee workload, and issue lifecycle metrics. Use when performing deep audits of GitHub Issues, analyzing
GitHub Projects (v2) management covering project boards, views, items, custom fields, and workflow analytics. Use when auditing GitHub Projects usage, managing project items and fields, analyzing proj
Deep GitLab Issues management covering issue tracking, label analytics, milestone progress, assignee workload, and issue lifecycle metrics. Use when performing deep audits of GitLab Issues, analyzing
Gitpod cloud development environment management and monitoring. Covers workspace lifecycle management, organization settings, usage tracking, prebuild configuration, environment class selection, and .
Google Analytics 4 management — monitor property configuration, event streams, user metrics, conversions, and audience data. Use when reviewing traffic trends, inspecting event setup, auditing convers
Google Meet management covering meeting spaces, calendar events with conferencing, and participant analytics. Use when auditing Google Meet usage, managing meeting rooms, retrieving meeting history, o
Graylog centralized log management platform for log collection, search, dashboards, alerting, and pipeline processing. Covers log querying with Lucene syntax, stream management, alert condition config
Groundcover eBPF-based Kubernetes observability platform for APM, infrastructure monitoring, log management, and network analysis without code instrumentation. Covers service map discovery, golden sig
Grouparoo reverse ETL management — monitor apps, sources, destinations, groups, properties, and sync runs. Use when debugging data sync issues, inspecting group membership, auditing property mappings,
Hazelcast cluster management, distributed data structure inspection, partition health monitoring, and near-cache analysis. Covers member discovery, map/cache statistics, WAN replication status, CP sub
Heap analytics management — monitor auto-captured events, defined events, user segments, and data health. Use when inspecting autocapture coverage, reviewing defined events, debugging tracking gaps, o
Height.app project management covering tasks, lists, workspaces, and team activity analytics. Use when auditing Height workspace usage, managing tasks and lists, analyzing team workload, or reviewing
Heroku platform management covering app inventory, dyno formation and scaling, add-on usage, release history, config var auditing, domain and SSL status, log drain configuration, and metrics analysis.
Highlight.io full-stack observability platform for session replay, error monitoring, log management, and tracing. Covers error tracking, log querying, session analysis, trace investigation, and alert
Hightouch reverse ETL management — monitor syncs, destinations, models, sources, and sync run health. Use when debugging sync failures, inspecting record delivery, auditing field mappings, or reviewin
Honeycomb observability platform for distributed tracing, event-driven analytics, BubbleUp root cause analysis, SLOs, and triggers. Covers querying datasets, analyzing trace spans, investigating laten
Hypertrace distributed tracing and observability platform for service dependency mapping, trace analysis, API monitoring, and performance analysis. Covers service discovery, endpoint performance, trac
Icinga infrastructure monitoring platform for host and service monitoring, cluster management, alerting, and performance data analysis. Covers host/service status, check result review, notification ma
Jest testing framework management and analysis. Covers test suite configuration, coverage reporting, snapshot testing review, test result parsing, watch mode configuration, and module mocking analysis
Deep Jira project management covering projects, boards, sprints, issue analytics, velocity tracking, and workflow health. Use when performing deep audits of Jira usage, analyzing sprint velocity, revi
JUnit testing framework management and analysis. Covers test suite configuration, XML report parsing, test runner integration, assertion pattern review, parameterized test analysis, and build tool int
Kanboard project management covering projects, tasks, columns, swimlanes, and productivity analytics. Use when auditing Kanboard usage, managing tasks and projects, analyzing workflow throughput, or r
KeyDB multi-threaded Redis-compatible datastore management, active replication monitoring, FLASH storage tier analysis, and sub-key expiration inspection. Covers thread configuration, multi-master rep
Klotho cloud compiler and infrastructure generation. Covers application compilation, cloud target configuration, generated infrastructure inspection, deployment orchestration, and topology visualizati
LambdaTest cloud testing platform monitoring and analysis. Covers Selenium/Cypress test session management, tunnel monitoring, build tracking, screenshot testing, real-time testing status, and usage a
Last9 observability platform for high-cardinality metrics, distributed tracing, log management, and SLO-based reliability management. Covers metric exploration, trace analysis, log querying, SLO track
LibreNMS network monitoring platform for network devices, servers, SNMP monitoring, alerting, and performance graphing. Covers device discovery, port monitoring, alert management, health sensor data,
Lightdash BI management — monitor projects, dashboards, saved charts, spaces, and dbt model exploration. Use when reviewing dashboard health, inspecting chart queries, auditing project configuration,
Lightstep (now ServiceNow Cloud Observability) platform for distributed tracing, service health monitoring, change intelligence, and SLOs. Covers querying traces, analyzing service performance, review
LogDNA (now Mezmo) log management platform for log aggregation, search, alerting, and analysis. Covers log querying, view management, alert configuration, and usage monitoring. Use when searching LogD
Loggly cloud log management platform for log search, analysis, alerting, and dashboards. Covers log querying with Lucene syntax, field exploration, alert management, and usage analysis. Use when searc
Loom video management covering video library, folders, sharing settings, and engagement analytics. Use when auditing Loom video usage, managing recordings, analyzing viewer engagement, or organizing v
ManageEngine OpManager network monitoring platform for routers, switches, firewalls, servers, and virtual infrastructure. Covers device discovery, interface monitoring, alert management, performance d
Materialize streaming database management — monitor clusters, sources, sinks, materialized views, indexes, and dataflow health. Use when debugging ingestion lag, inspecting view dependencies, auditing
Matomo analytics management — monitor site traffic, visitor behavior, goals, segments, and reporting. Use when reviewing web analytics, inspecting referrer data, checking goal completions, or auditing
Meilisearch index management, search performance analysis, ranking rule optimization, and filterable/sortable attribute configuration. Covers index health, task queues, document counts, typo tolerance
Memcached instance monitoring, slab allocation analysis, hit ratio optimization, and connection management. Covers memory utilization, eviction rates, item distribution across slabs, connection pool h
Mezmo (formerly LogDNA) observability pipeline and log management platform for log ingestion, routing, processing, and analysis. Covers log querying, pipeline management, view configuration, alerting,
Middleware.io full-stack observability platform for infrastructure monitoring, APM, log management, synthetic monitoring, and Kubernetes observability. Covers host metrics, application traces, log ana
Milvus vector database management, collection schema inspection, index build monitoring, and query node health analysis. Covers partition management, segment compaction, resource group allocation, rep
Mixpanel analytics management — monitor event ingestion, user profiles, funnels, retention, and data governance. Use when reviewing event volumes, debugging tracking issues, inspecting user properties
Mode Analytics management — monitor workspaces, reports, queries, data sources, and scheduled runs. Use when reviewing report health, inspecting query performance, auditing data connections, or checki
mParticle CDP management — monitor inputs, outputs, data plans, event forwarding, and audience health. Use when debugging event delivery, reviewing data quality rules, auditing integrations, or inspec
Nagios infrastructure monitoring platform for host and service monitoring, alerting, event handling, and availability reporting. Covers host status review, service check results, notification manageme
Netdata real-time infrastructure monitoring platform for system metrics, application performance, container monitoring, and alerting. Covers CPU, memory, disk, network metrics, active alarms, chart ex
Deep Netlify platform analysis covering site inventory, deploy history, serverless function logs, edge function status, form submissions, bandwidth usage, build plugin configurations, and DNS/domain s
Nitric cloud application framework management. Covers service definition, local development, deployment to AWS/Azure/GCP, resource inspection, stack management, and provider configuration. Use when bu
Observium network monitoring platform for auto-discovery of network devices, SNMP-based monitoring, traffic analysis, and alerting. Covers device inventory, port utilization, health sensors, alert rev
Obsidian Publish site management covering published pages, site configuration, navigation, and content health. Use when auditing an Obsidian Publish site, checking published content status, reviewing
OpenSearch cluster management including index operations, ISM policies, snapshot management, security analytics, and performance tuning. Covers cluster health, shard allocation, anomaly detection, ale
Outerbase database interface management covering workspace inventory, connected database sources, table schemas, query history, saved queries, API endpoint generation, dashboard configurations, and us
Papertrail cloud-hosted log management for log aggregation, search, tail, alerting, and archiving. Covers log searching, system and group management, saved search alerts, and log volume analysis. Use
Pinecone vector database management, index health monitoring, namespace analysis, and query performance tuning. Covers index statistics, dimension configuration, pod utilization, collection management
Pivotal Tracker project management covering projects, stories, epics, iterations, and velocity analytics. Use when auditing Pivotal Tracker usage, managing stories and epics, analyzing iteration veloc
Plane project management covering workspaces, projects, issues, cycles, and modules. Use when auditing Plane workspace usage, managing issues and cycles, analyzing project health, or reviewing team wo
Plausible Analytics management — monitor site traffic, page views, referral sources, goals, and visitor metrics. Use when reviewing website analytics, inspecting traffic sources, checking goal convers
Playwright end-to-end testing management and analysis. Covers test execution monitoring, trace analysis, browser context configuration, test report parsing, flaky test detection, and parallel executio
Polytomic data integration management — monitor connections, syncs, models, bulk syncs, and execution health. Use when debugging sync failures, inspecting data mappings, auditing connection status, or
Portainer container management platform analysis covering environment endpoints, stack inventory, container status, image management, volume and network inspection, user and team access, and resource
PostHog analytics management — monitor events, feature flags, experiments, session recordings, and ingestion health. Use when reviewing event volumes, managing feature flags, debugging tracking, or in
Power BI service management — monitor workspaces, datasets, reports, refresh schedules, and capacity utilization. Use when inspecting dashboard health, debugging refresh failures, auditing workspace p
PractiTest test management platform monitoring and analysis. Covers project organization, test library management, test set and run execution, requirement traceability, issue tracking, custom field an
Preset.io (managed Apache Superset) management — monitor workspaces, dashboards, charts, datasets, and database connections. Use when reviewing dashboard health, inspecting query performance, auditing
PRTG Network Monitor platform for monitoring network devices, bandwidth, servers, applications, and cloud services. Covers sensor status, device tree management, alert review, historic data analysis,
Pulumi Cloud stack and deployment management. Covers stack operations, resource inspection, configuration and secrets, deployment history, policy packs (CrossGuard), and team access controls. Use when
Puppeteer browser automation management and analysis. Covers test suite configuration, browser launch settings, page performance metrics, screenshot and PDF generation review, network interception ana
Pytest testing framework management and analysis. Covers test discovery, fixture analysis, coverage reporting, marker-based filtering, parametrized test review, plugin management, and conftest configu
Qase test management platform monitoring and analysis. Covers project organization, test case management, test run execution tracking, defect linking, shared step libraries, and environment configurat
Qdrant vector database management, collection health monitoring, shard distribution analysis, and query optimization. Covers collection configuration, HNSW index parameters, quantization settings, sna
QuestDB time-series database management — monitor tables, partitions, ingestion throughput, query performance, and server health. Use when inspecting table schemas, debugging slow queries, reviewing W
Broadcom Rally (formerly CA Agile Central) project management covering workspaces, projects, user stories, defects, iterations, and velocity analytics. Use when auditing Rally usage, managing stories
Advanced Redis management including cluster topology, sentinel failover, streams analysis, Lua script auditing, module inspection, and memory defragmentation. Covers RDB/AOF persistence tuning, pub/su
RisingWave streaming database management — monitor sources, sinks, materialized views, clusters, query performance, and ingestion health. Use when debugging stream processing lag, inspecting view depe
Robot Framework test automation management and analysis. Covers test suite organization, keyword library management, variable file configuration, output XML parsing, log and report analysis, and resou
RudderStack CDP management — monitor sources, destinations, event delivery, transformations, and pipeline health. Use when debugging event routing, inspecting transformation logic, auditing data flows
Sauce Labs cloud testing platform monitoring and analysis. Covers test job management, tunnel monitoring, real device testing, build tracking, concurrency analysis, and usage metrics. Use when managin
ScyllaDB cluster management, shard-per-core analysis, compaction strategy tuning, and CQL performance diagnostics. Covers node health, tablet distribution, workload prioritization, repair scheduling,
Segment CDP management — monitor sources, destinations, event delivery, tracking plans, and data quality. Use when inspecting event flow, debugging destination failures, auditing tracking plan violati
Selenium WebDriver testing management and grid monitoring. Covers Selenium Grid node health, session management, browser capability analysis, test result parsing, hub configuration review, and WebDriv
Seq structured log server for centralized log collection, querying, dashboards, alerting, and retention management. Covers log searching with Seq query language, signal management, alert configuration
Serverless Framework service management. Covers service deployment, function invocation, log streaming, plugin management, stage/region configuration, CloudFormation stack inspection, and offline loca
Shortcut (formerly Clubhouse) project management covering stories, epics, iterations, workflows, and team analytics. Use when auditing Shortcut usage, managing stories and epics, analyzing iteration v
Sigma Computing management — monitor workbooks, datasets, connections, materialization schedules, and user activity. Use when reviewing workbook health, inspecting data connections, auditing permissio
SigNoz open-source observability platform for metrics, traces, and logs with OpenTelemetry-native ingestion. Covers service performance monitoring, trace analysis, log querying, dashboard management,
Advanced SingleStore (MemSQL) cluster management, pipeline monitoring, columnstore analysis, and workload profiling. Covers aggregator/leaf health, partition distribution, resource pools, pipeline lag
Site24x7 cloud monitoring platform for websites, servers, networks, applications, and cloud infrastructure. Covers monitor status, alert management, performance metrics, SLA reporting, and threshold c
Apache SkyWalking application performance monitoring platform for distributed tracing, service mesh observability, metric aggregation, and log analysis. Covers service topology, endpoint performance,
Deep Slack workspace management covering channels, messages, users, reactions, and workspace analytics. Use when auditing Slack usage, analyzing communication patterns, managing channels, or retrievin
Smartsheet workspace management covering sheets, reports, dashboards, rows, and collaboration analytics. Use when auditing Smartsheet usage, managing sheets and rows, analyzing project timelines, or r
Snowplow behavioral data pipeline management — monitor collector endpoints, enrichment processes, pipeline health, schema registry, failed events, and data quality. Use when diagnosing event delivery
Apache Solr collection management, core administration, query performance tuning, and schema analysis. Covers SolrCloud cluster health, shard states, replica placement, config sets, commit strategies,
SST (Serverless Stack) application management. Covers stack deployment, live Lambda debugging, console access, resource binding, secret management, and multi-stage environments. Use when building full
StarRocks management — monitor cluster health, databases, tables, materialized views, query performance, and load jobs. Use when inspecting schema layout, debugging slow queries, reviewing compaction
Tableau Server/Cloud management — monitor sites, workbooks, data sources, extract refresh jobs, and user activity. Use when reviewing dashboard health, inspecting failed extracts, auditing permissions
Tabnine AI coding assistant management and analytics. Covers team seat management, usage tracking, model configuration, privacy settings, code completions analytics, and enterprise deployment monitori
Taiga agile project management covering projects, user stories, tasks, sprints, epics, and kanban boards. Use when auditing Taiga project health, managing sprints and user stories, analyzing velocity,
Deep Microsoft Teams management covering teams, channels, messages, memberships, and activity analytics. Use when auditing Teams usage, analyzing collaboration patterns, managing team structures, or r
Teamwork project management covering projects, task lists, milestones, time tracking, and team workload. Use when auditing Teamwork usage, managing tasks and milestones, analyzing time entries, or rev
Terraform Cloud workspace and run management. Covers workspace configuration, run triggers, variable sets, policy checks (Sentinel/OPA), VCS integration, state management, and team access controls. Us
Terraform Enterprise administration and workspace management. Covers TFE installation health, admin settings, workspace operations, Sentinel policies, cost estimation, private module registry, and aud
TestCafe end-to-end testing management and analysis. Covers test execution monitoring, browser provider configuration, concurrent test analysis, report parsing, fixture organization, and selector debu
Testmo test management platform monitoring and analysis. Covers project organization, test case and suite management, automation run tracking, exploratory session review, milestone progress, and field
TestNG testing framework management and analysis. Covers test suite XML configuration, group-based execution, data provider analysis, parallel execution settings, report parsing, and listener configur
TestRail test case management and reporting. Covers project and suite organization, test case review, test run and plan management, milestone tracking, result analysis, and user activity monitoring. U
TimescaleDB Cloud management — monitor services, hypertables, continuous aggregates, compression, retention policies, and query performance. Use when inspecting chunk distribution, debugging slow quer
Trello board management covering boards, lists, cards, members, labels, and activity analytics. Use when auditing Trello usage, managing boards and cards, analyzing workflow bottlenecks, or reviewing
Typesense collection management, search tuning, schema analysis, and cluster health monitoring. Covers collection schemas, search analytics, synonym management, override rules, API key scoping, and cu
Valkey (Redis fork) instance management, cluster health monitoring, memory analysis, and performance tuning. Covers keyspace inspection, replication lag, slow log analysis, client connections, and mod
Deep Vercel platform analysis covering project inventory, deployment history, serverless and edge function metrics, domain configurations, environment variable auditing, build performance, bandwidth u
Vitest testing framework management and analysis. Covers test suite configuration, coverage reporting, snapshot management, benchmark analysis, workspace setup review, and Vite-native test features. U
VoltDB cluster management, stored procedure performance analysis, partition tuning, and export stream monitoring. Covers cluster topology, DR replication, snapshot schedules, command log health, and l
Watchtower container update automation analysis covering monitored container inventory, update schedules, notification configuration, update history, container labeling for inclusion and exclusion, an
Weaviate vector database management, schema inspection, module health monitoring, and query performance analysis. Covers class definitions, shard distribution, vectorizer configuration, multi-tenancy
WebdriverIO test automation management and analysis. Covers test suite configuration, service integration, browser and mobile testing, Allure report parsing, spec file organization, and WDIO runner an
Cisco Webex management covering meetings, rooms, memberships, messages, and usage analytics. Use when auditing Webex usage, managing spaces and meetings, retrieving message history, or analyzing colla
Wing cloud-oriented programming language management. Covers compilation, testing, simulation, deployment to cloud targets, resource inspection, and console usage. Use when developing Wing applications
Wrike project management covering folders, projects, tasks, timelog entries, and workflow analytics. Use when auditing Wrike usage, managing tasks and projects, analyzing time tracking, or reviewing w
Xray (test management for Jira) monitoring and analysis. Covers test case and precondition management, test plan and execution tracking, Cucumber/Gherkin integration, test set organization, requirement
JetBrains YouTrack issue tracking covering projects, issues, agile boards, sprints, and workflow analytics. Use when auditing YouTrack usage, managing issues and agile boards, analyzing sprint progres
YugabyteDB cluster management, tablet distribution analysis, YSQL/YCQL performance tuning, and replication monitoring. Covers master/tserver health, tablet leader balancing, xCluster replication, CDC
Zabbix enterprise monitoring platform for infrastructure, networks, servers, cloud resources, and applications. Covers host management, trigger and problem review, item data querying, template managem
ZenHub project management covering workspaces, boards, epics, sprints, and velocity analytics layered on GitHub Issues. Use when auditing ZenHub usage, analyzing board pipelines, reviewing sprint prog
Zephyr test management (Scale/Squad) monitoring and analysis. Covers test case management within Jira, test cycle execution tracking, folder organization, execution status reporting, traceability matr
Zoom meeting and webinar management covering scheduled meetings, recordings, users, and usage analytics. Use when auditing Zoom usage, managing meetings, retrieving recordings, or analyzing meeting pa
Structures an Architecture Decision Record (ADR) to capture the context, decision, and consequences of significant architectural choices. This template ensures decisions are documented with sufficient
Analyzes alerting configurations to identify noise, redundancy, and low-value alerts that contribute to on-call fatigue. This template walks teams through auditing their alert rules, measuring signal-
Provides a structured review checklist for evaluating API contracts including REST, gRPC, and GraphQL endpoints. This template covers naming conventions, versioning, error handling, pagination, authen
Reviews and designs caching strategies for services, evaluating cache placement, invalidation approaches, consistency trade-offs, and capacity planning. This template helps teams make informed decisio
Provides a structured process for scanning container images and runtime environments for security vulnerabilities, misconfigurations, and compliance violations. This template covers image scanning, Do
Guides the creation of comprehensive data flow diagrams (DFDs) that map how data moves through a system, identifying sources, destinations, transformations, and storage points. This template supports
Guides teams through a structured review of their error budget consumption against defined SLOs. This template helps SRE teams assess reliability performance, identify budget burn patterns, and make d
Generates a detailed vulnerability report for container images across a registry or set of repositories. This template aggregates scan results, tracks remediation progress, and provides executive-leve
Detects and catalogs infrastructure drift between the desired state defined in Infrastructure as Code and the actual state of deployed resources. This template guides teams through drift detection, im
Reviews the design of message queue and event streaming architectures, covering queue topology, consumer design, ordering guarantees, dead letter handling, and capacity planning. This template helps t
Produces a comprehensive reliability scorecard for a service or system, evaluating it across multiple dimensions including availability, latency, durability, and operational readiness. Use this templa
Structures a Request for Comments (RFC) document for proposing significant technical changes that require cross-team input and approval. This template guides authors through problem definition, propos
Evaluates existing runbooks for automation potential, assessing each procedure's complexity, frequency, and risk to determine which runbooks should be automated first. This template produces a priorit
Reviews database schema migrations for safety, performance impact, and backward compatibility. This template ensures migrations are evaluated for lock contention, data integrity, rollback capability,
Audits the rotation status of all secrets, credentials, API keys, and certificates across services and environments. This template identifies stale secrets, missing rotation policies, and non-complian
Performs a comprehensive health check of a service mesh deployment, evaluating control plane status, data plane proxy health, mTLS coverage, traffic policies, and observability configuration. Use this
Provides a structured template for creating comprehensive system design documents that cover requirements, architecture decisions, component design, data models, API contracts, and operational conside
Guides teams through a structured threat modeling exercise using the STRIDE methodology. This template helps identify threats, assess risks, and define mitigations for a system or feature, producing a
Systematically identifies, quantifies, and prioritizes toil within engineering teams. This template guides SRE and operations teams through cataloging repetitive manual work, measuring its impact on p
Appwrite backend platform management covering project inventory, database collections, user authentication stats, storage bucket usage, function deployments, webhook configuration, and platform health
Banana.dev ML inference platform management covering model inventory, deployment status, API call history, GPU allocation, scaling configuration, build logs, and latency metrics. Use for comprehensive
Baseten ML deployment platform management covering model inventory, deployment status, autoscaling configuration, inference call history, GPU allocation, environment management, and performance metric
BigPanda AIOps platform management covering event correlation, root cause analysis, incident management, alert enrichment, topology-based correlation, environment management, and analytics. Use when c
Bitbucket pull request review management including default reviewers, merge checks, branch permissions, and build status integration. Use when configuring or automating Bitbucket PR review workflows a
BMC Helix ITSM platform management covering incident management, problem management, change management, and CMDB operations. Use when creating or updating incidents with categorization and assignment,
Cloud Custodian policy engine management. Covers policy authoring, dry-run execution, resource filtering, action enforcement, multi-cloud support (AWS/Azure/GCP), Lambda deployment, compliance reporti
CodeBall AI-powered pull request risk assessment including automatic PR quality scoring, risk classification, and auto-approval for low-risk changes. Use when configuring CodeBall for automated PR tri
CodeRabbit AI code review management including automated review comments, auto-suggestions, review customization, and learning from feedback. Use when configuring CodeRabbit for AI-assisted code revie
CodeScene behavioral code analysis including code health scoring, hotspot detection, change coupling analysis, and PR integration for code quality gates. Use when configuring CodeScene for codebase vi
Convex backend platform management covering project inventory, table schemas, function deployments, scheduled job status, index configuration, environment variable auditing, and usage metrics. Use for
CoreWeave GPU cloud management covering Kubernetes namespace inventory, GPU workload status, virtual server instances, persistent volume claims, node allocation, billing analysis, and network configur
Atlassian Crucible code review management including review creation, commenting workflows, approval processes, and Jira integration. Use when managing Crucible review workflows, configuring review tem
Device42 IT infrastructure management covering data center infrastructure management (DCIM), application dependency mapping, IP address management, and asset lifecycle tracking. Use when documenting d
Advanced env0 environment management platform. Covers environment lifecycle tracking, template configuration analysis, cost monitoring integration, custom flow management, policy enforcement, variable
FlashDuty alerting and incident management covering alert routing, escalation policy configuration, on-call scheduling, collaboration channels, duty management, and integration setup. Use when configu
Freshservice ITSM platform management covering ticket lifecycle, asset management, change requests, release management, and CMDB. Use when creating or triaging support tickets, tracking IT assets and
Gerrit code review system management including change review workflows, label configuration, group permissions, and submit rules (including Prolog-based rules). Use when managing Gerrit review policies, c
GitHub pull request review management including review assignments, CODEOWNERS enforcement, branch protection rules, required reviewers, review dismissal policies, and status check integration. Use wh
GitLab merge request review management including approval rules, merge checks, code ownership via CODEOWNERS, merge request policies, and pipeline integration. Use when configuring or automating GitLa
gitStream PR automation management including routing rules, merge policies, auto-labeling, and reviewer assignment based on code changes. Use when configuring gitStream continuous merge automation, de
GLPI IT asset and service management covering ticket handling, hardware and software inventory, CMDB configuration items, and IT budget tracking. Use when managing helpdesk tickets with categorization
Grafana OnCall management covering on-call schedule configuration, escalation chain design, integration setup with monitoring tools, alert group management, notification routing, and shift override ma
HaloITSM platform management covering ticket lifecycle, SLA policy configuration, knowledge base article management, and asset tracking. Use when creating and managing support tickets with custom work
iLert alerting and incident management covering alert source configuration, on-call scheduling, escalation policies, status page management, heartbeat monitoring, uptime tracking, and notification cha
Advanced Infracost cost estimation and FinOps management. Covers multi-project cost breakdown, diff analysis between branches, CI/CD integration inspection, policy enforcement with OPA/Sentinel, usage
Ivanti Neurons for ITSM covering service request management, IT asset management, workflow automation, and self-service portal configuration. Use when creating and routing service requests, tracking I
Jira Service Management (JSM) for IT service desk operations including request queues, SLA management, automation rules, knowledge base integration, and customer portal management. Use when managing s
Lambda Labs GPU cloud management covering instance inventory, GPU type availability, SSH key management, filesystem status, instance pricing, and capacity analysis. Use for comprehensive Lambda Labs w
Lansweeper IT asset discovery and inventory management covering network scanning, hardware and software inventory, compliance reporting, and vulnerability assessment. Use when discovering devices acro
ManageEngine ServiceDesk Plus management covering incident handling, IT asset tracking, CMDB configuration, and reporting. Use when creating and managing incidents with SLA tracking, cataloging hardwa
Modal serverless compute management covering app inventory, function deployments, container status, volume mounts, secret management, scheduled job analysis, GPU utilization, and usage metrics. Use fo
Moogsoft AI-driven incident management covering intelligent alert detection, noise reduction, correlation engine tuning, situation management, workflow automation, and performance analytics. Use when
Nhost backend platform management covering project inventory, database status, authentication configuration, storage usage, serverless function deployments, Hasura GraphQL engine health, and environme
Advanced OpsGenie management covering routing rules, integration configurations, on-call analytics, alert policy tuning, notification rules, team structures, and escalation optimization. Use when perf
osTicket open-source helpdesk management covering ticket creation and routing, canned response templates, SLA plan configuration, and department-based ticket assignment. Use when managing support tick
Advanced PagerDuty management covering escalation policy design and optimization, event orchestration rules, AIOps noise reduction and intelligent grouping, analytics and reporting on incident volume
Phabricator Differential code review management including review workflows, Herald rules for automated actions, audit trails, and reviewer assignment. Use when managing Phabricator code review process
PullRequest professional code review service management including reviewer pool configuration, review SLAs, quality standards, and integration with GitHub/GitLab/Bitbucket. Use when managing PullReque
Replicate ML platform management covering model inventory, prediction history, deployment status, webhook configuration, training job analysis, hardware utilization, and billing metrics. Use for compr
Review Board code review management including review requests, diff management, review groups, and repository configuration. Use when managing Review Board review workflows, configuring review groups,
RunPod GPU cloud management covering pod inventory, serverless endpoint status, GPU type allocation, template configuration, volume management, spending analysis, and performance metrics. Use for comp
ServiceNow ITSM platform management covering incident lifecycle, change request workflows, CMDB configuration items, knowledge base article management, and SLA tracking. Use when creating or updating
Shoreline.io incident automation platform covering Op packs, automated remediation actions, metric and resource queries, alarm configuration, bot management, and notebook-driven debugging. Use when bu
Snipe-IT asset management covering hardware asset tracking, software license management, and check-in/check-out workflows. Use when cataloging IT hardware with serial numbers and custom fields, managi
Sourcery AI code quality and refactoring management including automated refactoring suggestions, code quality scoring, duplicate detection, and PR review integration. Use when configuring Sourcery for
Steampipe cloud infrastructure query engine management. Covers plugin installation, mod management, SQL-based resource querying, dashboard inspection, benchmark execution, compliance reporting, and co
SysAid ITSM platform management covering ticket management, asset discovery and inventory, self-service portal administration, and reporting. Use when creating and routing helpdesk tickets, reviewing
Transposit incident management platform covering runbook automation, incident lifecycle management, activity tracking, automated workflows, integration connectors, and post-incident review. Use when m
JetBrains Upsource code review management including review workflows, code inspections, branch review tracking, and IDE integration. Use when managing Upsource review processes, configuring automated
Val Town platform management covering val inventory, execution logs, scheduled val status, HTTP endpoint configuration, email handler analysis, blob storage usage, and usage metrics. Use for comprehen
Splunk On-Call (VictorOps) incident management covering incident timeline visualization, routing key configuration, escalation policies, on-call scheduling, team management, and alert rules. Use when
WhatTheDiff AI-powered PR summary generation including automatic change descriptions, reviewer-friendly summaries, and changelog automation. Use when configuring WhatTheDiff for automated PR documenta
xMatters communication workflow management covering on-call scheduling, group management, communication plan design, flow designer automation, event targeting, and notification analytics. Use when con
Zammad helpdesk platform management covering ticket operations, knowledge base article publishing, and reporting dashboards. Use when creating and managing support tickets with tags and custom attribu
Zendesk advanced support management covering ticket workflows, macros, triggers, automations, SLA policies, views, and reporting. Use when managing complex ticket routing, configuring automation rules
Zenduty incident management covering incident lifecycle, SLA tracking, escalation policies, on-call scheduling, postmortem generation, service dependency mapping, and integration management. Use when
Standardized access request process with multi-level approval chain for granting system, application, and data access. Covers request submission, manager approval, security review, provisioning, and a
API backwards compatibility review template for detecting breaking changes in REST, GraphQL, and gRPC APIs. Covers endpoint modifications, schema changes, response format changes, deprecation policy e
Architecture review template for evaluating major system changes, RFC proposals, and design documents. Covers scalability analysis, technology selection rationale, integration patterns, operational re
Response playbook for cascading and correlated failures across multiple services. Covers identification of the originating failure, blast radius mapping, circuit breaker activation, load shedding stra
TLS/SSL certificate expiry incident response and prevention playbook. Covers emergency certificate renewal, impact assessment, interim mitigations, certificate chain validation, automated renewal conf
Planned failure injection exercise framework with observation checklists, hypothesis definition, blast radius controls, rollback procedures, and post-drill analysis. Guides teams through designing, ex
Response playbook for regional or service-level outages from major cloud providers (AWS, GCP, Azure). Covers impact assessment, multi-region failover procedures, customer communication during provider
Regulatory compliance code review template covering SOC 2, HIPAA, PCI DSS, and GDPR requirements. Evaluates code changes for data handling compliance, audit logging, access controls, encryption standa
Customer-facing incident communication templates for status page updates, email notifications, social media responses, and support team talking points. Provides tone guidelines, timing recommendations
Data loss and data corruption incident response playbook covering immediate containment, impact assessment, recovery procedures from backups and replicas, data integrity verification, customer notific
Database migration PR review template covering migration safety, rollback planning, table locking analysis, data integrity validation, and production deployment strategy. Provides a systematic framewo
Dependency update PR review template covering CVE assessment, breaking change detection, license compliance, transitive dependency analysis, and upgrade risk evaluation. Provides a systematic framewor
Structures a comprehensive developer experience survey covering tooling satisfaction, development workflow friction, documentation quality, and overall developer productivity. This template helps plat
DNS-specific incident response playbook covering DNS resolution failures, propagation issues, DNSSEC validation errors, DNS provider outages, and misconfiguration recovery. Provides diagnostic command
Email delivery issues investigation workflow covering bounce analysis, spam filter diagnosis, SPF/DKIM/DMARC validation, mail flow tracing, and mailbox quota management. Guides helpdesk agents through
IT offboarding workflow for departing employees covering access revocation, device collection, data handling, license reclamation, and compliance documentation. Ensures all IT access is properly remov
Defines and structures an engineering metrics dashboard covering DORA metrics, quality indicators, operational health, and team productivity. This template helps engineering leaders select meaningful
Frontend-focused code review template covering accessibility compliance, rendering performance, bundle size impact, responsive design, UX consistency, and cross-browser compatibility. Provides a compr
Hardware request workflow spanning procurement, setup, and deployment, covering the full lifecycle from initial request and budget approval through vendor selection, purchase order processing, hardware configura
Expedited review process template for production hotfixes covering incident correlation, minimal change verification, rollback readiness, regression risk assessment, and post-incident follow-up tracki
Guide for running an effective incident bridge call or war room, covering agenda structure, facilitation techniques, role assignments, information flow management, and decision-making frameworks. Ensu
Comprehensive incident commander (IC) playbook covering IC responsibilities, communication cadence, escalation decision frameworks, delegation patterns, and handoff procedures. Guides the IC through e
Incident response metrics analysis framework covering MTTR (Mean Time to Resolve), MTTA (Mean Time to Acknowledge), MTBF (Mean Time Between Failures), and MTTD (Mean Time to Detect). Provides calculat
Guide for classifying incident severity from SEV1 through SEV4 using a structured impact matrix that considers user impact, revenue impact, data integrity, and blast radius. Provides clear criteria fo
Post-incident timeline building framework for reconstructing the sequence of events from logs, alerts, chat messages, deployment records, and monitoring data. Provides structured approaches to gatheri
Infrastructure-as-Code review template for Terraform, CloudFormation, Pulumi, and other IaC tools. Covers security hardening, cost impact analysis, blast radius assessment, state management safety, an
IT asset lifecycle tracking from procurement through disposal covering asset intake, deployment, maintenance, refresh planning, and secure decommissioning. Provides a framework for managing assets at
ITIL-aligned change management process covering change request submission, risk assessment, CAB review, approval workflows, implementation planning, and post-implementation review. Ensures all IT chan
Template for writing IT support knowledge base articles covering problem description, step-by-step resolution, troubleshooting tips, and related resources. Provides a standardized format for documenti
Structures a knowledge transfer plan for engineering teams facing team transitions, departures, or domain ownership changes. This template ensures critical knowledge about systems, processes, and trib
IT-wide incident communication workflow for notifying end users and stakeholders during major IT outages or service disruptions. Covers initial impact notification, periodic status updates, resolution
Guides teams through a structured, blameless review of major incidents, covering timeline reconstruction, root cause analysis, contributing factors, and corrective actions. This template ensures incid
Distributed systems code review template covering service resilience, API contract compliance, observability instrumentation, data consistency patterns, and inter-service communication. Provides a sys
ML/AI code review template covering data leakage detection, model reproducibility, bias assessment, feature engineering validation, and experiment tracking. Provides a systematic review framework for
Mobile application code review template covering battery efficiency, memory management, offline-first patterns, permission handling, app lifecycle management, and platform-specific best practices. Pro
Network issue diagnosis decision tree covering WiFi connectivity, wired LAN problems, DNS resolution failures, DHCP issues, and general internet access troubleshooting. Guides helpdesk agents through
IT onboarding checklist for new employees covering account provisioning, hardware setup, access permissions, software installation, and initial training. Generates a comprehensive onboarding task list
Provides a structured template for defining, tracking, and grading OKRs (Objectives and Key Results) for engineering teams. This template covers objective setting, key result definition with measurabl
Structured on-call rotation handoff checklist ensuring effective context transfer between outgoing and incoming on-call engineers. Covers active incidents, known issues, recent deployments, pending al
Provides a comprehensive onboarding checklist for new engineering team members covering environment setup, access provisioning, codebase orientation, team introductions, and ramp-up milestones. This t
External open source contribution review template covering code quality standards, licensing compliance, security vetting, CLA verification, and community guidelines adherence. Provides a systematic f
Self-service and assisted password reset workflow covering identity verification, reset execution across multiple systems, MFA recovery, and account lockout resolution. Provides step-by-step procedure
Performance-focused code review template covering N+1 query detection, memory leak identification, caching strategy evaluation, algorithmic complexity analysis, and resource utilization review. Provid
Response playbook for latency spikes, throughput degradation, and performance incidents. Covers systematic investigation of application, database, infrastructure, and network layers, with decision fra
Structured pull request review checklist covering security, performance, testing, documentation, and code quality. Provides a comprehensive, consistent review framework to ensure thorough reviews acro
Common printer issues resolution guide covering connectivity problems, print quality issues, paper jams, driver configuration, and network printer setup. Provides systematic troubleshooting steps for
Production readiness validation focused on incident response preparedness before launching a new service or major feature. Reviews on-call coverage, runbook completeness, monitoring and alerting setup
Provides a structured framework for engineering quarterly planning, covering goal setting, capacity allocation, dependency mapping, risk assessment, and milestone definition. This template helps engin
Guides teams through designing and implementing rate limiting strategies for APIs and services. This template covers algorithm selection, limit configuration, client communication, and monitoring to p
Comprehensive root cause analysis framework combining 5-Whys analysis, Ishikawa (fishbone) diagrams, and fault tree analysis methods. Provides structured templates for identifying contributing factors
Security-focused code review template covering OWASP Top 10 vulnerabilities, injection attacks, authentication flaws, authorization bypass, sensitive data exposure, and cryptographic misuse. Provides
Security-specific incident response playbook covering breach detection, compromised credentials response, data leak containment, evidence preservation, regulatory notification requirements, forensic i
SLA breach detection and escalation procedure covering proactive SLA monitoring, breach notification workflows, escalation matrices, and remediation tracking. Ensures SLA breaches are identified early
Software request, approval, licensing check, and deployment workflow covering the full lifecycle from user request through procurement, license validation, security review, installation, and verificat
Facilitates structured sprint retrospectives that help engineering teams reflect on what went well, what could be improved, and what actions to take. This template guides teams through data-driven ref
Provides a structured framework for assessing engineering team health across key dimensions including delivery pace, code quality, collaboration, well-being, and technical practices. This template hel
Response playbook for when a third-party vendor or external dependency experiences an outage. Covers impact assessment, customer communication, workaround activation, vendor status monitoring, interna
VPN connectivity troubleshooting decision tree covering common VPN client issues, authentication failures, split tunneling problems, performance degradation, and DNS resolution failures. Guides helpde
Defines a structured war room protocol for managing major incidents, including role assignments, communication cadences, escalation paths, and decision-making frameworks. This template helps teams res
100ms live video and audio infrastructure management for video conferencing, live streaming, and interactive sessions. Use when monitoring active rooms, analyzing session quality, reviewing usage metr
Ably real-time messaging platform management covering channels, presence, connections, usage analytics, and account health. Use when monitoring active channels, analyzing connection metrics, reviewing
Adobe Acrobat Sign (formerly Adobe Sign) eSignature platform management covering agreements, templates, workflows, users, and audit trails. Use when monitoring agreement status, analyzing signing comp
Adyen payment platform management including merchant accounts, payment methods, terminal configuration, risk management, settlement reports, and webhook notifications. Covers authorization rates, paym
Agora real-time engagement platform management for video/voice calling, interactive live streaming, and real-time messaging. Use when monitoring active channels, analyzing call quality metrics, review
Alchemy Web3 development platform management including app configuration, RPC endpoint health, webhook notifications, NFT API usage, Enhanced APIs, gas optimization, and usage analytics. Covers reques
Advanced Algolia search platform management including indices, search analytics, A/B tests, query rules, synonyms, API key audit, and infrastructure monitoring. Covers search performance metrics, rele
Anthropic API platform management covering models, usage tracking, rate limits, and message analytics. Use when monitoring API usage and costs, analyzing model request patterns, reviewing rate limit h
Anyscale platform management covering Ray clusters, services, model deployments, jobs, and usage analytics. Use when monitoring deployed services, analyzing cluster utilization, reviewing job status,
AWS GameLift fleet management including game server deployments, fleet scaling, matchmaking configuration, game session monitoring, and player session tracking. Covers fleet health, capacity utilizati
AWS IoT Core management including things, thing groups, certificates, policies, rules, topic monitoring, device shadows, and fleet indexing. Covers device connectivity health, message throughput, rule
Advanced AWS SES management including sending quotas, identity verification, configuration sets, dedicated IPs, reputation dashboard, suppression list, email receiving rules, and deliverability adviso
Azure IoT Hub management including device registry, device twins, message routing, endpoints, jobs, and IoT Edge deployments. Covers device connectivity status, message throughput, routing health, quo
balena IoT fleet management including applications, devices, releases, services, environment variables, and device diagnostics. Covers fleet deployment status, device connectivity, container health, u
Bandwidth communications platform management for voice calls, SMS/MMS messaging, phone number ordering, and 911 services. Use when managing phone number inventory, analyzing message delivery, monitori
BigCommerce store management including products, orders, customers, channels, shipping, inventory, storefront themes, and webhooks. Covers sales analytics, order fulfillment rates, inventory health, a
Bloomreach commerce search and merchandising platform management including catalog feeds, search configuration, autosuggest, category pages, SEO, A/B testing, pixel tracking, and recommendation widget
Box cloud content management platform covering files, folders, users, collaborations, metadata, and usage analytics. Use when monitoring storage usage, analyzing file activity, reviewing collaboration
Braintree payment gateway management including transactions, subscriptions, customers, payment methods, disputes, settlements, and merchant account configuration. Covers transaction success rates, dec
Cal.com open-source scheduling platform management covering event types, bookings, availability, teams, and analytics. Use when monitoring booking rates, analyzing event type usage, reviewing team ava
Calendly scheduling platform management covering event types, scheduled events, invitees, users, and booking analytics. Use when monitoring booking rates, analyzing event type performance, reviewing u
Clerk authentication and user management platform covering users, sessions, organizations, invitations, and sign-in methods. Use when analyzing user growth, monitoring authentication health, reviewing
Close CRM management including leads, contacts, opportunities, activities, sequences, smart views, and custom fields. Covers pipeline velocity, calling metrics, email outreach performance, sequence en
Cohere AI platform management covering models, embeddings, reranking, datasets, fine-tuning, and connectors. Use when monitoring API usage, analyzing model performance, reviewing fine-tuning jobs, man
Constructor.io product search and discovery platform management including catalog sync, search configuration, autosuggest, browse, recommendations, quizzes, and A/B testing. Covers catalog health, sea
ConvertKit (Kit) creator marketing platform management including subscriber management, forms, sequences, broadcasts, tags, and automation rules. Covers subscriber growth, email deliverability, sequen
Courier notification orchestration platform management covering messages, templates, brands, users, and delivery analytics. Use when monitoring notification delivery, analyzing template performance, r
Coveo enterprise search platform management including sources, indexes, query pipelines, machine learning models, usage analytics, and security providers. Covers indexing health, query performance, ML
Cronofy calendar API platform management covering calendars, events, availability, scheduling, and account sync health. Use when monitoring calendar integrations, analyzing scheduling patterns, review
Daily.co video and audio API platform management for real-time video calls, rooms, recordings, and usage analytics. Use when monitoring active rooms, analyzing meeting quality, reviewing usage and bil
Descope authentication and identity management platform covering users, tenants, access keys, flows, and audit logs. Use when analyzing user authentication patterns, monitoring tenant health, reviewin
DocuSign eSignature platform management covering envelopes, templates, users, signing workflows, and account analytics. Use when monitoring envelope status, analyzing signing completion rates, reviewi
Dropbox Business file storage and collaboration platform management covering files, folders, team members, sharing, and usage analytics. Use when monitoring storage usage, analyzing file activity, rev
Elastic Cloud (Elasticsearch Service) management including deployments, cluster health, index management, ILM policies, snapshot repositories, Kibana spaces, and APM configuration. Covers cluster perf
Filestack file upload, transformation, and delivery platform management covering uploads, transformations, CDN delivery, security policies, and usage analytics. Use when monitoring upload health, anal
Firebase Cloud Messaging (FCM) management covering message sending, topic subscriptions, device groups, and delivery analytics. Use when monitoring message delivery, analyzing notification performance
Fireworks AI inference platform management covering models, deployments, fine-tuning, and usage analytics. Use when monitoring API usage, analyzing model performance, reviewing fine-tuning jobs, check
Frontegg authentication and user management platform covering users, tenants, roles, permissions, SSO, and audit logs. Use when analyzing tenant health, monitoring user authentication, reviewing role
Google Cloud IoT platform management using Pub/Sub, Cloud Functions, and device management patterns that replaced the deprecated IoT Core. Covers device telemetry via Pub/Sub, command delivery, device
Google Drive API management covering files, folders, shared drives, permissions, storage quotas, and activity analytics. Use when monitoring storage usage, analyzing file sharing patterns, reviewing p
Google Maps Platform management including API key configuration, usage metrics, billing, quota monitoring, and service health across Maps, Routes, Places, and Geocoding APIs. Covers cost analysis, quo
Groq Cloud inference platform management covering models, API usage, rate limits, and performance metrics. Use when monitoring API health, analyzing inference latency, reviewing available models, chec
HelloSign (Dropbox Sign) eSignature platform management covering signature requests, templates, teams, and account analytics. Use when monitoring signature request status, analyzing completion rates,
HERE Maps platform management including project configuration, API key management, usage statistics, service health, and quota monitoring across Geocoding, Routing, Map Tile, and Search APIs. Covers r
Infura Web3 infrastructure management including project configuration, RPC endpoint health, IPFS gateway, usage statistics, and multi-chain support. Covers request volume monitoring, error tracking, b
Klaviyo marketing automation platform management including email and SMS campaigns, flows, lists, segments, metrics, and revenue attribution. Covers deliverability monitoring, flow performance, list g
Knock notification infrastructure management covering workflows, channels, users, messages, and delivery analytics. Use when monitoring notification delivery, analyzing workflow performance, reviewing
LiveKit real-time video and audio infrastructure management for WebRTC-based video conferencing, live streaming, and data channels. Use when monitoring active rooms, analyzing participant quality, rev
Mailchimp email marketing platform management including audience lists, campaigns, automations, templates, and analytics. Covers list health, campaign performance, subscriber engagement, bounce and un
Mailgun email service management including domain configuration, sending statistics, deliverability monitoring, bounce and complaint tracking, route management, and mailing list administration. Covers
Mapbox platform management including map styles, tilesets, datasets, tokens, usage statistics, and geocoding. Covers API usage monitoring, tileset processing status, token permission auditing, and rat
Moralis Web3 data platform management including EVM and Solana API endpoints, Streams (webhooks), token and NFT APIs, wallet APIs, and usage monitoring. Covers API health, compute unit usage, stream d
Novu open-source notification infrastructure management covering workflows, subscribers, messages, integrations, and delivery analytics. Use when monitoring notification delivery, analyzing workflow p
Nylas calendar and email API platform management covering calendars, events, messages, contacts, and scheduling pages. Use when monitoring calendar sync health, analyzing email delivery, reviewing sch
OneSignal push notification and messaging platform management covering notifications, segments, users, templates, and delivery analytics. Use when monitoring notification delivery rates, analyzing use
OpenAI API platform management covering models, usage, fine-tuning jobs, files, assistants, and billing. Use when monitoring API usage and costs, analyzing model performance, reviewing fine-tuning sta
OpenStreetMap API and related services management including Nominatim geocoding, Overpass API queries, tile server health, changeset monitoring, and data quality checks. Covers API endpoint health, us
Paddle billing and payments platform management including products, prices, subscriptions, transactions, customers, discounts, and payouts. Covers MRR tracking, churn analysis, subscription health, re
PandaDoc document automation platform management covering documents, templates, contacts, workspaces, and analytics. Use when monitoring document status, analyzing completion rates, reviewing template
Particle IoT platform management including device fleet, firmware, products, events, integrations, and SIM cards. Covers device connectivity monitoring, OTA firmware deployment status, data usage trac
Passage by 1Password passwordless authentication management covering users, devices, WebAuthn credentials, and app configuration. Use when analyzing passkey adoption, monitoring user authentication he
Perplexity AI API management covering models, search-augmented generation, usage analytics, and rate limits. Use when monitoring API health, analyzing query performance, reviewing available models, ch
Photon Engine real-time multiplayer services management including Photon Realtime, PUN, Fusion, Quantum, Chat, and Voice. Covers CCU monitoring, room statistics, region health, bandwidth usage, and ap
Pipedrive CRM management including deals, persons, organizations, pipelines, activities, products, and email integration. Covers pipeline health, deal velocity, activity completion rates, revenue fore
PlayFab backend services management including player data, title configuration, economy, matchmaking, multiplayer servers, and LiveOps. Covers player analytics, server fleet health, economy balance mo
Plivo cloud communications platform management for voice calls, SMS messaging, phone number management, and usage analytics. Use when analyzing message delivery rates, monitoring call quality, reviewi
Postmark transactional email service management including server configuration, message streams, delivery statistics, bounce tracking, template management, and sender signature verification. Covers de
Pusher real-time messaging platform management covering channels, presence, usage analytics, and webhook configuration. Use when monitoring active channels, analyzing connection metrics, reviewing usa
Advanced Salesforce CRM management including objects, SOQL queries, reports, dashboards, flows, Apex triggers, API usage, org limits, deployment status, and user management. Covers org health monitori
Sendbird messaging platform management covering group channels, open channels, users, messages, moderation, and usage analytics. Use when monitoring chat health, analyzing message volumes, reviewing u
Brevo (formerly Sendinblue) marketing platform management including transactional email, SMS campaigns, contact management, automation workflows, and landing pages. Covers sending quotas, deliverabili
Advanced Shopify store management including products, inventory, orders, customers, analytics, fulfillment, shipping, themes, apps, and webhooks. Covers sales performance, inventory health, fulfillmen
SparkPost email delivery service management including sending domains, IP pools, message events, deliverability metrics, templates, webhooks, and suppression lists. Covers bounce classification, engag
Square payment and commerce platform management including locations, catalog, orders, payments, subscriptions, invoices, team members, and devices. Covers payment volume metrics, location performance,
Stream Chat messaging platform management covering channels, users, messages, moderation, and usage analytics. Use when monitoring chat health, analyzing message volumes, reviewing user activity, mana
Advanced Stripe payment platform management including payment intents, subscriptions, invoices, disputes, payouts, Connect accounts, Radar fraud rules, and billing portal. Covers revenue analytics, di
Stytch authentication and identity platform management covering users, sessions, magic links, OTPs, OAuth, and organization management. Use when analyzing user authentication patterns, monitoring sess
thirdweb Web3 development platform management including contract deployments, Engine instances, embedded wallets, auth sessions, RPC usage, and storage (IPFS). Covers contract interaction health, wall
Together AI inference platform management covering models, fine-tuning jobs, usage analytics, and billing. Use when monitoring API usage and costs, analyzing model performance, reviewing fine-tuning s
Deep Twilio management covering voice calls, SMS/MMS messaging, SIP trunking, phone number provisioning, usage records, and account health. Use when analyzing Twilio call quality, messaging delivery r
Unity Cloud services management including Unity Gaming Services (UGS), build automation, cloud content delivery, multiplayer relay, matchmaking, and player authentication. Covers project health monito
Unreal Engine cloud services management including Epic Online Services (EOS), matchmaking, lobbies, player data storage, analytics, and anti-cheat. Covers deployment health, player session metrics, ti
Vonage (formerly Nexmo) communications platform management covering SMS, voice, video, and messaging APIs. Use when analyzing message delivery, monitoring voice call quality, reviewing account balance
WooCommerce store management via REST API including products, orders, customers, coupons, shipping, taxes, reports, and system status. Covers sales analytics, inventory monitoring, order fulfillment,
Zoho CRM management including leads, contacts, accounts, deals, activities, reports, workflows, and blueprints. Covers pipeline analytics, lead conversion tracking, activity completion, workflow execu
Conducts an ethical review of AI/ML systems covering fairness, transparency, accountability, privacy, and safety. Evaluates potential harms, bias in training data and model outputs, explainability req
Provides a systematic investigation framework for diagnosing and resolving API latency issues. Covers distributed tracing analysis, bottleneck identification across the request path, database query im
Guides a structured migration from AWS to Google Cloud Platform, covering service mapping, data transfer strategies, IAM reconfiguration, networking changes, and validation testing. Produces a phased
Evaluates organizational compliance with the California Consumer Privacy Act (CCPA) and California Privacy Rights Act (CPRA). Covers consumer rights implementation, data inventory, privacy notice requ
Reviews and optimizes CDN configuration for maximum cache hit ratio, minimal latency, and cost efficiency. Covers cache policy tuning, origin shield configuration, edge function optimization, security
Reviews CI/CD pipeline configuration for reliability, speed, security, and best practices. Covers build optimization, test strategy, deployment patterns, secret management, artifact handling, and pipe
Configures comprehensive billing alerts and budget notifications across cloud accounts. Covers budget threshold alerts, anomaly detection, per-service spending limits, team-level notifications, and es
Calculates the optimal mix of reserved instances, savings plans, and committed use discounts based on historical usage data. Produces a purchase plan that maximizes savings while maintaining flexibili
Designs a cost allocation and chargeback model for cloud spending across teams, projects, and environments. Covers tagging strategy, allocation rules for shared services, showback/chargeback reporting
Assesses organizational readiness for Cybersecurity Maturity Model Certification (CMMC 2.0) at the target level. Covers CUI identification, NIST SP 800-171 control implementation, gap analysis against
Guides the analysis and optimization of connection pools for databases, HTTP clients, and message brokers. Covers pool sizing calculations, timeout configuration, leak detection, monitoring setup, and
Guides systematic CPU profiling to identify and resolve CPU-bound performance bottlenecks. Covers profiler selection and setup, flame graph analysis, hot path identification, optimization strategies,
Guides the setup and implementation of a data catalog to enable data discovery, documentation, lineage tracking, and governance. Covers tool selection, metadata extraction, automated cataloging, owner
Establishes a comprehensive data governance framework covering data ownership, quality standards, classification, lineage tracking, access policies, and stewardship roles. Produces actionable policies
Conducts a systematic assessment of data quality across key dimensions including completeness, accuracy, consistency, timeliness, uniqueness, and validity. Identifies data quality issues, quantifies t
Guides the migration of databases to cloud-managed services, covering schema compatibility analysis, data transfer methods, replication setup, cutover procedures, and post-migration validation. Suppor
Provides a systematic approach to diagnosing and resolving database performance issues. Covers query analysis, index optimization, schema review, configuration tuning, connection management, and capac
Assesses an organization's DevOps maturity across key dimensions including culture, automation, lean, measurement, and sharing (CALMS). Produces a current maturity score, identifies improvement areas, and g
Audits the state of technical documentation across an engineering organization. Covers documentation inventory, freshness assessment, gap identification, quality scoring, ownership assignment, and a r
Reviews ETL/ELT pipeline architecture for reliability, performance, data quality, and maintainability. Covers pipeline design patterns, error handling, idempotency, monitoring, testing strategies, and
Assesses an organization's readiness for FedRAMP authorization, covering the NIST 800-53 controls required at the Low, Moderate, or High baseline (roughly 325 at Moderate). Includes system boundary definition, control implem
Assesses an organization's GitOps adoption maturity across declarative configuration, version control practices, automated reconciliation, and observability. Evaluates adherence to GitOps principles a
Plans and facilitates a tabletop exercise to test incident response procedures without impacting production systems. Covers scenario design, participant preparation, exercise facilitation, response ev
Provides a comprehensive audit checklist aligned with ISO 27001:2022 Annex A controls for information security management systems. Covers all 93 controls across organizational, people, physical, and t
Guides the modernization of legacy APIs (SOAP, XML-RPC, proprietary protocols) to modern RESTful or GraphQL interfaces. Covers API discovery, contract analysis, versioning strategy, backward compatibi
Provides a structured framework for evaluating large language models (LLMs) for production use. Covers task-specific benchmarking, safety testing, cost analysis, latency measurement, prompt engineerin
Designs a comprehensive load testing plan covering test scenario definition, workload modeling, environment preparation, tool selection, execution strategy, and results analysis. Supports stress testi
Provides a structured methodology for migrating mainframe workloads (COBOL, JCL, CICS, IMS) to cloud-native platforms. Covers code analysis, automated conversion, data migration from hierarchical and
Conducts a review of team meeting practices to identify waste, improve effectiveness, and reclaim productive time. Covers meeting inventory, cost analysis, attendee optimization, agenda quality, decis
Provides a systematic methodology for detecting, diagnosing, and resolving memory leaks in production applications. Covers heap analysis, allocation profiling, leak pattern identification, garbage col
Provides a comprehensive checklist for deploying machine learning models to production, covering model validation, infrastructure setup, serving configuration, monitoring, A/B testing, and rollback pr
Guides the decomposition of a monolithic application into serverless functions and managed services. Covers domain boundary identification, function extraction, event-driven architecture design, state
Provides a structured framework for comparing costs across multiple cloud providers for equivalent workloads. Covers service-level price comparison, TCO analysis, hidden cost identification, discount
Conducts an assessment against the NIST Cybersecurity Framework (CSF 2.0), evaluating organizational maturity across all six core functions: Govern, Identify, Protect, Detect, Respond, and Recover. Pr
Evaluates an organization's observability maturity across the three pillars (metrics, logs, traces) plus alerting, dashboards, and AIOps capabilities. Identifies gaps in visibility, assesses signal qu
Provides a comprehensive framework for migrating on-premises infrastructure to a cloud provider. Covers workload assessment, network connectivity, data migration, security posture translation, and pha
Plans and scopes a penetration test engagement, covering target identification, rules of engagement, testing methodology selection, team coordination, communication protocols, and findings remediation
Assesses the maturity and effectiveness of an organization's internal developer platform (IDP). Covers self-service capabilities, developer experience, golden paths, platform team structure, adoption
Analyzes current cloud compute usage patterns to recommend optimal reserved instance or savings plan purchases. Covers utilization analysis, commitment term selection, payment option comparison, cover
Assesses the integration of security practices into DevOps workflows (DevSecOps). Covers shift-left security, automated security testing in CI/CD, supply chain security, runtime protection, security c
Conducts an IT general controls audit for Sarbanes-Oxley (SOX) compliance, covering access management, change management, computer operations, and program development controls for systems involved in
Designs a spot instance strategy for fault-tolerant and flexible workloads to achieve significant cost savings. Covers workload suitability assessment, instance diversification, interruption handling,
Evaluates an organization's Site Reliability Engineering maturity across SLO management, incident response, toil reduction, capacity planning, and reliability culture. Produces a maturity scorecard an
Guides the process of creating or updating a technology radar that tracks the adoption status of technologies, tools, frameworks, and practices across the organization. Covers technology assessment, r
Provides a structured approach for migrating VM-based workloads to containers, covering application analysis, Dockerfile creation, orchestration setup, storage and networking adaptation, and progressi
Runs a focused sprint to identify and eliminate cloud resource waste. Covers idle resource detection, rightsizing opportunities, orphaned resource cleanup, storage optimization, and quick-win cost red
Turn runbooks, SOPs, and tribal knowledge into reusable AI skills