Files
bi-agents/.agents/skills/bmad-testarch-nfr/steps-c/step-04c-subagent-reliability.md
Cassel 647cbec54f docs: update all documentation and add AI tooling configs
- Rewrite README.md with current architecture, features and stack
- Update docs/API.md with all current endpoints (corporate, BI, client 360)
- Update docs/ARCHITECTURE.md with cache, modular queries, services, ETL
- Update docs/GUIA-USUARIO.md for all roles (admin, corporate, agente)
- Add docs/INDEX.md documentation index
- Add PROJETO.md comprehensive project reference
- Add BI-CCC-Implementation-Guide.md
- Include AI agent configs (.claude, .agents, .gemini, _bmad)
- Add netbird VPN configuration
- Add status report

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-19 13:29:03 -04:00

1.8 KiB

name, description, subagent, outputFile
name description subagent outputFile
step-04c-subagent-reliability Subagent: Reliability NFR assessment true /tmp/tea-nfr-reliability-{{timestamp}}.json

Subagent 4C: Reliability NFR Assessment

SUBAGENT CONTEXT

This is an isolated subagent running in parallel with other NFR domain assessments.

Your task: Assess RELIABILITY NFR domain only.


SUBAGENT TASK

1. Reliability Assessment Categories

A) Error Handling:

  • Try-catch blocks for critical operations
  • Graceful degradation
  • Circuit breakers
  • Retry mechanisms

B) Monitoring & Observability:

  • Logging implementation
  • Error tracking (Sentry/Datadog)
  • Health check endpoints
  • Alerting systems

C) Fault Tolerance:

  • Database failover
  • Service redundancy
  • Backup strategies
  • Disaster recovery plan

D) Uptime & Availability:

  • SLA targets
  • Historical uptime
  • Incident response

OUTPUT FORMAT

{
  "domain": "reliability",
  "risk_level": "LOW",
  "findings": [
    {
      "category": "Error Handling",
      "status": "PASS",
      "description": "Comprehensive error handling with circuit breakers",
      "evidence": ["Circuit breaker pattern in src/services/", "Retry logic implemented"],
      "recommendations": []
    },
    {
      "category": "Monitoring",
      "status": "CONCERN",
      "description": "No APM (Application Performance Monitoring) tool",
      "evidence": ["Logging present but no distributed tracing"],
      "recommendations": ["Implement APM (Datadog/New Relic)", "Add distributed tracing"]
    }
  ],
  "compliance": {
    "SLA_99.9": "PASS"
  },
  "priority_actions": ["Implement APM for better observability"],
  "summary": "Reliability is good with minor monitoring gaps"
}

EXIT CONDITION

Subagent completes when JSON output written to temp file.