Blast Radius Assessment¶

The blast radius assessment feature provides ITIL v4 compliant risk analysis for infrastructure changes by combining dependency analysis with planned changes to assess the potential impact before deployment.

Overview¶

Blast radius assessment helps teams: - Minimize Risk: Identify potential issues before deployment - Improve Planning: Provide clear mitigation and rollback steps - Enforce Governance: ITIL v4 compliant approval workflows - Enhance Visibility: Show complete impact of changes

Command Usage¶

Basic Blast Radius Assessment¶

thothctl check -type blast-radius --recursive

With Specific Plan File¶

thothctl check -type blast-radius --recursive --plan-file tfplan.json

Command Options¶

--recursive: Analyze dependencies recursively through subdirectories
--plan-file: Path to terraform plan JSON file (optional)
--directory: Target directory to analyze (default: current directory)

Risk Assessment Formula¶

Base Risk Factors¶

The assessment uses weighted risk factors to calculate component risk scores:

Factor	Weight	Description
Changes Frequency	30%	How often the component changes
Dependencies Count	25%	Number of dependencies
Complexity	20%	Complexity of the component
Criticality	15%	How critical the component is
Recent Changes	10%	Recent changes to the component

Risk Calculation¶

component_risk = (
    changes_frequency * 0.3 +
    dependencies_count * 0.25 +
    complexity * 0.2 +
    criticality * 0.15 +
    recent_changes * 0.1
)

Change Type Multipliers¶

Different change types have varying risk levels:

Change Type	Risk Multiplier	Description
Delete	1.5x	Deletion is 50% more risky
Replace	1.3x	Replacement is 30% more risky
Update	1.0x	Update is baseline risk
Create	0.8x	Creation is 20% less risky
No Change	0.0x	No change = no risk

Overall Risk Assessment¶

# Weighted combination of factors
final_risk_score = (
    avg_risk * 0.6 +          # 60% weight on average risk
    max_risk * 0.3 +          # 30% weight on maximum risk
    blast_radius_factor * 0.1  # 10% weight on blast radius size
)

ITIL v4 Risk Categories¶

Risk Levels and Thresholds¶

Risk Score	Risk Level	Color	ITIL Change Type	Approval Required
0.0 - 0.3	LOW	🟢 Green	STANDARD	Automated
0.3 - 0.6	MEDIUM	🟡 Yellow	NORMAL	Team Lead
0.6 - 0.8	HIGH	🟠 Orange	NORMAL	Senior Management
0.8 - 1.0	CRITICAL	🔴 Red	EMERGENCY	CAB Approval

Change Types¶

STANDARD Changes¶

Risk Level: LOW
Approval: Automated process
Timing: Can be deployed during business hours
Monitoring: Basic monitoring sufficient

NORMAL Changes¶

Risk Level: MEDIUM to HIGH
Approval: Team Lead or Senior Management
Timing: Scheduled maintenance windows recommended
Monitoring: Enhanced monitoring required

EMERGENCY Changes¶

Risk Level: CRITICAL
Approval: Change Advisory Board (CAB)
Timing: Immediate with full incident response
Monitoring: Real-time monitoring with on-call team

Output Example¶

High Risk Scenario¶

================================================================================
🎯 BLAST RADIUS ASSESSMENT (ITIL v4 Compliant)
================================================================================

┌─────────────────── 📊 Risk Summary ───────────────────┐
│ Risk Level: HIGH                                      │
│ Change Type: NORMAL                                   │
│ Total Components: 12                                  │
│ Affected Components: 7                                │
└───────────────────────────────────────────────────────┘

                    💥 Affected Components                     
┌─────────────────────┬──────────────┬────────────┬─────────────┐
│ Component           │ Change Type  │ Risk Score │ Criticality │
├─────────────────────┼──────────────┼────────────┼─────────────┤
│ vpc-main            │ update       │ 0.85       │ critical    │
│ security-group-web  │ replace      │ 0.72       │ high        │
│ rds-primary         │ update       │ 0.68       │ high        │
└─────────────────────┴──────────────┴────────────┴─────────────┘

┌─────────────────── 📋 ITIL v4 Recommendations ───────────────────┐
│ • ⚠️ HIGH: Require senior management approval                    │
│ • ⚠️ Schedule during maintenance window                          │
│ • ⚠️ Prepare detailed rollback procedures                       │
│ • ⚠️ Monitor affected systems closely                           │
└──────────────────────────────────────────────────────────────────┘

Low Risk Scenario¶

┌─────────────────── 📊 Risk Summary ───────────────────┐
│ Risk Level: LOW                                       │
│ Change Type: STANDARD                                 │
│ Total Components: 8                                   │
│ Affected Components: 2                                │
└───────────────────────────────────────────────────────┘

┌─────────────────── 📋 ITIL v4 Recommendations ───────────────────┐
│ • ✅ LOW: Standard change process applies                        │
│ • ✅ Can be deployed during business hours                       │
│ • ✅ Basic monitoring sufficient                                 │
└──────────────────────────────────────────────────────────────────┘

Integration with Existing Commands¶

Dependency Analysis Integration¶

The blast radius assessment leverages the existing dependency analysis:

# First run dependency analysis
thothctl check -type deps --recursive

# Then run blast radius assessment
thothctl check -type blast-radius --recursive

Plan Analysis Integration¶

Works with terraform plan files:

# Generate plan first
terraform plan -out=tfplan.json

# Assess blast radius with plan
thothctl check -type blast-radius --plan-file tfplan.json --recursive

Best Practices¶

Pre-Deployment Workflow¶

Generate Dependencies: Run thothctl check -type deps --recursive
Create Plan: Generate terraform plan with terraform plan -out=tfplan.json
Assess Risk: Run thothctl check -type blast-radius --plan-file tfplan.json --recursive
Review Results: Follow ITIL v4 recommendations for approval
Implement Mitigations: Execute recommended mitigation steps
Deploy with Monitoring: Deploy with appropriate monitoring level

Risk Mitigation Strategies¶

For HIGH/CRITICAL Risk Changes¶

Phased Deployment: Break changes into smaller, less risky phases
Blue-Green Deployment: Use blue-green deployment strategies
Canary Releases: Deploy to subset of infrastructure first
Maintenance Windows: Schedule during low-traffic periods
Rollback Testing: Test rollback procedures before deployment

For MEDIUM Risk Changes¶

Staging Validation: Thorough testing in staging environment
Monitoring Setup: Enhanced monitoring during deployment
Team Coordination: Ensure team availability during deployment

For LOW Risk Changes¶

Standard Process: Follow normal deployment procedures
Basic Monitoring: Standard monitoring sufficient
Documentation: Ensure changes are properly documented

Troubleshooting¶

Common Issues¶

No Dependencies Found¶

# Ensure terragrunt.hcl files exist
ls -la */terragrunt.hcl

# Check directory structure
thothctl check -type deps --recursive

Plan File Not Found¶

# Generate plan file first
terraform plan -out=tfplan.json

# Or use without plan file
thothctl check -type blast-radius --recursive

High Risk False Positives¶

Review component criticality settings
Check dependency graph accuracy
Validate change type detection
Consider component-specific risk factors

Configuration¶

Risk Threshold Customization¶

Risk thresholds can be customized in the service configuration:

risk_thresholds = {
    ChangeRisk.LOW: 0.3,      # Adjust as needed
    ChangeRisk.MEDIUM: 0.6,   # Adjust as needed
    ChangeRisk.HIGH: 0.8,     # Adjust as needed
    ChangeRisk.CRITICAL: 1.0  # Maximum risk
}

Component Criticality Override¶

Components can be marked with specific criticality levels: - Critical: Core infrastructure components - High: Important but not critical components - Medium: Standard components - Low: Non-essential components

thothctl check -type deps - Dependency analysis
thothctl check -type plan - Plan validation
thothctl inventory iac - Infrastructure inventory
thothctl scan iac - Security scanning