Backup
Planned procedure for protecting data and system state, governing restoration, retention and validation.
Classification
- ComplexityMedium
- Impact areaTechnical
- Decision typeArchitectural
- Organizational maturityIntermediate
Technical context
Principles & goals
Use cases & scenarios
Compromises
- Risk of corrupted backups remaining undetected without integrity checks.
- Unencrypted backups endanger privacy and compliance.
- Lack of offsite copies increases risk in site-level failures.
- Perform automated, verified restore tests regularly at minimum.
- Use encryption and access controls for backup data.
- Combine local fast restores with off-site redundancy.
I/O & resources
- List of data sets and systems to be backed up
- Backup policy with retention and frequency
- Storage targets, encryption and access policies
- Backup archives (full/incremental), verification logs
- Recovery documentation and test reports
- Retention and deletion evidence for compliance
Description
Backup describes procedures for scheduled protection of data and system state, including retention, restoration and validation. The method covers full, incremental and differential strategies, retention policies and regular restore tests to ensure recoverability. Its goal is to maximise data availability and minimise downtime across on-premises and cloud environments.
✔Benefits
- Reduces data loss and shortens recovery times.
- Enables compliance via audit-proof retention.
- Supports restoration after outages or failures.
✖Limitations
- Requires additional storage and operational costs.
- Insufficient testing can create a false sense of security.
- Complexity in heterogeneous system landscapes.
Trade-offs
Metrics
- Recovery Time Objective (RTO)
Target time to restore a service after failure.
- Recovery Point Objective (RPO)
Maximum tolerable data loss measured in time.
- Backup success rate
Proportion of successful backup runs versus scheduled runs.
Examples & implementations
Small business nightly backups
Daily full backups overnight with weekly offsite copies.
Enterprise with tiered retention
Combination of incremental backups, snapshots and long-term archival for compliance.
Cloud-native app using storage snapshots
Use of block and object storage snapshots for fast recovery.
Implementation steps
Analyze data criticality and define RTO/RPO
Select appropriate backup strategy and tools
Set up automation, monitoring and schedule restore tests
⚠️ Technical debt & bottlenecks
Technical debt
- Legacy backup formats without documentation hinder recovery.
- Non-standard backup scripts complicate automation.
- Unclear retention rules lead to unnecessary storage buildup.
Known bottlenecks
Misuse examples
- Only local backups without an off-site copy in case of site failure.
- Using backup scripts without error handling and alerts.
- Automated backups but no scheduled restore tests.
Typical traps
- Relying on success messages without integrity checks.
- Unclear responsibilities for restore operations.
- Missing consideration of dependencies between systems.
Required skills
Architectural drivers
Constraints
- • Legal retention periods and data protection requirements.
- • Limited local storage capacity during backup windows.
- • Performance impact on production during backups.