ACE Platform DR Checklist - Data Engineering

Business Function

The ETL RDS instances store the following information from the data warehouse:

  • ETL Metadata
  • ETL Batch / Job Stats
  • DB Config Details
  • Trust Rule Metadata

There are four environments:

  • dev
  • qa
  • uat
  • prod

Scope

This document covers all four ETL RDS environments (dev, qa, uat, prod), which run in the Prod ACE AWS account.

The regional scope of this RC is the us-west-2 region of AWS.

Out of Scope

  • Multi-region availability is out of scope per business requirements.

Backup retention

  • Backup retention for dev and qa: 30 days
  • Backup retention for uat and prod: 14 days

For all environments, the backup window starts at 19:00 UTC and lasts 0.5 hours.
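
The retention values above bound how far back a restore can reach. A minimal Python sketch of that check follows, assuming the window is simply counted back from the current time in whole days. The names RETENTION_DAYS and is_within_retention are illustrative and are not part of the terraform-ace-prod repo.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Retention periods in days, as stated in this document.
RETENTION_DAYS = {"dev": 30, "qa": 30, "uat": 14, "prod": 14}


def is_within_retention(env: str, target: datetime,
                        now: Optional[datetime] = None) -> bool:
    """Return True if `target` falls inside the retention window for `env`.

    Assumption: the window is [now - retention, now]. The restorable range
    that AWS actually reports for an instance takes precedence.
    """
    now = now or datetime.now(timezone.utc)
    earliest = now - timedelta(days=RETENTION_DAYS[env])
    return earliest <= target <= now
```

For example, a restore target 20 days in the past would pass this check for dev or qa but not for uat or prod.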

Key Information

  • VPN is enabled
  • Logged in to the AWS CLI
  • Logged in to Terraform
  • Name of the RDS instance you want to restore
  • Deployment files are available in the terraform-ace-prod repo. The steps below apply to this repo.
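
Part of the list above can be checked automatically before starting. The following is a rough sketch, assuming the AWS CLI and Terraform are installed under their usual executable names (aws and terraform). The helper missing_tools is hypothetical and only confirms the executables are on PATH.

```python
import shutil

# CLI tools the checklist relies on (both are named in this document).
REQUIRED_TOOLS = ("aws", "terraform")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the entries of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]
```

An empty list from missing_tools() means both executables were found. It does not confirm VPN connectivity or that either CLI session is authenticated; running aws sts get-caller-identity manually is one way to confirm the AWS CLI login.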

Recovery Checklist & Dependencies

The following table provides the step-by-step checklist for the ACE Infra team to follow when recovering the ETL RDS instances.

| Status | Task Summary | Task Details |
|---|---|---|
| [ ] Complete | Open terraform-ace-prod repo | Navigate to ./terraform-ace-prod/us-west-2 |
| [ ] Complete | Search for RDS directory | In this directory, search for the name of the RDS instance you want to restore (in VS Code, press Command + F in the Explorer tab) and navigate to its directory |
| [ ] Complete | Make changes | Follow the [Disaster Recovery] comments in the main.tf file and make changes accordingly |
| [ ] Complete | terraform init | Run terraform init |
| [ ] Complete | terraform plan | After terraform init succeeds, run terraform plan |
| [ ] Complete | terraform apply | After terraform plan succeeds, run terraform apply |
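
The directory search and the three Terraform steps in the checklist can be sketched in Python as below. This is an illustration under assumptions, not the team's tooling: it assumes the RDS instance name appears in its directory name, and the helpers find_rds_dirs and run_recovery are hypothetical. The manual edits to main.tf still have to be made before running anything.

```python
import subprocess
from pathlib import Path

# Command order taken from the checklist: init, then plan, then apply.
TERRAFORM_STEPS = (
    ["terraform", "init"],
    ["terraform", "plan"],
    ["terraform", "apply"],
)


def find_rds_dirs(region_root, rds_name):
    """Return directories under `region_root` whose name contains `rds_name`.

    Assumption: the RDS instance name is part of its directory name.
    """
    root = Path(region_root)
    return sorted(p for p in root.rglob("*") if p.is_dir() and rds_name in p.name)


def run_recovery(workdir, run=subprocess.run):
    """Run each Terraform step in `workdir`, stopping at the first failure.

    Returns the list of commands that completed successfully.
    """
    completed = []
    for cmd in TERRAFORM_STEPS:
        result = run(cmd, cwd=workdir)
        if result.returncode != 0:
            break  # Do not continue to later steps after a failed step.
        completed.append(cmd)
    return completed
```

A typical call would be run_recovery(find_rds_dirs("./terraform-ace-prod/us-west-2", name)[0]) once exactly one matching directory has been confirmed. Because -auto-approve is not passed, terraform apply still prompts for confirmation, which leaves a final chance to review the plan.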