Emergency rollback: Execute the predefined rollback plan, such as configuration restoration from backups and VM recovery from snapshots.
Service migration: Migrate services that will be interrupted to other nodes. Prioritize the recovery of core services (such as production systems) after the O&M operation is complete.
Troubleshooting: Use aDeploy to collect logs and contact technical support to locate the root cause.
Mitigate data security risks:
Periodic backup: Back up the incremental data of system configuration and services daily, and back up the full data of system configuration and services weekly. Save backup files to a dedicated device.
Backup verification: Perform a backup restoration test every month to ensure that backup files can be used for restoration.
Data monitoring: Enable data integrity verification (such as RAID card verification) to identify bad sectors or abnormal data blocks at the earliest opportunity.
Mitigate compatibility risks:
Pre-verification: Before adding hardware or installing SPs, confirm compatibility by checking the compatibility list of SCP, and run tests in a test environment if necessary.
Compatibility issue fixing: When compatibility issues occur (such as incompatible NIC drivers), contact the Sangfor R&D team to provide a solution, such as dedicated driver packages.[11]