Resiliency
Resiliency → ability of systems and infrastructure to withstand failures, attacks, or outages and continue operating or recover quickly with minimal impact to availability.
HA
High availability (HA) → design approach focused on keeping systems online and accessible at all times by minimizing downtime through failover mechanisms and continuous operation.
Redundancy
Redundancy → use of duplicate components or systems to eliminate single points of failure, including redundant disks, power, or network paths; common redundant array of independent disks levels include RAID 0 for striping, RAID 1 for mirroring, RAID 5 for striping with parity, RAID 6 for dual parity, and RAID 10 for mirrored striping.
Server Clustering
Server clustering → resiliency technique that combines two or more servers to operate as a single system, allowing workloads to fail over automatically if one server goes down.
Load Balancing
Load balancing → traffic distribution method that spreads requests across multiple independent servers to improve availability and performance, even though the servers do not coordinate or share state.
Hot Site
Hot site → fully operational backup location with up-to-date systems and data that can take over immediately after a primary site failure.
Cold Site
Cold site → backup location with minimal infrastructure that requires significant setup time before becoming operational after a disaster.
Warm Site
Warm site → partially prepared backup location with some systems and data in place, requiring less setup time than a cold site but more than a hot site.
COOP
Continuity of operations planning (COOP) → organizational planning process that ensures essential functions continue during and after a disruption, focusing on people, processes, and technology.