Andrew Despres

Posted on Mar 4

CompTIA SY0-701 Security+ 3.4 Study Guide: Operations and Resiliency

#comptia #securityplus #beginners #cybersecurity

This study guide provides a comprehensive overview of data backup strategies, power resiliency, capacity planning, and organizational resiliency as outlined in the CompTIA SY0-701 domain.

1. Data Backup and Integrity Strategies

Backups are the foundation of data recovery, allowing organizations to restore information quickly after data loss. Developing a backup strategy requires balancing variables such as data volume, media types, and storage locations.

Backup Types and Methods

On-site Backups: Data and media are stored at the same location as the source. This is cost-effective and allows for immediate restoration but offers no protection if the physical site is destroyed.
Off-site Backups: Data is transferred to a different physical location or shipped via physical media. This ensures data is recoverable even if the primary facility is lost.
Cloud Backups: Data is stored on remote servers managed by a third party. Encryption is essential here, as the organization has no control over who physically accesses the cloud hardware.
Snapshots: Commonly used for Virtual Machines (VMs), a snapshot captures the entire state of a system at a specific moment. It acts like an incremental backup, saving only the changes since the last snapshot.
- Real-World Comparison: Think of a snapshot like a "Save Point" in a video game. If you make a mistake or the system crashes, you can "roll back" to exactly where you were when you clicked the button.
Replication: Data is copied from a source to one or more locations in near real-time. If the original data changes, the replicas are updated within seconds or minutes.

Data Integrity and Security

Encryption: To prevent unauthorized access—especially during physical transport or cloud storage—backup data should be encrypted. Organizations must manage recovery keys carefully to ensure data can be decrypted during a restore.
Journaling: This process prevents data corruption during power failures. Data is first written to a "journal" on the drive. If power is lost during the final write to the database, the system uses the journal to complete or correct the information upon reboot.

2. Power Resiliency and Management

Power is the fundamental resource for all technology. Data centers must be engineered to handle blackouts (total loss), brownouts (drop in voltage), and surges (excessive voltage).

Uninterruptible Power Supplies (UPS)

A UPS provides immediate, battery-based power during an outage.

Offline/Standby UPS: Constantly runs on main power but switches to battery via an internal switch when power fails.
Line-Interactive UPS: Can slowly increase voltage during brownouts without switching entirely to battery.
Online/Double-Conversion UPS: Always runs systems from the battery, avoiding any switch-over time and providing the most stable power.

Generators

Generators provide long-term power as long as fuel is available. Because they take about a minute to "ramp up" and begin providing power, they are typically used in conjunction with a UPS, which covers the gap until the generator is stable.

3. Capacity Planning and Scalability

Capacity planning involves balancing supply (resources) and demand (user needs) to avoid application slowdowns or wasted expenditure.

4. Organizational Resiliency and Recovery

Resiliency is the ability of an organization to maintain uptime and availability through redundant configurations and disaster planning.

High Availability (HA) and Redundancy

High Availability: Systems are configured to be always on and running in parallel. If one fails, the other immediately takes the load.
Server Clustering: Multiple servers work together to appear as a single system. They often use shared storage to ensure all servers have synchronized data.
Load Balancing: A central device distributes requests across multiple independent servers. If one server fails, the load balancer removes it from the rotation.

Site Resiliency

When a primary data center is compromised, organizations move to a recovery site:

Hot Site: An exact replica of the data center with live, synchronized data and updated applications.
Warm Site: Contains some equipment and data, but requires additional hardware or data restoration to become fully functional.
Cold Site: An empty building with power and light. The organization must bring all equipment, data, and people.

Alternative Resiliency Methods

Geographical Dispersion: Placing recovery sites in different regions (e.g., a different state) to ensure a single natural disaster does not affect both locations.
Platform Diversity: Using different operating systems (Linux, Windows, macOS) to ensure a single OS vulnerability cannot take down the entire infrastructure.
Continuity of Operations Planning (COOP): Non-technical "failback" methods, such as using paper receipts or manual credit card processing when technology is unavailable.

5. Recovery Testing and Simulations

A disaster recovery plan is only effective if it is tested.

Tabletop Exercises: A group sits around a table and discusses the steps of a recovery plan to identify logistical gaps without actually disrupting systems.
Failover Test: Intentionally breaking a primary link (like a router or firewall) to see if redundant systems automatically take over.
Simulations: Phishing simulations are used to test if users click malicious links and to evaluate if automated detection systems are working.
Parallel Processing: Using multiple CPUs or computers to handle transactions simultaneously. This improves efficiency and provides resiliency; if one processor fails, the others handle the load.

The ability to recover from a disaster is not just about having a copy of your data; it is about the meticulous planning of power, people, and processes. Whether it is ensuring your database uses journaling to survive a power flicker or deploying a multi-cloud strategy to survive a provider outage, resiliency is a multi-layered discipline.

Security Forem