System Design

Archit Rathod, Donne Martin, ChatGPT,Interview PreparationSystem DesignSoftware Engineering

System Design

1. Scalability

Scalability is the ability of a system to handle increased load by adding resources (servers, CPUs, RAM, etc.). It’s one of the most critical aspects of system design when dealing with growing users and data.

There are two main approaches to scalability:

  1. Vertical Scaling (Scale-Up)
  2. Horizontal Scaling (Scale-Out)

Vertical Scaling (Scale-Up)

Theory:

Vertical scaling refers to increasing the capacity of a single server (e.g., adding more CPU, RAM, or disk space). Think of it like upgrading a computer to make it faster and handle more load.

Diagram:

Single Server (Low capacity)


+---------------+
|    Server 1   |  ← Upgrade CPU/RAM/Storage
+---------------+


Improved Single Server (High capacity)

Example:

Suppose you have a small e-commerce website running on one server. To handle more traffic, you add:

The server now performs better but still has its limitations.


Pros and Cons:

ProsCons
Simple to implement and manage.Hardware has physical limits (CPU, RAM, etc.).
No changes in software architecture.Single point of failure if the server crashes.
Good for low traffic systems.Expensive for very high-end hardware.

Horizontal Scaling (Scale-Out)

Theory:

Horizontal scaling involves adding more servers to the system. Instead of upgrading a single server, you distribute the load across multiple servers.

This approach is much better for handling large-scale traffic.

Diagram:

         +---------------+         +---------------+
Clients  →|  Load Balancer |------->|    Server 1   |
         +---------------+         +---------------+
                 │                         │
                 ▼                         ▼
         +---------------+         +---------------+
         |    Server 2   |         |    Server 3   |
         +---------------+         +---------------+

Example:

Imagine an online streaming platform like YouTube. To serve millions of users:


Pros and Cons:

ProsCons
No single point of failure (redundancy).More complex to implement and manage.
Can handle unlimited traffic.Requires load balancing and data replication.
Flexible and cost-effective.Ensuring consistency across servers is harder.

Comparison Between Vertical and Horizontal Scaling:

FeatureVertical ScalingHorizontal Scaling
Scaling approachAdd resources to a single serverAdd more servers.
LimitationsHardware limitsNearly unlimited.
CostExpensive hardware upgradesCost-effective with commodity servers.
ComplexityLowHigher (load balancing, etc.).
Single point failureYesNo (redundant servers).

Caching

Theory:

Caching involves storing frequently accessed data in a temporary, fast-access location (like memory). This reduces the load on databases and speeds up response times.

Caches can be:


Diagram:

          +------------------+
          |   Application    |
          +------------------+

   +----------------▼---------------+
   |             Cache              |
   | (e.g., Redis, Memcached)       |
   +----------------▲---------------+

          +------------------+
          |    Database      |
          +------------------+

Example:

A user frequently views their profile on a social media app. To avoid repeatedly querying the database:

  1. User’s profile data is cached in Redis.
  2. On subsequent requests, the server retrieves data from the cache.

Types of Caching:

  1. Read-through Cache: Data is fetched from the database and stored in the cache when requested.
  2. Write-through Cache: Updates are written to the cache and database simultaneously.
  3. Cache-aside: Application decides what to cache.
  4. CDN (Content Delivery Network): Caches static content (e.g., images, videos).

Eviction Policies:

When cache is full, old data must be evicted:


Load Balancing

Theory:

Load balancing distributes incoming requests across multiple servers to:

  1. Prevent overloading any single server.
  2. Ensure high availability and fault tolerance.

Diagram:

       Clients


 +------------------+
 |  Load Balancer   |
 +------------------+
    │      │      │
    ▼      ▼      ▼
+-------+ +-------+ +-------+
|Server 1| |Server 2| |Server 3|
+-------+ +-------+ +-------+

Types of Load Balancers:

  1. DNS-based: DNS resolves requests to different servers.
  2. Layer 4 (Transport): Balances traffic based on IP and port.
  3. Layer 7 (Application): Balances traffic based on application data (e.g., URL, cookies).

Example:

Imagine Google Search. Millions of requests are handled simultaneously by:

  1. A load balancer that routes user requests to healthy servers.
  2. Servers handle the requests and send responses back.

Benefits:


Database Replication

Theory:

Database replication involves copying data from a master database to one or more replica databases. This improves:


Diagram:

     +------------------+
     |   Master DB      |
     +--------▲---------+

   +----------▼-----------+
   |       Replica DBs    |
   +-----------+----------+
        │             │
        ▼             ▼
+-------------+ +-------------+
| Read Query  | | Read Query  |
+-------------+ +-------------+

Types:

  1. Master-Slave Replication:

    • Writes go to the master.
    • Reads go to replicas.
  2. Master-Master Replication:

    • Both nodes handle reads and writes.

Example:

In a global app like Instagram:


Database Partitioning (Sharding)

Theory:

Partitioning (sharding) splits large databases into smaller, manageable parts (shards). Each shard contains a subset of the data.


Diagram:

          +--------------------+
          |    Load Balancer   |
          +---------▲----------+

  +---------+-------+-------+---------+
  | Shard 1 | Shard 2 | Shard 3 | Shard 4 |
  +---------+---------+---------+---------+

Example:

In a large user database:

Benefits:


Challenges:


Summary

ConceptPurpose
Vertical ScalingAdd resources to a single server.
Horizontal ScalingAdd more servers to handle the load.
CachingSpeed up responses using temporary storage.
Load BalancingDistribute traffic across servers.
Database ReplicationCopy data to improve read performance.
Database PartitioningSplit data into smaller, manageable parts.

Add a Comment

Archit Rathod

Archit Rathod

Software Developer and tech enthusiast. Passionate about web development and creating user-friendly experiences.

Read More