logo
  1. If we are dealing with a read-heavy system, it's good to consider using a Cache
  1. If we need low latency in a system, it's good to consider using a Cache & CDN
  1. If we are dealing with a write-heavy system, it's good to consider using a Message Queue for Async processing
  1. If we need a system to be ACID compliant, we should go for RDBMS or SQL Database
  1. If data is unstructured & doesn't require ACID properties, we should go for NO- SQL Database
  1. If the system has complex data in the form of videos, images, files etc, we should go for Blob/Object storage
  1. If the system requires complex pre-computation like a news feed, we should consider using a Message Queue & Cache
  1. If the system requires searching data in high volume, we should consider using a search index, tries, or a search engine like Elasticsearch
  1. If the system requires to Scale SQL Database, we should consider using Database Sharding
  1. If the system requires High Availability, Performance, and Throughput, we should consider using a Load Balancer
  1. If the system requires faster data delivery globally, reliability, high availability, and performance, we should consider using a CDN
  1. If the system has data with nodes, edges, and relationships like friend lists, and road connections, we should consider using a Graph Database
  1. If the system needs scaling of various components like servers, databases, etc, we should consider using Horizontal Scaling
  1. If the system requires high-performing database queries, we should consider using Database Indexes
  1. If the system requires bulk job processing, we should consider using Batch Processing & Message Queues
  1. If the system requires reducing server load and preventing DOS attacks, we should consider using a Rate Limiter
  1. If the system has microservices, we should consider using an API Gateway (Authentication, SSL Termination, Routing, etc)
  1. If the system has a single point of failure, we should implement Redundancy in that component
  1. If the system needs to be fault-tolerant, and durable, we should implement Data Replication (creating multiple copies of data on different servers)
  1. If the system needs user-to-user communication (bi-directional) in a fast way, we should consider using Websockets
  1. If the system requires the ability to detect failures in a distributed system, we should consider implementing Heartbeat
  1. If the system needs to ensure data integrity, we should consider implementing Checksum Algorithm
  1. If the system needs to transfer data between various servers in a decentralized way, we should go for Gossip Protocol
  1. If the system needs to scale servers with add/removal of nodes efficiently, with no hotspots, we should implement Consistent Hashing
  1. If the system needs anything to deal with a location like maps, or nearby resources, we should consider using Quadtree, Geohash, etc
  1. Avoid using any specific technology names such as - Kafka, S3, or EC2. Try to use more generic names like message queues, object storage, etc
  1. If High Availability is required in the system, it's better to mention that system cannot have strong consistency. Eventual Consistency is possible
  1. If the system needs to limit the huge amount of data for a network request like youtube search, trending videos, etc. One way is to implement Pagination which limits response data.
Share