Storage & Databases

Storage & Databases

Database Indexes
B-tree, hash, composite, covering indexes, write overhead, and when not to index
B+ Tree Internals
Pages, fan-out, tree height, node structure, read and write paths, page splits, and write amplification
Hash Index
O(1) equality lookups, collision resolution, PostgreSQL USING HASH, InnoDB Adaptive Hash Index, and Bitcask
LSM Trees
Memtable, SSTables, Bloom filters, compaction strategies, and B-tree vs LSM tradeoffs
RDBMS Internals
ACID, MVCC, WAL, buffer pool, and query planner
SQL vs NoSQL
Decision framework, polyglot persistence, and NewSQL
Key-Value Stores (Redis)
Data structures, persistence, clustering, eviction, and use cases
Wide-Column Stores (Cassandra)
Partition key, consistency levels, compaction, and hot partition avoidance
Document Stores (MongoDB)
BSON, aggregation pipeline, replication, and sharding
Object Storage (S3)
Flat namespace, multipart upload, consistency model, and durability
Time-Series Databases
Append-mostly writes, compression, retention, and downsampling
Columnar Storage
Row vs columnar layout, compression, vectorized execution, Parquet, and OLAP vs OLTP
Consistent Hashing
Hash ring, virtual nodes, replication, hotspot mitigation, and real-world applications
Bloom Filters & HyperLogLog
Probabilistic membership testing, false positive tuning, counting variants, and cardinality estimation
Caching Patterns
Cache-aside, write-through, write-back, write-around, stampede, penetration, and avalanche
Cache Eviction
LRU, LFU, ARC, W-TinyLFU, CLOCK, FIFO, and choosing the right policy
Hot-Key / Hotspot Problems
Root cause, Redis key replication, partition bucketing, Kafka skew, and the celebrity problem
Full-Text Search (Elasticsearch)
Inverted index, BM25, Lucene segments, shard architecture, and relevance tuning