Pratik's Blog
  • Home
  • About
  • Portfolio
Sign in Subscribe

Big-Data

A collection of 4 posts
Distributing Key Values
Distributed-Systems

Distributing Key Values

Consistency is a virtue of mules?
07 Nov 2024 4 min read
PipeDream: The Dreamweaver of Distributed Training
Big-Data

PipeDream: The Dreamweaver of Distributed Training

The authors of PipeDream set out to solve an ambitious problem - how do you train a colossal model on a distributed cluster? Dividing the vast dataset required to support model training is certainly needed. With huge models, we may even need to segment the model itself into various stages.
17 Feb 2024 6 min read
MapReduce: Distributed Computing For All
Big-Data

MapReduce: Distributed Computing For All

Computer science is the science of abstractions. So much simply happens because a group of developers somewhere set out to abstract out a complex process for developers everywhere. But there's still a balancing act. Too many abstractions introduces too many overheads - too few and we're
10 Feb 2024 5 min read
Google File System: Chunk and Conquer
Big-Data

Google File System: Chunk and Conquer

The Google File System is a widely studied distributed file system. Its ability to provide a scalabily and fault tolerance on inexpensive commodity hardware made it a platform of choice within Google. The design goals of this file system was similar to its predecessors - performance, scalability, reliability and availability
29 Jan 2024 8 min read
Page 1 of 1
Pratik's Blog © 2025
Powered by Ghost