Sajeed Syed BakhtBatch Processing from the Ground Up: Part IIIIn the previous article, we dived into MapReduce. MapReduce took the data processing world by storm for providing fast and distributed…4 min read·Oct 26, 2020----
Sajeed Syed BakhtBatch Processing from the Ground Up: Part IIWe last left off on Unix pipelines and how its philosophy can help us scale up batch processing on a distributed network. We then…15 min read·Oct 24, 2020----
Sajeed Syed BakhtBatch Processing from the Ground Up: Part IBatch Processing Systems: A system that takes a large amount of input data and runs a job to process it and produces some output data…6 min read·Oct 9, 2020----
Sajeed Syed BakhtDatabases from the ground up Part IIIWe have shown how to think about data retrieval systems. We have covered LSM-Trees, B-Trees, how to think of segment files, and how memory…8 min read·Oct 5, 2020----
Sajeed Syed BakhtDatabases from the ground up Part IIWe last left off at hash indexes. We explored how hash indexes sped up the retrieval of data by keeping the offset of the key within a…11 min read·Oct 1, 2020----
Sajeed Syed BakhtDatabases from the ground up Part IIn the growing world of data lingo, you might have heard Online Analytical Processing(OLAP), Online Transaction Processing(OLTP), and Data…5 min read·Sep 27, 2020----
Sajeed Syed BakhtA dive into Data Models; how should you structure your data?The way we structure data inherently affects how we think and reason about the problem. For example, in a declarative language like SQL…5 min read·Sep 24, 2020----
Sajeed Syed BakhtReliability, Scalability, and Maintainability is all a Data System NeedsData has become the forefront of powering many applications. From complex machine learning algorithms to social media apps, to government…8 min read·Sep 23, 2020----