Our hardware infrastructure comprises millions of machines, all of which generate logs that we need to process, store, and serve. The total size of these logs is several petabytes every hour. The o…
Scribe: Transporting petabytes per hour - Engineering at Meta
Facebook Hadoop Usecase
Facebook End of Business as Usual - Glenn's blog
A survey on the Distributed Computing stack - ScienceDirect
Reanalysis in Earth System Science: Toward Terrestrial Ecosystem Reanalysis - Baatz - 2021 - Reviews of Geophysics - Wiley Online Library
Blowing Past the Zettabyte Era - ENERGY TODAY
(PDF) ChronoLog: A Distributed Shared Tiered Log Store with Time-based Data Ordering
Big data and analytics
Critical analysis of Big Data challenges and analytical methods - ScienceDirect