Wednesday, August 22, 12

INFRASTRUCTURE

Big Data

Big Data

2.5B - content items shared
2.7B - Likes
300M - photos uploaded
100+PB - disk space in a single HDFS cluster
105TB - data scanned via Hive (30min)
70,000 - queries executed
500+TB - new data ingested

Life of data at Facebook


[Diagram labels: Data Tools, Workflow, Hive, MapReduce, Real-time, HDFS, Import, Copier/Loader, Realtime Analytics (PUMA), Scribe/ScribeH, www.facebook.com, UDB]
