Because of the nature of its business, Google has long been a pioneer in embracing both the challenges and opportunities of big data. Google has had to solve the same challenges that many companies face—the difference is the sheer scale of the problem. They’ve often had to invent entirely new approaches to meet the need of their businesses. Over the past decade, Google has developed many custom solutions to support their own products and services. They’ve documented many of these internal solutions in white papers and many have evolved into open source projects that now are the foundation of the Hadoop ecosystem.
We will go through three Google white papers
“The Google File System”
“Bigtable: A Distributed Storage System for Structured Data”
“MapReduce: Simpli_ed Data Processing on Large Clusters”