Image for Big data for chimps

Big data for chimps (First edition.)

See all formats and editions

Finding patterns in massive event streams can be difficult, but learning how to find them doesnt have to be.

This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop.

Youll gain a practical, actionable view of big data by working with real data and real problems.Perfect for beginners, this books approach will also appeal to experienced practitioners who want to brush up on their skills.

Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data.

As you work through several exercises, youll also learn how to use Apache Pig to process data.Learn the necessary mechanics of working with Hadoop, including how data and computation move around the clusterDive into map/reduce mechanics and build your first map/reduce job in PythonUnderstand how to run chains of map/reduce jobs in the form of Pig scriptsUse a real-world datasetbaseball performance statisticsthroughout the bookWork with examples of several analytic patterns, and learn when and where you might use them

Read More
Special order line: only available to educational & business accounts. Sign In
£24.50
Product Details
O'Reilly
1491923903 / 9781491923900
eBook (EPUB)
005.7
28/09/2015
China, People's Rep
English
220 pages
Copy: 100%; print: 100%
Description based on CIP data; resource not viewed.