Large-scale Data Processing with Hadoop and PHP

Large-Scale Data Processing with Hadoop and PHP

The MapReduce framework promises to make computing of large sets of data very easy. The approach offers excellent scalability across many computing nodes, and can easily be integrated with existing systems. This session will give an introduction to the basic techniques and ideas behind MapReduce, followed by hands-on examples using Apache Hadoop, a major implementation of MapReduce, and Hadoop's streaming functionality that allows users to write processing jobs not just in Java, but in any programming language, including PHP.

Speaker: 

David Zülke

David Zülke

David Zülke is the lead developer of the Agavi project, an open source MVC framework for PHP, and managing director at Bitextender GmbH, a Munich, Germany based software company. He has been doing PHP development for more than ten years and regularly speaks at conferences around the world about lovely topics like HTTP, REST, CouchDB, MapReduce and, of course, PHP.

Video: