Pydoop is a Python interface to Hadoop that allows you to write MapReduce
applications in pure Python.

Pydoop offers several features not commonly found in other Python
libraries for Hadoop:

-- a rich HDFS API;
-- a MapReduce API that allows to write pure Python record readers / writers,
   partitioners and combiners;
-- transparent Avro (de)serialization;
-- easy installation-free usage;

WWW: http://crs4.github.io/pydoop/
