Hadoop
Framework for distributed storage and processing of very large data sets.
Hadoop
Elasticsearch Hadoop 2.0k
updated 1mo ago
Elasticsearch real-time search and analytics natively integrated with Hadoop. Supports Map/Reduce, Cascading, Apache Hive and Apache Pig.
mrjob 2.6k
updated 3y ago
mrjob is a Python 2.5+ package that helps you write and run Hadoop Streaming jobs.
hdfs-du 228
(archived)
HDFS-DU is an interactive visualization of the Hadoop distributed file system.