Project Awesome project awesome

Data Frames > pysparkling

A pure Python implementation of Apache Spark's RDD and DStream interfaces. <img height="20" src="img/spark_big.png" alt="Apache Spark based">

Package 271 stars GitHub
Back to Data Science