Project Awesome project awesome

Data quality > deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Package 3.6k stars GitHub
Back to Apache Spark