Project Awesome project awesome

Kreuzberg

Polyglot document intelligence library with a Rust core and bindings for Python, TypeScript, Go, and more. Extracts text, tables, and metadata from 62+ document formats for data pipeline ingestion.

Package 7.1k stars GitHub
Back to Data Engineering