Page cover

CSV & Excel Ingestion for Data Engineers

Stop writing custom import and validation scripts. CsvPath Framework automates CSV and Excel ingestion and data quality checks. Open source. Python.

Logo for the CsvPath Framework

Automated Data Preboarding

End Manual Validation • Automate File Feed Ingestion

CsvPath Framework registers, validates, upgrades, and stages CSV and Excel files from data partners before they break your pipelines.

CsvPath Framework is an open source data quality shift-left. It enables you to control data entering the enterprise with less manual effort, fewer ingestion failures, and more agile development using a data preboarding pattern you can try in minutes.

Your data lake deserves a data publisher it can trust!

Introducing FlightPath Data, the frontend to CsvPath Framework

FlightPath Data is a powerful new frontend to CsvPath Framework. Go beyond CsvPath Framework's built-in CLI. Get up and running faster with a purpose-built preboarding development and operations console. FlightPath Data gives you all the help and examples you need move quickly.

FlightPath Data is bundled with FlightPath Server, the automation REST API connecting your existing infrastructure to data preboarding.

Available as a free download from the Microsoft Store and the Apple MacOS Store.

The Architecture For Efficient Data File Feed Ingestion

CsvPath Framework implements the Collect, Store, Validate Publish architectural pattern. Ingestion goes faster, is more cost-efficient, and more effective with a preboarding stage.

CsvPath Framework was built to fill the blindspot between MFT (managed file transfer) and the data lake with a simple path to provably correct data.

This data preboarding blindspot is a big deal. Think about it. If even 1 in 30 companies depends heavily on CSV or Excel data, the lack of delimited file preboarding is a trillion-dollar problem.

A data flow diagram showing how CSV, Excel and other tabular data come into the organization through a preboarding process that acts as a Trusted Publisher to the data lake and applications.

Why roll your own preboarding? CsvPath Framework is a purpose-built solution you can rollout now.

Powerful CSV and Excel Validation

CSV and Excel validation is core to the Framework. CsvPath Validation Language is simple, easy to integrate, and flexible enough to handle the unexpected. Inspired by Schematron, XPath, and SQL, CsvPath Validation Language brings powerful data validation to less structured data. Start here.

Together CsvPath Framework and FlightPath Data can help you build leadership's confidence that your data governance doesn't turn a blind eye to your most unruly data.

Integrated With Your Existing Tools

Logos of the many popular DataOps tools that are integrated with CsvPath Framework: aws s3, azure, slack, Excel, opentelemetry, sftp, ckan, pandas, openlineage, and more
CsvPath has a bunch of built-in integrations. Suggest more!

Give CsvPath Framework a Try