Page cover
Logo for the CsvPath Framework

Automated Data Preboarding

Stop Manual CSV & Excel Validation - Automate File Feed Ingestion

CsvPath is an open source framework that validates, cleans, and stages CSV/Excel files from data partners before they break your pipelines.

The open source CsvPath Framework is a data quality shift-left that enables you to control data entering the enterprise with less manual effort, fewer ingestion failures, and more agile development using a straightforward preboarding pattern.

Your data lake deserves a data publisher it can trust!

The Architecture For Efficient Data Ingestion

The CsvPath Framework implements the Collect, Store, Validate Publish architectural pattern. This data preboarding style of ingestion goes faster, is more cost-efficient, and is more effective. Why roll your own preboarding solution when there is a purpose-built option?

Out-of-the-box, CsvPath Framework is built to fill the blindspot between MFT (managed file transfer) and the data lake with a simple path to provably correct data.

This data onboarding blindspot is a big deal. Think about it. If even 1 in 30 companies depends heavily on CSV or Excel data, the lack of delimited file pre-boarding is a trillion-dollar problem. In our experience, 1 in 30 would be a low estimate.

A data flow diagram showing how CSV, Excel and other tabular data come into the organization through a preboarding process that acts as a Trusted Publisher to the data lake and applications.

Powerful CSV and Excel Validation

CSV and Excel validation is core to the Framework. CsvPath Validation Language is simple, easy to integrate, and flexible enough to handle the unexpected. Inspired by Schematron, XPath, and SQL, CsvPath Validation Language brings powerful data validation to less structured data. Start here.

Introducing FlightPath, the frontend to CsvPath Framework

FlightPath is a powerful new frontend to CsvPath Framework. Go beyond the Framework's built-in CLI. Get up and running faster with a purpose-built preboarding development and operations console. And FlightPath gives you all the help and examples you need move quickly. Available as a free download from the Microsoft Store and the Apple MacOS Store.

Together CsvPath Framework and FlightPath Data can help you build leadership's confidence that your data governance doesn't turn a blind eye to your most unruly data.

Logos of the many popular DataOps tools that are integrated with CsvPath Framework: aws s3, azure, slack, Excel, opentelemetry, sftp, ckan, pandas, openlineage, and more
CsvPath has a bunch of built-in integrations. Suggest more!

Last updated