CsvPath
CsvPath is a declarative validation language to verify that CSV, Excel, and other delimited files meet expectations.
The CsvPath language and its Python library are simple, easy to integrate, and flexible enough to handle the unexpected quirks of CSV and Excel data. Inspired by Schematron, XPath, and the Collect, Store, Validate design pattern, CsvPath brings rules-based data validation to less structured data.
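To make "rules-based validation" concrete, here is a minimal sketch in plain Python using only the standard library. It is not CsvPath syntax and does not use the CsvPath library; the `RULES` list, the `validate` function, and the sample data are all hypothetical names invented for illustration. It only shows the general idea of declaring expectations separately from the code that walks the file.

```python
import csv
import io

# Hypothetical rules, written as (description, predicate) pairs.
# Illustrative only: this is NOT CsvPath syntax or the CsvPath API.
RULES = [
    ("row has exactly 3 fields", lambda row: len(row) == 3),
    ("id is a positive integer",
     lambda row: len(row) > 0 and row[0].isdecimal() and int(row[0]) > 0),
    ("email contains '@'", lambda row: len(row) > 2 and "@" in row[2]),
]


def validate(text, rules=RULES, has_header=True):
    """Return (line_number, rule_description) for every rule a row fails."""
    failures = []
    reader = csv.reader(io.StringIO(text))
    for line_number, row in enumerate(reader, start=1):
        if has_header and line_number == 1:
            continue  # skip the header line
        for description, predicate in rules:
            if not predicate(row):
                failures.append((line_number, description))
    return failures


# Made-up sample data: one clean row, one with bad values, one short row.
SAMPLE = "id,name,email\n1,Ada,ada@example.com\nx,Bob,bob.example.com\n3,Cy\n"

for line_number, description in validate(SAMPLE):
    print(f"line {line_number}: {description}")
```

Keeping the rules in a data structure, apart from the loop that applies them, is what makes this approach declarative: adding an expectation means adding a rule, not rewriting the reader.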
Think about it. If even 1 in 30 companies depends heavily on CSV or Excel data, the lack of good delimited file validation is a trillion-dollar problem. In our experience, 1 in 30 would be a low estimate.
CsvPath isn't a silver bullet for reams of messy delimited data, but it can help build confidence that your data governance doesn't turn a blind eye to your most unruly data.
Take a look through these pages and cruise over to the detailed docs on the CsvPath GitHub to see if open source CSV and Excel data validation should be part of your DataOps toolkit.