# CSV, Excel and JSONL Ingestion


<figure><img src="https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2FRIv0YEi1VT4LHsrCAtRr%2Flogo-wordmark-200dpi-428x105.png?alt=media&#x26;token=628c0b3c-32f0-460c-a304-508d3864b2b4" alt="Logo for the CsvPath Framework" width="321"><figcaption></figcaption></figure>

<h2 align="center">Automate CSV, Excel, and JSONL Data Preboarding</h2>

<h3 align="center">End Manual File-by-file Validation!</h3>

#### CsvPath Framework registers, versions, validates, upgrades, and stages CSV, Excel, and JSONL files from data partners before they break your pipelines.

[CsvPath Framework](https://github.com/csvpath/csvpath) is an open source data quality framework. Shift left: control data entering the enterprise with **less manual effort, fewer ingestion failures, and more agile development** using consistent data preboarding [you can try in minutes](https://www.csvpath.org/getting-started).

<figure><img src="https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2FenNshk0Hh2pjTL2xtAej%2FScreenshot%202025-03-09%20at%206.45.40%E2%80%AFPM.png?alt=media&#x26;token=1169d82b-ce59-4f6a-9d38-d91b0bee2235" alt="" width="375"><figcaption><p>Your data lake deserves a data publisher it can trust!</p></figcaption></figure>

### Introducing FlightPath Data, the frontend to CsvPath Framework

<figure><img src="https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2F4EaKgKH0UKdpVY0XEEGI%2Fflightpath-logo-1-sm.png?alt=media&#x26;token=9786cf7f-b3d4-466c-b708-a4d3948bace9" alt=""><figcaption></figcaption></figure>

[**FlightPath Data** is a powerful new frontend to CsvPath Framework](https://www.flightpathdata.com/). Go beyond CsvPath Framework's built-in CLI. Get up and running faster with a purpose-built preboarding development and operations console. FlightPath Data gives you all the help and examples you need to move quickly.

FlightPath Data is bundled with **FlightPath Server**, the automation REST API connecting your existing infrastructure to data preboarding.

Available as a *free* download from the [Microsoft Store](https://apps.microsoft.com/detail/9p9pbpkz4jdf?hl=en-US\&gl=US) and the [Apple Mac App Store](https://apps.apple.com/us/app/flightpath-data/id6745823097).

### The Architecture For Efficient Data File Feed Ingestion

**CsvPath Framework** implements the [**Collect, Store, Validate, Publish architectural pattern**](https://static1.squarespace.com/static/66df9d47982d0d40e1574327/t/6771fe6f63bbf5361725ad05/1735523953587/The+Collect+Store+Validate+Pattern+-+Atesta+Analytics.pdf). With a preboarding stage, ingestion is faster, more cost-efficient, and more effective.
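To make the four stages concrete, here is a minimal, illustrative sketch of the pattern in plain Python. It does not use the CsvPath Framework API; every function name here (`collect`, `store`, `validate`, `publish`, `preboard`) is hypothetical, and the in-memory dictionaries stand in for whatever registry and publishing location a real deployment would use.

```python
import csv
import hashlib
import io


def collect(raw: bytes, name: str) -> dict:
    """Collect: register an arriving file with a content fingerprint."""
    return {"name": name, "sha256": hashlib.sha256(raw).hexdigest(), "raw": raw}


def store(registry: dict, item: dict) -> int:
    """Store: keep every arrival as a new version; return its 1-based number."""
    versions = registry.setdefault(item["name"], [])
    versions.append(item)
    return len(versions)


def validate(item: dict, required: list) -> list:
    """Validate: return a list of problems; an empty list means the file passed."""
    reader = csv.DictReader(io.StringIO(item["raw"].decode("utf-8")))
    missing = [h for h in required if h not in (reader.fieldnames or [])]
    if missing:
        return [f"missing header: {h}" for h in missing]
    errors = []
    for row_no, row in enumerate(reader, start=1):
        for h in required:
            # Short rows yield None for absent fields, so normalize first.
            if not (row[h] or "").strip():
                errors.append(f"row {row_no}: empty {h}")
    return errors


def publish(published: dict, item: dict, errors: list) -> bool:
    """Publish: only files with no validation errors reach consumers."""
    if errors:
        return False
    published[item["name"]] = item["raw"]
    return True


def preboard(raw, name, required, registry, published) -> dict:
    """Run one file through all four stages in order."""
    item = collect(raw, name)
    version = store(registry, item)
    errors = validate(item, required)
    ok = publish(published, item, errors)
    return {"version": version, "errors": errors, "published": ok}
```

In this sketch an invalid file is still stored and versioned, so there is a record of what arrived, but it is never published downstream.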

CsvPath Framework was built to fill the blindspot between MFT (managed file transfer) and the data lake with a simple path to provably correct data.

This data preboarding blindspot is a big deal. Think about it. If even 1 in 30 companies depends heavily on CSV or Excel data, the *lack of **delimited file preboarding is a trillion-dollar problem***.

<figure><img src="https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2F4ZPwvK3V1mvCptJLcrnD%2Fdata-flow.png?alt=media&#x26;token=e9d67078-0cb8-43d2-a5f9-b5cd0ad1b72a" alt="A data flow diagram showing how CSV, Excel and other tabular data come into the organization through a preboarding process that acts as a Trusted Publisher to the data lake and applications."><figcaption></figcaption></figure>

Why roll your own preboarding? CsvPath Framework is a purpose-built solution you can roll out now.

### Powerful CSV and Excel Validation

CSV and Excel validation is core to the Framework. **CsvPath Validation Language** is simple, easy to integrate, and flexible enough to handle the unexpected. Inspired by Schematron, XPath, and SQL, CsvPath Validation Language brings powerful data validation to less structured data. [Start here](https://www.csvpath.org/topics/higher-level-topics/validation/schemas-or-rules).

Together, CsvPath Framework and FlightPath Data can help you build leadership's confidence that your data governance doesn't turn a blind eye to your most unruly data.

### Integrated With Your Existing Tools

<figure><img src="https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2FzlYFVV4xbiaKZb6uc0TR%2Fintegration_logos.png?alt=media&#x26;token=86d6e931-f247-412d-b870-2192f6c4e22b" alt="Logos of the many popular DataOps tools that are integrated with CsvPath Framework: aws s3, azure, slack, Excel, opentelemetry, sftp, ckan, pandas, openlineage, and more" width="563"><figcaption><p>CsvPath has a bunch of built-in integrations. Suggest more!</p></figcaption></figure>

[Parquet](https://www.csvpath.org/topics/how-tos/parquet) / [Sqlite](https://www.csvpath.org/topics/how-tos/sqlite) / [S3](https://www.csvpath.org/topics/how-tos/storage-backends/aws-s3) / [OpenLineage](https://www.csvpath.org/getting-started/dataops-integrations/openlineage) / [OpenTelemetry](https://www.csvpath.org/getting-started/dataops-integrations/opentelemetry) / [Slack](https://www.csvpath.org/topics/how-tos/slack) / [Postgres](https://www.csvpath.org/topics/how-tos/postgres-and-mysql) / [MySQL](https://www.csvpath.org/topics/how-tos/postgres-and-mysql) / [Azure](https://www.csvpath.org/topics/how-tos/storage-backends/azure) / [Google Cloud Storage](https://www.csvpath.org/topics/how-tos/storage-backends/google-cloud-storage) / [Webhooks](https://www.csvpath.org/topics/how-tos/webhooks) / [JSONPath](https://github.com/csvpath/csvpath/blob/main/docs/func_gen/jsonpath.md) / [XPath](https://github.com/csvpath/csvpath/blob/main/docs/func_gen/xpath.md) / [Airflow](https://www.csvpath.org/topics/how-tos/airflow) and more.

### Give CsvPath Framework a Try

* [Getting Started](https://www.csvpath.org/getting-started)
* [How-tos](https://www.csvpath.org/topics/how-tos)
* [DataOps Integrations](https://www.csvpath.org/getting-started/dataops-integrations)
* [The FlightPath Examples](https://www.csvpath.org/getting-started/the-flightpath-data-examples)

{% hint style="success" %}
For more background on preboarding and the CsvPath and FlightPath architecture, [check out the CsvPath blog](https://blog.csvpath.org/).
{% endhint %}

{% file src="<https://2402701329-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F6wzqgDHj9mZbFxabiEbc%2Fuploads%2Fv5A894jerZ8gZeiwtErK%2FCsvPath%20-%20Data%20Onboarding%20Simplified.pdf?alt=media&token=3f272f16-ff77-415c-b1ab-b1be10f8da20>" %}

{% embed url="<https://github.com/csvpath/csvpath>" %}

