Reference

Parquet

Parquet is an open columnar storage format for large datasets, widely used in data analytics and big-data tools. By storing data column by column with compression, it makes analytical queries fast and files much smaller than CSV.

Files & formatsGeneral

Parquet

Also known as: .parquet file, Apache Parquet, columnar storage

Open columnar storage format for analytics
Compresses well; smaller and faster than CSV
Binary; read with data tools, not a text editor

Why columnar storage matters

A CSV stores data row by row. Parquet stores it column by column, so a query that reads only a few columns skips the rest entirely, and similar values in a column compress extremely well.

That design makes Parquet the default for analytics engines and data lakes. It is a binary format, not human-readable, and is meant to be read by data tools rather than opened in a text editor.

Parquet vs CSV and other formats

Compared with CSV, Parquet files are typically far smaller for the same data and much faster to query, while also preserving column data types. The trade-off is that you need a library or tool to read them.

Within the big-data world, Parquet is column-oriented while Avro is row-oriented; the two are often used together in data pipelines.

Related terms

Keep reading the reference.

Files & formatsCSVCSV (comma-separated values) is a plain-text format that stores tabular data as rows of values separated by commas. It carries no formatting, formulas, or charts, which makes CSV files tiny and universally readable across apps and programming languages.General Files & formatsAvroAvro is an open row-based data serialization format from the Apache ecosystem. It stores records together with their schema, making it compact and self-describing, and is common in data pipelines and streaming such as Kafka.General Web & SEOStructured data (schema markup)Structured data is machine-readable code, usually written in JSON-LD using the schema.org vocabulary, that labels what a page is about — an article, product, recipe, or FAQ. Search engines use it to understand content and to power rich results like star ratings and FAQ dropdowns.General iPhone & iPadDocuments & DataDocuments & Data is the storage an app accumulates on top of its install size — downloads, saved files, login state, and cached media. It often dwarfs the app itself, which is why a small app can occupy several gigabytes.iOSiPadOS

Act on it

Guides and tools for this topic.

CSV to JSON