Skip to content

Conversation

@alamb
Copy link
Owner

@alamb alamb commented Sep 30, 2025

Rationale

This PR is part of testing how much faster Parquet thrift decoding would be if we extended the APIs to skip parsing the portions that are not needed for query, specifically various statistics from the parquet footer metadata (aka column stats and encoding stats)

I do not intend to propose merging this (ever)

Contents:

It is a PR to the most recent version of the thrift-remodel branch from @etseidl here

With the code for deserializing statistics disabled.

@alamb alamb changed the title Demo: Hack off parsing statistics Demo: Skip parsing all statistics Sep 30, 2025
@alamb alamb closed this Oct 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant