Skip to content

Parquet

Validation of Parquet data files

check_file

check_file(file: ParquetFile, spec: Filetype) -> dict[str, list[str]]

Compare FILE to the expectations described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.

check_meta

check_meta(file: ParquetFile, spec: DataObject) -> dict[str, list[str]]

Compare the metadata of FILE to the expectations described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.

check_schema

check_schema(file: ParquetFile, spec: DataObject) -> dict[str, list[str]]

Compare the schema of FILE to the schema described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.

convert_meta

convert_meta(raw_val: str, spec: ObjectMetadata) -> str | bool | float | int

Convert a Parquet metadata value (which is always a string) to a Python object of an appropriate type as specified by spec.