Parquet
Validation of Parquet data files
check_file
check_file(file: ParquetFile, spec: Filetype) -> dict[str, list[str]]
Compare FILE to the expectations described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.
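Since the result is a plain dict of string lists, a caller can treat it as a report. The following is a minimal sketch of consuming such a result. It assumes, although the reference does not say so, that each key names a category of difference and each list holds human-readable messages. The helper `summarize_diffs` and the sample keys are hypothetical and are not part of this module.

```python
def summarize_diffs(diffs: dict[str, list[str]]) -> str:
    """Render a check result as text; an empty dict means the file passed."""
    if not diffs:
        return "OK"
    lines = []
    for category in sorted(diffs):
        lines.append(f"{category}:")
        lines.extend(f"  - {msg}" for msg in diffs[category])
    return "\n".join(lines)


# Hypothetical result shaped like the documented return type.
example = {
    "schema": ["missing column 'id'"],
    "metadata": ["unexpected key 'foo'"],
}
print(summarize_diffs(example))
```

Testing the result for emptiness first mirrors the documented contract that an empty dict means no significant differences were found.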
check_meta
check_meta(file: ParquetFile, spec: DataObject) -> dict[str, list[str]]
Compare the metadata of FILE to the expectations described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.
check_schema
check_schema(file: ParquetFile, spec: DataObject) -> dict[str, list[str]]
Compare the schema of FILE to the schema described by SPEC. Return a semi-structured description of the differences. An empty dict means no significant differences were found.
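To illustrate the kind of comparison described here, the sketch below diffs two schemas represented as plain column-name to type-name mappings. This is an assumption made for the sake of a self-contained example: it does not use ParquetFile or DataObject, and the function name `diff_schema` and the result keys are hypothetical, not the module's actual implementation or output format.

```python
def diff_schema(actual: dict[str, str], expected: dict[str, str]) -> dict[str, list[str]]:
    """Compare two column-name -> type-name mappings and describe the differences."""
    missing = sorted(expected.keys() - actual.keys())
    unexpected = sorted(actual.keys() - expected.keys())
    mismatched = [
        f"{name}: expected {expected[name]}, found {actual[name]}"
        for name in sorted(expected.keys() & actual.keys())
        if expected[name] != actual[name]
    ]

    # Only report categories that actually contain differences, so that an
    # empty dict means the schemas match.
    diffs: dict[str, list[str]] = {}
    for key, messages in (
        ("missing_columns", missing),
        ("unexpected_columns", unexpected),
        ("type_mismatches", mismatched),
    ):
        if messages:
            diffs[key] = messages
    return diffs


print(diff_schema({"id": "int32", "x": "string"}, {"id": "int64", "name": "string"}))
```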
convert_meta
convert_meta(raw_val: str, spec: ObjectMetadata) -> str | bool | float | int
Convert RAW_VAL, a Parquet metadata value (which is always a string), to a Python object of the appropriate type as specified by SPEC.
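A minimal sketch of this kind of string-to-type conversion follows. How ObjectMetadata expresses the target type is not documented here, so the sketch assumes a hypothetical `type_name` string in its place. The function name `convert_value` and the accepted boolean spellings are likewise assumptions, not this module's behavior.

```python
from typing import Union


def convert_value(raw_val: str, type_name: str) -> Union[str, bool, float, int]:
    """Convert a string metadata value to the Python type named by type_name."""
    if type_name == "str":
        return raw_val
    if type_name == "int":
        return int(raw_val)
    if type_name == "float":
        return float(raw_val)
    if type_name == "bool":
        # bool("false") is True in Python, so parse the text explicitly.
        lowered = raw_val.strip().lower()
        if lowered in ("true", "1"):
            return True
        if lowered in ("false", "0"):
            return False
        raise ValueError(f"not a boolean: {raw_val!r}")
    raise ValueError(f"unsupported type name: {type_name!r}")


print(convert_value("42", "int"), convert_value("False", "bool"))
```

Booleans get explicit parsing because calling `bool()` on any non-empty string, including "false", returns True.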