Utilities

MD5 Checksum Tools

utils.md5sum_check.check_sums(tmp_dir: str, md5sum_file: str, dir_prefix: str = 'lookup_tables_', required_files: List[str] | None = None) List[str]

Validate parquet files in tmp subdirectories against a checksum manifest.

Returns a list of valid parquet file paths when a fully valid directory is found. Returns an empty list when no valid directory exists.

Output Mapping