
Profiler

Column-level data profiler: computes summary statistics for the columns of a data source and suggests quality checks based on the resulting profiles.

ColumnProfile(name, dtype, total_count=0, null_count=0, null_pct=0.0, distinct_count=0, distinct_pct=0.0, min_value=None, max_value=None, mean_value=None, median_value=None, stddev_value=None, min_length=None, max_length=None, avg_length=None, top_values=list()) dataclass

Statistical profile of a single column.
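As a rough illustration of how these fields fit together, the sketch below declares a local stand-in dataclass with the same field names and defaults as the signature above. The type annotations, the use of a string for `dtype`, and the assumption that `null_pct` and `distinct_pct` are percentages on a 0-100 scale are guesses, not taken from the library.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class ColumnProfile:
    """Local stand-in mirroring the documented fields; not the library class."""
    name: str
    dtype: str  # assumption: the column type is carried as a string
    total_count: int = 0
    null_count: int = 0
    null_pct: float = 0.0
    distinct_count: int = 0
    distinct_pct: float = 0.0
    min_value: Any = None
    max_value: Any = None
    mean_value: Optional[float] = None
    median_value: Optional[float] = None
    stddev_value: Optional[float] = None
    min_length: Optional[int] = None
    max_length: Optional[int] = None
    avg_length: Optional[float] = None
    top_values: list = field(default_factory=list)


# Only name and dtype are required; every statistic has a default.
empty = ColumnProfile(name="email", dtype="varchar")

# Assumption: the *_pct fields are percentages on a 0-100 scale.
total, nulls, distinct = 200, 10, 150
email = ColumnProfile(
    name="email",
    dtype="varchar",
    total_count=total,
    null_count=nulls,
    null_pct=100.0 * nulls / total,
    distinct_count=distinct,
    distinct_pct=100.0 * distinct / total,
    min_length=6,
    max_length=40,
)
```

The numeric fields (`mean_value`, `stddev_value`, and so on) and the length fields (`min_length`, `avg_length`, and so on) all default to `None`, which suggests that each group is only filled in for column types where it is meaningful.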

TableProfile(table, row_count, column_count, columns=list()) dataclass

Statistical profile of a table.
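A table profile groups the per-column profiles under table-level counts. The sketch below uses local stand-ins (a trimmed `ColumnProfile` with only a few of its fields) to show one way to navigate such an object. It assumes `columns` holds `ColumnProfile` instances and that `column_count` equals the length of that list, neither of which is confirmed by the signature alone.

```python
from dataclasses import dataclass, field


@dataclass
class ColumnProfile:
    """Trimmed stand-in: only a few of the documented fields."""
    name: str
    dtype: str
    total_count: int = 0
    null_count: int = 0


@dataclass
class TableProfile:
    """Stand-in mirroring the documented signature; not the library class."""
    table: str
    row_count: int
    column_count: int
    columns: list = field(default_factory=list)


cols = [
    ColumnProfile("id", "integer", total_count=3),
    ColumnProfile("email", "varchar", total_count=3, null_count=1),
]
orders = TableProfile(
    table="orders", row_count=3, column_count=len(cols), columns=cols
)

# Index the column profiles by name for convenient lookup.
by_name = {c.name: c for c in orders.columns}

# List the columns that contain at least one null.
columns_with_nulls = [c.name for c in orders.columns if c.null_count > 0]
```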

profile_table(connection, table, sample_size=None)

Generate a statistical profile of a table.
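The reference does not say how the statistics are computed or what kind of object `connection` is. As a minimal sketch of the general technique, the hypothetical helper below (`profile_column`, not part of the library) profiles one column of a SQLite table with a single aggregate query. Treating `sample_size` as "the first N rows" is an assumption made for illustration.

```python
import sqlite3


def profile_column(conn, table, column, sample_size=None):
    """Compute basic statistics for one column with plain SQL.

    Identifiers are interpolated directly, so pass trusted names only.
    """
    source = f'"{table}"'
    if sample_size is not None:
        # Assumption: sampling means "first N rows"; the library may differ.
        source = f'(SELECT * FROM "{table}" LIMIT {int(sample_size)})'
    total, non_null, distinct, lo, hi = conn.execute(
        f'SELECT COUNT(*), COUNT("{column}"), COUNT(DISTINCT "{column}"), '
        f'MIN("{column}"), MAX("{column}") FROM {source}'
    ).fetchone()
    nulls = total - non_null
    return {
        "name": column,
        "total_count": total,
        "null_count": nulls,
        "null_pct": 100.0 * nulls / total if total else 0.0,
        "distinct_count": distinct,  # COUNT(DISTINCT ...) ignores NULLs
        "min_value": lo,
        "max_value": hi,
    }


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, 10.0), (2, None), (3, 30.0), (4, 10.0)],
)
stats = profile_column(conn, "orders", "amount")
sampled = profile_column(conn, "orders", "amount", sample_size=2)
```

Pushing the aggregation into the database keeps the full table from being loaded into memory, which is one plausible reason a profiler would accept a connection rather than a data frame.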

suggest_checks(profile)

Suggest quality checks based on a table profile.
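The rules that `suggest_checks` applies are not listed here. The sketch below shows the kind of heuristic such a function might use, working on plain dictionaries that borrow the `ColumnProfile` field names. The function name `suggest_checks_sketch`, the check types (`not_null`, `unique`, `range`), and the shape of the returned dictionaries are all hypothetical.

```python
def suggest_checks_sketch(columns):
    """Derive candidate checks from per-column statistics (plain dicts)."""
    checks = []
    for col in columns:
        total = col.get("total_count", 0)
        if total == 0:
            continue  # nothing observed, so nothing to suggest
        if col.get("null_count", 0) == 0:
            # No nulls seen: propose that the column stays non-null.
            checks.append({"type": "not_null", "column": col["name"]})
        if col.get("distinct_count", 0) == total:
            # Every value is different: propose a uniqueness check.
            checks.append({"type": "unique", "column": col["name"]})
        lo, hi = col.get("min_value"), col.get("max_value")
        if isinstance(lo, (int, float)) and isinstance(hi, (int, float)):
            # Numeric bounds seen: propose the observed range as a limit.
            checks.append(
                {"type": "range", "column": col["name"], "min": lo, "max": hi}
            )
    return checks


columns = [
    {"name": "id", "total_count": 4, "null_count": 0,
     "distinct_count": 4, "min_value": 1, "max_value": 4},
    {"name": "amount", "total_count": 4, "null_count": 1,
     "distinct_count": 2, "min_value": 10.0, "max_value": 30.0},
]
suggested = suggest_checks_sketch(columns)
```

Checks derived this way describe the data as it was at profiling time, so they are best treated as a starting point to review rather than as rules to enforce unchanged.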

checks_to_yaml(checks, source_type, table)

Convert suggested checks to provero.yaml format.
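The layout of provero.yaml is not described on this page, so the sketch below only illustrates the general idea of serializing a list of check dictionaries together with the source type and table name. The function `checks_to_yaml_sketch`, the `source` and `checks` keys, and the check fields are hypothetical and should not be read as the real file schema.

```python
def checks_to_yaml_sketch(checks, source_type, table):
    """Render checks as YAML text by hand (hypothetical layout, stdlib only).

    Assumes every check is a non-empty dict of simple scalar values that
    need no YAML quoting.
    """
    lines = [
        "source:",
        f"  type: {source_type}",
        f"  table: {table}",
        "checks:",
    ]
    for check in checks:
        items = list(check.items())
        first_key, first_val = items[0]
        # The first key opens a new list item; the rest are indented under it.
        lines.append(f"  - {first_key}: {first_val}")
        for key, val in items[1:]:
            lines.append(f"    {key}: {val}")
    return "\n".join(lines) + "\n"


checks = [
    {"type": "not_null", "column": "id"},
    {"type": "range", "column": "amount", "min": 10.0, "max": 30.0},
]
yaml_text = checks_to_yaml_sketch(checks, "sqlite", "orders")
print(yaml_text)
```

Hand-rendering keeps the example free of third-party dependencies. For values that may contain colons, quotes, or newlines, a real YAML library would be the safer choice.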