Profiler¶
Column-level profiling that computes summary statistics for data sources.
Data profiler: generates statistical profiles and suggests checks.
ColumnProfile(name, dtype, total_count=0, null_count=0, null_pct=0.0, distinct_count=0, distinct_pct=0.0, min_value=None, max_value=None, mean_value=None, median_value=None, stddev_value=None, min_length=None, max_length=None, avg_length=None, top_values=list())
dataclass
¶
Statistical profile of a single column.
TableProfile(table, row_count, column_count, columns=list())
dataclass
¶
Statistical profile of a table.
profile_table(connection, table, sample_size=None)
¶
Generate a statistical profile of a table.
suggest_checks(profile)
¶
Suggest quality checks based on a table profile.
checks_to_yaml(checks, source_type, table)
¶
Convert suggested checks to provero.yaml format.