Validator Types
A validator is a check that calculates a metric on your data over a window and compares it against an expected threshold to detect data quality issues. A validator type defines the specific check a validator performs, and Validio organizes validator types by data quality dimension: timeliness, completeness, uniqueness, validity, accuracy, and consistency.

Validator type options when creating a new validator on a source
This guide lists the different validator types and supported metrics for each validator.
Validator Metrics and Windows: Validators calculate metrics over a window. For example, a validator calculates the mean value over a daily window, and then validates if these daily mean values follow an expected seasonal pattern. See About Validators and Configuring Validators.
Timeliness
Verify that data is up-to-date and that events arrive within the expected timeframe.
| Validator Type | Description | Metric Options |
|---|---|---|
| Freshness | Ensure data is timely by checking if its timestamp is within the expected range. | Freshness |
| Freshness (metadata) | Ensure data is timely by checking if the table was last updated within the expected time range. This check is based on the warehouse metadata. | Freshness |
| Relative Time | Compare the time difference between two data subsets. | Minimum difference, Maximum difference, Mean difference |
Completeness
Ensure datasets contain the expected data by checking for null values, empty strings, missing records, and overall data volume.
| Validator Type | Description | Metric Options |
|---|---|---|
| Empty Strings | Check a specific field for empty string values to maintain validity. | Count, Percentage |
| Null Values | Check for null values to ensure data completeness and reliability. | Count, Percentage |
| Row Count | Verify the number of rows in a table meets expected thresholds. | Count |
| Row Count (metadata) | Ensure the row count in the table is within the expected range. This check is based on the warehouse metadata. | Count |
| Volume | Check data volume metrics like count, percentage, duplicates, or distinct values. | Count, Percentage, Duplicate count, Duplicate percentage, Unique count, Unique percentage |
Metadata Validators for Data Warehouse Sources Metadata validators, Freshness (metadata) and Row Count (metadata), are only available for BigQuery and Snowflake sources.
Uniqueness
Maintain data quality by checking for duplicate or distinct values in specific fields.
| Validator Type | Description | Metric Options |
|---|---|---|
| Distinct Values | Ensure a column contains only distinct values or matches expected uniqueness. | Unique Count, Unique Percentage |
| Duplicate Values | Identify duplicate entries in a column to maintain data integrity. | Duplicate Count, Duplicate Percentage |
Validity
Verify that data conforms to expected formats, allowed values, and custom business rules.
| Validator Type | Description | Metric Options |
|---|---|---|
| Custom SQL | Use SQL queries for tailored validation. Write your own query or describe what you want and let AI generate it. | Custom |
| Enum Values | Ensure a field matches a predefined set of allowed values. | Count, Percentage |
| Record Match Count | Count the records matching a specific filter condition. | Count |
Accuracy
Evaluate numeric and categorical data to verify it matches expected statistical patterns and distributions.
| Validator Type | Description | Metric Options |
|---|---|---|
| Categorical Distribution | Compare two datasets to check if a categorical field's values match expected proportions. | Categories added, Categories removed, Categories changed, Relative entropy |
| Numeric Distribution | Compare two datasets to check if a numeric field’s values match the expected distribution. | Relative entropy, Mean ratio, Maximum ratio, Minimum ratio, Standard deviation ratio |
| Numeric Statistics | Check a numeric field against statistical metrics. | Mean, Maximum, Minimum, Standard deviation, Sum |
Consistency
Ensure data remains aligned across validators, sources, or datasets by comparing metrics and record-level consistency.
| Validator Type | Description | Metric Options |
|---|---|---|
| Reconciliation | Compares aggregated metrics between two validators. | Absolute diff, Diff, Ratio, Sum |
| Referential Integrity | Validate record-level consistency between two datasets within a data warehouse. | N/A |
| Relative Volume | Compare the volume between two data subsets. | Count ratio, Percentage ratio |
Reference Source Validation
Validators are either single source or reference. Single source validators calculate metrics based on one dataset, while reference source validators calculate metrics based on fields from two different datasets (sources). You can only configure a reference source for specific validator types that support comparative analysis (such as Categorical Distribution, Numeric Distribution, and Relative Volume).
Reference source validators only calculate metrics if there is data in the target dataset. For example, a Categories removed validator where the reference dataset has 4 categories and the target dataset has 3 categories, yields a result of 1. If the target dataset has 0 categories, the validator does not return any result, because the target dataset has no data to calculate metrics on.
For more information and configuration examples, see Reference Source Validation.
Updated 14 days ago