Frequently Asked Questions
I have set up a Source and Validator, but I do not see any data?
Make sure you start the Source. You can start the Source on the Sources page or Source details page.
What data sources do you support?
We support the major data warehouses, data streams, query engines, and more. We are constantly adding support for new sources. For the current list of supported Sources, refer to Sources.
What can be monitored with Validio?
Validio monitors actual data across data warehouses and streams on:
- A dataset level: Validating aggregate metrics like mean or relative entropy.
- A datapoint level: Validating individual datapoints for outliers and anomalies.
- Metadata: Validating ancillary metrics like data volume and freshness.
What if the type of Validator I need is not supported?
We offer an extensive library of out-of-the-box validators for different monitoring use cases. You can also create a validator using SQL queries to monitor custom metrics. Refer to Validator Types
Of course, we are always looking to extend our validator capabilities. If your use case requires a metric or a Validator that is currently not in the platform, reach out to us.
What does real-time data quality monitoring mean?
Validio is built in Rust for real-time processing and validation of individual datapoints in sub-second speed, this allows you to catch data failures as they arise even on data streams.
What is a metric?
A metric refers to the quantity or statistics a Validator produces and ultimately what is being monitored. A Threshold can be applied to the metric to determine when the metric should be considered an anomaly.
Monitor metric examples include: mean, standard deviation, mode, relative time between two timestamps.
For more information, refer to Validators.
Should I deploy Validio in my own environment or choose the managed solution?
Validio provides two primary deployment options designed to offer complete flexibility over where your data is stored and processed: Validio Managed Solution and Customer Virtual Private Cloud (VPC).
- Managed Solution: We recommend the managed solution for most of our customers since it releases valuable engineering time of managing the Validio platform. Validio hosts your deployment in your chosen cloud region, processes metadata and manages retention, while ensuring regional compliance.
- Customer VPC: In cases where data must not leave the customer's environment due to regulations or security issues, the VPC deployment gives you full control. Validio has no access to your data, and you are in control of data processing, retention, and compliance.
How does Validio manage backups and recovery?
For Validio Managed Solution customers, Validio performs daily backups of environments (including configuration and calculated metrics) to easily restore the environment. Backups are retained for 90 days in Validio. This allows for easy restoration and recovery times of less than 24 hours.
For more information, see Data Handling and Retention.
What data or information can I export from Validio?
The Validio API allows complete flexibility in exporting all information in Validio (including configurations, audit logs, data quality history) to other systems (such as catalogs, BI tools, and data warehouses). Validio also provides SDK recipes with pre-written code for exporting metrics or incident groups to CSV.
Refer to the SDK Recipes and Validio API Documentation.
Updated 9 days ago