Why don't I see any data after setting up a Source, Window, and Validator?
Make sure you start the Source. You can start the Source on the Sources page or Source details page.
What data sources do you support?
We support the major data warehouses, object storage, and streaming tools, and we're constantly adding support for new ones. For the full list of supported Sources, refer to Sources.
What can be monitored with Validio?
Validio monitors actual data across data warehouses, object storage, and streams on:
- A dataset level: Validating aggregate metrics like mean or relative entropy.
- A datapoint level: Validating individual datapoints for outliers and anomalies.
- Metadata: Validating ancillary metrics like data volume and freshness.
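To make the three levels concrete, here is a minimal sketch in plain Python. It does not use Validio's API; the records, field names (`amount`, `loaded_at`), and the outlier rule are assumptions chosen purely for illustration.

```python
import statistics
from datetime import datetime, timedelta, timezone

# Hypothetical batch of records; field names are made up for this sketch.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
rows = [
    {"amount": 10.0, "loaded_at": now - timedelta(minutes=50)},
    {"amount": 12.0, "loaded_at": now - timedelta(minutes=40)},
    {"amount": 11.0, "loaded_at": now - timedelta(minutes=30)},
    {"amount": 9.0, "loaded_at": now - timedelta(minutes=20)},
    {"amount": 500.0, "loaded_at": now - timedelta(minutes=10)},
]
amounts = [r["amount"] for r in rows]

# Dataset level: one aggregate metric computed over the whole batch.
dataset_mean = statistics.mean(amounts)

# Datapoint level: flag individual values far from the median.
# The cutoff of 100 is an arbitrary assumption for this example.
median = statistics.median(amounts)
outliers = [a for a in amounts if abs(a - median) > 100]

# Metadata: data volume (row count) and freshness (age of the newest row).
volume = len(rows)
freshness = now - max(r["loaded_at"] for r in rows)

print(dataset_mean, outliers, volume, freshness)
```

Note how the single extreme value pulls the dataset-level mean far above the typical amounts, while the datapoint-level check identifies exactly which record caused it.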
What if the type of Validator I need is not supported?
We are always looking to extend our Validator capabilities.
If your use case requires a metric or a Validator that is currently not in the platform, reach out to us.
What is a Destination?
Destinations allow you to write out individual anomalies. Consider 1 billion datapoints of which 0.1% are anomalies, that is, 1 million anomalies. Alerting on each of them would cause extreme alert fatigue.
With Destinations, you can instead egress these anomalies to a data system of your choice, such as a data warehouse or object storage, to perform custom analysis and processing.
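The arithmetic and the idea of writing anomalies out can be sketched as follows. This is plain Python, not Validio's Destination API: the `write_anomalies` helper, the `is_anomaly` field, and the in-memory sink are hypothetical stand-ins for a real warehouse table or object-storage file.

```python
import io
import json

# 0.1% of 1 billion datapoints, computed with integers to avoid float rounding.
total_points = 1_000_000_000
expected_anomalies = total_points * 1 // 1000  # 0.1% = 1 per 1,000

def write_anomalies(records, sink):
    """Write only the records flagged as anomalies to sink as JSON Lines."""
    count = 0
    for rec in records:
        if rec["is_anomaly"]:
            sink.write(json.dumps(rec) + "\n")
            count += 1
    return count

# Hypothetical validated records; the is_anomaly flag is assumed to be set upstream.
sample = [
    {"id": 1, "value": 10.0, "is_anomaly": False},
    {"id": 2, "value": 999.0, "is_anomaly": True},
    {"id": 3, "value": 11.0, "is_anomaly": False},
]

sink = io.StringIO()  # stand-in for a warehouse table or object-storage file
written = write_anomalies(sample, sink)

print(expected_anomalies, written)
```

Collecting anomalies in a queryable store rather than sending one alert per anomaly is what makes volumes of this size manageable.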
What does real-time data quality monitoring mean?
Validio is built in Rust for real-time processing and validates individual datapoints at sub-second speed. This allows you to catch data failures as they arise, even on data streams.
What is a metric?
A metric is the quantity or statistic that a Validator produces and, ultimately, what is being monitored. A Threshold can be applied to the metric to determine when its value should be considered an anomaly.
Examples of metrics: mean, standard deviation, mode, relative time between two timestamps.
For more information, refer to Validators.
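As an illustration of how a metric and a Threshold relate, the sketch below computes a mean over a window of values and compares it against fixed bounds. It is plain Python, not Validio's implementation; the `is_anomaly` helper, the window values, and the bounds are all assumptions made for the example.

```python
import statistics

def is_anomaly(metric_value, lower, upper):
    """Return True when the metric falls outside the [lower, upper] bounds."""
    return not (lower <= metric_value <= upper)

# Hypothetical values collected in one window.
window = [20.1, 19.8, 20.4, 35.0]

# The metric: here, the mean of the window.
metric = statistics.mean(window)

# The threshold: fixed bounds chosen arbitrarily for this sketch.
flagged = is_anomaly(metric, lower=19.0, upper=22.0)

print(metric, flagged)
```

Here the single high value pushes the window mean above the upper bound, so the metric is flagged.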
Should I deploy Validio in my own environment or choose the Managed Solution?
We understand that deployment flexibility is important to our customers, which is why we offer two deployment options.
We recommend the managed solution for most customers, since it frees up valuable engineering time that would otherwise be spent managing the Validio platform.
For cases where data can't leave the customer's environment due to regulations or security requirements, we also offer deployment in the customer's own environment.
Will you be available on cloud marketplaces?
Yes, we are currently in the process of getting listed on both the GCP and AWS marketplaces.
If you are interested in early access, reach out to us!