Data Handling and Retention
This article discusses the types of data that Validio can access and process, as well as the retention period for each type of data.
Deployment and Data Management
Validio provides two primary deployment options designed to offer complete flexibility over where your data is stored and processed. Both models allow you to select specific regions (GCP, AWS, or Azure) to ensure compliance with local data residency requirements.
The following table compares the data management based on the deployment options:
| Feature | Validio Managed Solution | Customer Virtual Private Cloud (VPC) |
|---|---|---|
| Hosting | Hosted by Validio in your chosen cloud region. | Installed and managed within your own infrastructure. |
| Data Access | Validio processes metadata and managed retention. | Validio has no access to raw data or logs. |
| Data Retention | Standard retention periods apply to processed data. See Data Retention Policies | Customer-managed; no backup provided by Validio. |
| Control | High ease of use with regional compliance. | Full customer control over entire environment. |
For more information about Validio deployments, see Validio Managed Solution and Validio VPC Deployment. For more information about how Validio protects your data, see Security and Compliance.
Data Retention Policies
The level of data access and the applicable retention policies depend heavily on your setup:
- Managed Solution: Validio manages the retention periods for the data types processed within the platform.
- VPC Deployment: Because Validio does not have access to your raw data or logs in a VPC environment, we do not provide a backup solution for your installation. Data retention in this model is opt-in only and applies exclusively if you choose to send performance metrics and error logs to Validio for troubleshooting and analysis.
The following table lists the types of data Validio processes and their retention period depending on your deployment.
Type of Data | Validio Managed Solution | Validio VPC Deployment | Retention Period (In Validio) |
|---|---|---|---|
Raw Data from streaming sources Validio ingests raw data from streaming sources. | Y | N | 1 hour after the data is processed |
Raw Data from data warehouse sources Validio ingests aggregrate metrics from data warehouse sources, and only ingests raw data when you run the debug query for incidents. The query runs against the warehouse and displays the results, and this data is not retained. | N | N | N/A |
Query Logs | Y | N | 30 days |
Logs, metrics, and traces This data includes performance metrics and application error logs. | Y | Y1 | 60 days |
Daily backups of environments. The backups include configuration and calculated metrics. | Y | N | 90 days |
Anonymized calculated metrics | Y | N | Depends on window and segment configuration. |
1 For Validio VPC deployments, you can opt-out from sending performance metrics and error logs to Validio.
Manually Exporting Data
The Validio API allows complete flexibility in exporting all information in Validio (including configurations, audit logs, data quality history) to other systems (such as catalogs, BI tools, and data warehouses). Validio also provides SDK recipes with pre-written code for exporting metrics or incident groups to CSV. Refer to the SDK Recipes and Validio API Documentation.
Updated 13 days ago