Data Handling and Retention

This article discusses the types of data that Validio can access and process, as well as the retention period for each type of data.

Deployment and Data Management

Validio provides two deployment options that let you control where your data is stored and processed. With both options, you select the cloud provider (GCP, AWS, or Azure) and the region, so that you can meet local data residency requirements.

The following table compares data management in the two deployment options:

Feature | Validio Managed Solution | Customer Virtual Private Cloud (VPC)
--- | --- | ---
Hosting | Hosted by Validio in your chosen cloud region. | Installed and managed within your own infrastructure.
Data Access | Validio processes metadata and manages retention. | Validio has no access to raw data or logs.
Data Retention | Standard retention periods apply to processed data. See Data Retention Policies. | Customer-managed; no backup provided by Validio.
Control | High ease of use with regional compliance. | Full customer control over the entire environment.

For more information about Validio deployments, see Validio Managed Solution and Validio VPC Deployment. For more information about how Validio protects your data, see Security and Compliance.

Data Retention Policies

The level of data access and the applicable retention policies depend on your deployment model:

  • Managed Solution: Validio manages the retention periods for the data types processed within the platform.
  • VPC Deployment: Because Validio does not have access to your raw data or logs in a VPC environment, Validio does not provide a backup solution for your installation. Validio retains data in this model only if you choose to send performance metrics and error logs to Validio for troubleshooting and analysis.

The following table lists the types of data Validio processes and the retention period for each, depending on your deployment.

Type of Data | Validio Managed Solution | Validio VPC Deployment | Retention Period (In Validio)
--- | --- | --- | ---
Raw data from streaming sources. Validio ingests raw data from streaming sources. | Y | N | 1 hour after the data is processed
Raw data from data warehouse sources. Validio ingests aggregate metrics from data warehouse sources, and only ingests raw data when you run the debug query for incidents. The query runs against the warehouse and displays the results, and this data is not retained. | N | N | N/A
Query logs | Y | N | 30 days
Logs, metrics, and traces. This data includes performance metrics and application error logs. | Y | Y [1] | 60 days
Daily backups of environments. The backups include configuration and calculated metrics. | Y | N | 90 days
Anonymized calculated metrics | Y | N | Depends on window and segment configuration.

[1] For Validio VPC deployments, you can opt out of sending performance metrics and error logs to Validio.
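
To make the fixed retention periods above concrete, the following minimal Python sketch computes when data of a given type would no longer be retained in the Validio Managed Solution. The dictionary keys and the retained_until function are illustrative names, not part of any Validio API. The sketch also assumes that each period is counted from a single reference timestamp, which the table states only for streaming raw data ("after the data is processed"). Anonymized calculated metrics are left out because their retention depends on window and segment configuration.

```python
from datetime import datetime, timedelta, timezone

# Retention periods for the Validio Managed Solution, copied from the table.
# The keys are illustrative labels, not Validio identifiers.
RETENTION = {
    "streaming_raw_data": timedelta(hours=1),
    "query_logs": timedelta(days=30),
    "logs_metrics_traces": timedelta(days=60),
    "daily_backups": timedelta(days=90),
}


def retained_until(data_type: str, reference_time: datetime) -> datetime:
    """Return the time at which the retention period for this data type ends.

    Assumption: the period starts at reference_time (for example, the time
    the data was processed or the log entry was written).
    """
    return reference_time + RETENTION[data_type]


processed_at = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
print(retained_until("streaming_raw_data", processed_at))  # 2024-01-01 13:00:00+00:00
print(retained_until("query_logs", processed_at))          # 2024-01-31 12:00:00+00:00
```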

Manually Exporting Data

The Validio API lets you export all information in Validio, including configurations, audit logs, and data quality history, to other systems such as catalogs, BI tools, and data warehouses. Validio also provides SDK recipes with pre-written code for exporting metrics or incident groups to CSV. For details, see SDK Recipes and the Validio API Documentation.
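
As a rough illustration of the CSV step only, the sketch below serializes a list of metric records with Python's standard csv module. It is not one of the Validio SDK recipes and makes no Validio API calls. The metrics_to_csv function, the record fields (validator, window_end, value), and the sample values are hypothetical stand-ins for whatever the API or SDK returns in your environment.

```python
import csv
import io


def metrics_to_csv(records: list[dict]) -> str:
    """Serialize a list of flat records (all with the same keys) to CSV text."""
    if not records:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(records[0].keys()),
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Hypothetical records standing in for metrics fetched from Validio.
sample = [
    {"validator": "row_count", "window_end": "2024-01-01T00:00:00Z", "value": 1200},
    {"validator": "row_count", "window_end": "2024-01-02T00:00:00Z", "value": 1185},
]
print(metrics_to_csv(sample))
```

The result can be written to a file or loaded into a BI tool or data warehouse. For the actual export calls, rely on the SDK Recipes and the Validio API Documentation.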