VPC deployment

Validio is available for the following cloud providers:

Google Cloud Platform (GCP): GKE.
Amazon Web Services (AWS): EKS.
Microsoft Azure: AKS.

Prerequisites

📘
Installing Validio on Customer VPC requires knowledge of:

Kubernetes: how to install, configure, and monitor resources on AWS, GCP or Azure.

Helm: how to install and configure helm charts on Kubernetes.

1. Access to docker images using a JSON file provided by Validio. For more information, contact us.

2. Configure a Kubernetes cluster on GCP, AWS or Azure. Considering the following:

All nodes must be in the same AWS availability zone, GCP zone or Azure zone.
Persistent volume provisioning enabled, to create Kubernetes PVC resources.
We also recommended a configured load balance or ingress controller, such as:

3. Install the following:

kubectl CLI tool and set the kubectl context to the destination cluster.
Helm version 3.8.0 or later with Helm OCI support.

Installation

1. Create namespace

Create the namespace you want to use for your configuration. In our examples, we use validio.

kubectl create namespace validio

2. Add the docker-registry secret

Create a Kubernetes Secret to pull the docker images required by the helm chart. This requires the JSON file provided by Validio.

First, set the PULL_CONTAINERS_KEY environment variable to point to your JSON file:

export PULL_CONTAINERS_KEY=/home/user/Downloads/validio-docker-registry.json

Then, add the Kubernetes Secret:

kubectl -n validio create secret docker-registry artifact-registry --docker-server=https://europe-docker.pkg.dev --docker-username=_json_key --docker-password="$(cat ${PULL_CONTAINERS_KEY})"

3. Install the helm chart

📘
The helm chart installation requires the validio-values.yaml file. For details, refer to configuration and examples.

helm install validio oci://europe-docker.pkg.dev/validio-platform-prod/charts/validio --version [your-validio-version] --namespace validio --values validio-values.yaml

4. Upgrade Validio version

📘
Make sure you use the correct validio-values.yaml file when upgrading Validio. Otherwise, you risk changing your configuration.

helm upgrade validio oci://europe-docker.pkg.dev/validio-platform-prod/charts/validio --version [your-validio-version] --namespace validio --values validio-values.yaml

5. Configure the `validio-values.yaml` file

The Helm chart supports the following keys in the validio-values.yaml file:

Key	Default value	Description
`pigeon.env.http_proxy`	`nil`	Set proxy for `HTTP` requests, will be used by Sentry
`pigeon.env.https_proxy`	`nil`	Set proxy for `HTTPS` requests, will be used by Sentry
`pigeon.resources.*`	`nil`	Resources for Pigeon. Suggested values: memory `512Mi`, cpu `200m`.
`postgres.resources.*`	`nil`	Resources for Postgres. Suggested values: memory `4Gi` and cpu `1000m`.
`postgres.storage.capacity`	`nil`	Storage capacity for Postgres. Suggested value: `50Gi`.
`postgres.storage.class`	`nil`	Storage class for Postgres. For example, `premium-rwo` for GCP and `gp2` for AWS. Leave empty to use cloud provider default.
`redis.resources.*`	`nil`	Resources for Postgres. Suggested values: memory `1Gi` and cpu `500m`.
`redis.storage.capacity`	`nil`	Storage capacity for Redis. Suggested values: `20Gi`.
`redis.storage.class`	`nil`	Storage class for Redis. For example, `premium-rwo` for GCP and `gp2` for AWS. Leave empty to use cloud provider default.
`surface.env.http_proxy`	`nil`	Set proxy for `HTTP` requests, will be used by Slack and Sentry
`surface.env.https_proxy`	`nil`	Set proxy for `HTTPS` requests, will be used by Slack and Sentry
`surface.ingress.enabled`	`false`	Set to `true` to enable Kubernetes ingress. Note: This requires an ingress controller and that all ingress parameters are set accordingly.
`surface.ingress.host`	`nil`	The ingress hostname. Note: You must also set a DNS name.
`surface.ingress.path`	`/*`	Set the ingress path. For Nginx and Azure, `/` is required.
`surface.ingress.annotations`	`nil`	Set the ingress annotations.
`surface.ingress.managedCert`	`false`	Create a `ManagedCertificate` resource for the ingress. Note: For GCP only.
`surface.ingress.tls.enable`	`false`	Enable Ingress TLS, required for use in Azure with cert-manager
`surface.resources.*`	`nil`	Resources for Surface. Suggested values: memory `512Mi`, cpu `200m`.
`ve.diver.resources.*`	`nil`	Resources for Diver. Suggested values: memory `512Mi`, cpu `200m`.
`ve.ingress.resources.*`	`nil`	Resources for Ingress. Suggested vlaues: memory `1Gi`, cpu `500m`. Note: This is not a Kubernetes ingress parameter.
`ve.ingress.cache.size`	`1Gi`	Local disk cache for Ingress. Will be allocated on the host where the pod runs.
`ve.pipelines.resources.*`	`nil`	Resources for Pipelines. Suggested values: memory `2Gi`, cpu `1000m`.
`ve.pipelines.cache.size`	`1Gi`	Local disk cache for Pipelines. Will be allocated on the host where the pod runs.

If the default value is false or true, the value type is bool. For all others, the value type is String.

All <service>.resources.* keys supports both CPU and Memory resources, specified only as <service>.resource.*.

Available keys for each service:

<service>:
  resources:
    limits:
      cpu: <value>
      memory: <value>
    requests:
      cpu: <value>
      memory: <value>

📘
Memory and CPU recommendations

Set a limit for memory use with <service>.resources.limits.memory.

Only set requests for CPU resources with <service>.resources.requests.cpu.

For more information, refer to Kubernetes resources documentation.

HTTP Proxy

Validio supports using HTTP proxy for outbound communication for Slack and Sentry. The URL:s for both http_proxy and https_proxy can be the same, it's internally in the application that they are used for http respectively https URL:s.

If you're using a proxy that needs whitelisting of destinations, here is a list of hosts that Validio will use:

hooks.slack.com
sentry.io
o4506020911316992.ingest.sentry.io

Here is an example configuration:

surface:
  env:
    http_proxy: http://proxy.example.com:8080
    https_proxy: http://proxy.example.com:8080
pigeon:
  env:
    http_proxy: http://proxy.example.com:8080
    https_proxy: http://proxy.example.com:8080

Sign in to the Validio platform

The first time you sign in to the Validio platform, you must use the admin credentials created in the Kubernetes Secret.

🚧
Do not change or update anything in the Kubernetes Secret. You can change the password in the Validio platform.

kubectl -n validio get secrets validio-postgres -o=jsonpath='{.data.validio_app_auth_password}' | base64 -d ; echo

Access your Validio platform:

In your browser, navigate to the hostname specified in the surface.ingress.host key. For example, https://validio.example.com/

If you did not configure an ingress, use port-forward to forward the Validio platform to your localhost:

kubectl -n validio port-forward svc/validio-surface 8889

Then, you can find it on http://localhost:8889/

Services

The Validio services are listed as follows. We recommend that you use this list as a reference when you allocate resources to services running in your system.

Validation Engine

The Validation Engine is the backend application in Validio. The Validation Engine consists of the following services:

Diver - Orchestrator and running background tasks for the Validio application.
Ingress - connects to configured sources to read statistics, data, and source metadata.
Pipelines - handles processing of any read data and performs calculations and anomaly detection on data statistics.

Surface

Surface is the "backend for the frontend". This service is responsible for communication between the frontend application in your browser and the Validation engine.

Pigeon

Pigeon is responsible for sending notifications to different channels, such as Slack and webhooks.

Postgres

Postgres governs all artifacts and configuration settings.

Redis

Redis is used as a transport layer between different services.

Flux GitOps tools

Optionally, you can use Flux to control your GitOps flow. For more information, refer to Flux Helm OCI repository.

Examples

Example of GCP ingress

surface:
  ingress:
    enabled: true
    host: validio.example.com
    managedCert: true
    annotations:
      kubernetes.io/ingress.allow-http: "false"
      kubernetes.io/ingress.class: gce

This example creates a Kubernetes Ingress resource and a ManagedCertificate resource, with the specified hostname. Make sure that the created DNS A record points to the IP address of the load balancer. Without a DNS A record, the certificate is not created and the Kubernetes Ingress won’t work.

Use the following command to verify the IP address of the load balance:

kubectl -n validio get ingress validio-surface -o 
jsonpath='{.status.loadBalancer.ingress[0].ip}'

Example of AWS ingress

surface:
  ingress:
    enabled: true
    host: validio.example.com
    annotations:
      alb.ingress.kubernetes.io/certificate-arn: <ARN of certificate>
      alb.ingress.kubernetes.io/listen-ports: '[{"HTTPS":443}]'
      alb.ingress.kubernetes.io/scheme: internet-facing
      alb.ingress.kubernetes.io/success-codes: 200,302
      alb.ingress.kubernetes.io/target-type: ip
      kubernetes.io/ingress.class: alb

This requires the AWS Load Balancer Controller Add-On. This example creates a Kubernetes Ingress resource with the specified hostname. You must first create a certificate with AWS Certificate Manager as specified in the annotations part. Then, use the Alias option to create a DNS A record that points to the endpoint of the load balancer.

Use the following command to verify the endpoint of the load balancer:

kubectl -n validio get ingress validio-surface -o 
jsonpath='{.status.loadBalancer.ingress[0].hostname}'

Example of Azure ingress

surface:
  ingress:
    enabled: true
    host: validio.example.com
    annotations:
      cert-manager.io/cluster-issuer: letsencrypt
    tls: true
    path: /

This requires the AKS Application routing add-on with a workin external-dns setup and cert-manager for TLS . This example creates a Kubernetes Ingress resource with the specified hostname using TLS.

Example of default memory and cpu resources set

🚧
Below is an example configuration using the above suggested resource configuration and some other sensible values. The YAML key surface.ingressbelow are GCP, AWS or Azure specific, please refer to examples above. Validio will provide customer specific configuration when applicable

pigeon:
  resources:
    limits:
      memory: 512Mi
    requests:
      cpu: 200m
postgres:
  resources:
    limits:
      memory: 4Gi
    requests:
      cpu: 1000m
  storage:
    capacity: 50Gi
redis:
  resources:
    limits:
      memory: 1Gi
    requests:
      cpu: 500m
  storage:
    capacity: 20Gi
surface:
  ingress:
    enabled: true
    host: validio.example.com
  resources:
    limits:
      memory: 512Mi
    requests:
      cpu: 200m
ve:
  diver:
    resources:
      limits:
        memory: 512Mi
      requests:
        cpu: 200m
  ingress:
    resources:
      limits:
        memory: 1Gi
      requests:
        cpu: 200m
    cache:
      size: 10Gi
  pipelines:
    resources:
      limits:
        memory: 2Gi
      requests:
        cpu: 1000m
    cache:
      size: 10Gi

VPC deployment

Prerequisites

📘
Installing Validio on Customer VPC requires knowledge of:

Installation

1. Create namespace

2. Add the docker-registry secret

3. Install the helm chart

📘

4. Upgrade Validio version

📘

5. Configure the `validio-values.yaml` file

📘
Memory and CPU recommendations

HTTP Proxy

Sign in to the Validio platform

🚧

Services

Validation Engine

Surface

Pigeon

Postgres

Redis

Flux GitOps tools

Examples

Example of GCP ingress

Example of AWS ingress

Example of Azure ingress

Example of default memory and cpu resources set

🚧

Prerequisites

📘Installing Validio on Customer VPC requires knowledge of:

Installation

1. Create namespace

2. Add the docker-registry secret

3. Install the helm chart

📘

4. Upgrade Validio version

📘

5. Configure the validio-values.yaml file

📘Memory and CPU recommendations

HTTP Proxy

Sign in to the Validio platform

🚧

Services

Validation Engine

Surface

Pigeon

Postgres

Redis

Flux GitOps tools

Examples

Example of GCP ingress

Example of AWS ingress

Example of Azure ingress

Example of default memory and cpu resources set

🚧

📘
Installing Validio on Customer VPC requires knowledge of:

5. Configure the `validio-values.yaml` file

📘
Memory and CPU recommendations