VPC deployment
Install Validio on customer VPC. The Validio application is distributed as a Helm chart. The Helm chart is installable on managed Kubernetes clusters.
Validio is available for the following cloud providers:
Prerequisites
Installing Validio on Customer VPC requires knowledge of:
- Kubernetes: how to install, configure, and monitor resources on AWS, GCP or Azure.
- Helm: how to install and configure helm charts on Kubernetes.
1. Access to docker images using a JSON file provided by Validio. For more information, contact us.
2. Configure a Kubernetes cluster on GCP, AWS or Azure. Considering the following:
- All nodes must be in the same AWS availability zone, GCP zone or Azure zone.
- Persistent volume provisioning enabled, to create Kubernetes PVC resources.
- We also recommended a configured load balance or ingress controller, such as:
3. Install the following:
- kubectl CLI tool and set the
kubectl
context to the destination cluster. - Helm version 3.8.0 or later with Helm OCI support.
Installation
1. Create namespace
Create the namespace you want to use for your configuration. In our examples, we use validio
.
kubectl create namespace validio
2. Add the docker-registry secret
Create a Kubernetes Secret
to pull the docker images required by the helm chart. This requires the JSON file provided by Validio.
First, set the PULL_CONTAINERS_KEY
environment variable to point to your JSON file:
export PULL_CONTAINERS_KEY=/home/user/Downloads/validio-docker-registry.json
Then, add the Kubernetes Secret
:
kubectl -n validio create secret docker-registry artifact-registry --docker-server=https://europe-docker.pkg.dev --docker-username=_json_key --docker-password="$(cat ${PULL_CONTAINERS_KEY})"
3. Install the helm chart
The helm chart installation requires the
validio-values.yaml
file. For details, refer to configuration and examples.
helm install validio oci://europe-docker.pkg.dev/validio-platform-prod/charts/validio --version [your-validio-version] --namespace validio --values validio-values.yaml
4. Upgrade Validio version
Make sure you use the correct
validio-values.yaml
file when upgrading Validio. Otherwise, you risk changing your configuration.
helm upgrade validio oci://europe-docker.pkg.dev/validio-platform-prod/charts/validio --version [your-validio-version] --namespace validio --values validio-values.yaml
5. Configure the validio-values.yaml
file
validio-values.yaml
fileThe Helm chart supports the following keys in the validio-values.yaml
file:
Key | Default value | Description |
---|---|---|
pigeon.env.http_proxy | nil | Set proxy for HTTP requests, will be used by Sentry |
pigeon.env.https_proxy | nil | Set proxy for HTTPS requests, will be used by Sentry |
pigeon.resources.* | nil | Resources for Pigeon. Suggested values: memory 512Mi , cpu 200m . |
postgres.resources.* | nil | Resources for Postgres. Suggested values: memory 4Gi and cpu 1000m . |
postgres.storage.capacity | nil | Storage capacity for Postgres. Suggested value: 50Gi . |
postgres.storage.class | nil | Storage class for Postgres. For example, premium-rwo for GCP and gp2 for AWS. Leave empty to use cloud provider default. |
redis.resources.* | nil | Resources for Postgres. Suggested values: memory 1Gi and cpu 500m . |
redis.storage.capacity | nil | Storage capacity for Redis. Suggested values: 20Gi . |
redis.storage.class | nil | Storage class for Redis. For example, premium-rwo for GCP and gp2 for AWS. Leave empty to use cloud provider default. |
surface.env.http_proxy | nil | Set proxy for HTTP requests, will be used by Slack and Sentry |
surface.env.https_proxy | nil | Set proxy for HTTPS requests, will be used by Slack and Sentry |
surface.ingress.enabled | false | Set to true to enable Kubernetes ingress. Note: This requires an ingress controller and that all ingress parameters are set accordingly. |
surface.ingress.host | nil | The ingress hostname. Note: You must also set a DNS name. |
surface.ingress.path | /* | Set the ingress path. For Nginx and Azure, / is required. |
surface.ingress.annotations | nil | Set the ingress annotations. |
surface.ingress.managedCert | false | Create a ManagedCertificate resource for the ingress. Note: For GCP only. |
surface.ingress.tls.enable | false | Enable Ingress TLS, required for use in Azure with cert-manager |
surface.resources.* | nil | Resources for Surface. Suggested values: memory 512Mi , cpu 200m . |
ve.diver.resources.* | nil | Resources for Diver. Suggested values: memory 512Mi , cpu 200m . |
ve.ingress.resources.* | nil | Resources for Ingress. Suggested vlaues: memory 1Gi , cpu 500m . Note: This is not a Kubernetes ingress parameter. |
ve.ingress.cache.size | 1Gi | Local disk cache for Ingress. Will be allocated on the host where the pod runs. |
ve.pipelines.resources.* | nil | Resources for Pipelines. Suggested values: memory 2Gi , cpu 1000m . |
ve.pipelines.cache.size | 1Gi | Local disk cache for Pipelines. Will be allocated on the host where the pod runs. |
If the default value is
false
ortrue
, the value type isbool
. For all others, the value type isString
.
All <service>.resources.*
keys supports both CPU and Memory resources, specified only as <service>.resource.*
.
Available keys for each service:
<service>:
resources:
limits:
cpu: <value>
memory: <value>
requests:
cpu: <value>
memory: <value>
Memory and CPU recommendations
- Set a limit for memory use with
<service>.resources.limits.memory
.- Only set requests for CPU resources with
<service>.resources.requests.cpu
.For more information, refer to Kubernetes resources documentation.
HTTP Proxy
Validio supports using HTTP proxy for outbound communication for Slack and Sentry. The URL:s for both http_proxy
and https_proxy
can be the same, it's internally in the application that they are used for http
respectively https
URL:s.
If you're using a proxy that needs whitelisting of destinations, here is a list of hosts that Validio will use:
hooks.slack.com
sentry.io
o4506020911316992.ingest.sentry.io
Here is an example configuration:
surface:
env:
http_proxy: http://proxy.example.com:8080
https_proxy: http://proxy.example.com:8080
pigeon:
env:
http_proxy: http://proxy.example.com:8080
https_proxy: http://proxy.example.com:8080
Sign in to the Validio platform
The first time you sign in to the Validio platform, you must use the admin credentials created in the Kubernetes Secret.
Do not change or update anything in the Kubernetes Secret. You can change the password in the Validio platform.
kubectl -n validio get secrets validio-postgres -o=jsonpath='{.data.validio_app_auth_password}' | base64 -d ; echo
Access your Validio platform:
In your browser, navigate to the hostname specified in the surface.ingress.host
key. For example, https://validio.example.com/
If you did not configure an ingress, use port-forward to forward the Validio platform to your localhost:
kubectl -n validio port-forward svc/validio-surface 8889
Then, you can find it on http://localhost:8889/
Services
The Validio services are listed as follows. We recommend that you use this list as a reference when you allocate resources to services running in your system.
Validation Engine
The Validation Engine
is the backend application in Validio. The Validation Engine
consists of the following services:
Diver
- Orchestrator and running background tasks for the Validio application.Ingress
- connects to configured sources to read statistics, data, and source metadata.Pipelines
- handles processing of any read data and performs calculations and anomaly detection on data statistics.
Surface
Surface
is the "backend for the frontend". This service is responsible for communication between the frontend application in your browser
and the Validation engine
.
Pigeon
Pigeon
is responsible for sending notifications to different channels, such as Slack and webhooks.
Postgres
Postgres
governs all artifacts and configuration settings.
Redis
Redis
is used as a transport layer between different services.
Flux GitOps tools
Optionally, you can use Flux to control your GitOps flow. For more information, refer to Flux Helm OCI repository.
Examples
Example of GCP ingress
surface:
ingress:
enabled: true
host: validio.example.com
managedCert: true
annotations:
kubernetes.io/ingress.allow-http: "false"
kubernetes.io/ingress.class: gce
This example creates a Kubernetes Ingress
resource and a ManagedCertificate
resource, with the specified hostname. Make sure that the created DNS A record
points to the IP address of the load balancer. Without a DNS A record
, the certificate is not created and the Kubernetes Ingress
won’t work.
Use the following command to verify the IP address of the load balance:
kubectl -n validio get ingress validio-surface -o
jsonpath='{.status.loadBalancer.ingress[0].ip}'
Example of AWS ingress
surface:
ingress:
enabled: true
host: validio.example.com
annotations:
alb.ingress.kubernetes.io/certificate-arn: <ARN of certificate>
alb.ingress.kubernetes.io/listen-ports: '[{"HTTPS":443}]'
alb.ingress.kubernetes.io/scheme: internet-facing
alb.ingress.kubernetes.io/success-codes: 200,302
alb.ingress.kubernetes.io/target-type: ip
kubernetes.io/ingress.class: alb
This requires the AWS Load Balancer Controller Add-On
. This example creates a Kubernetes Ingress
resource with the specified hostname. You must first create a certificate with AWS Certificate Manager
as specified in the annotations part. Then, use the Alias option
to create a DNS A record
that points to the endpoint of the load balancer.
Use the following command to verify the endpoint of the load balancer:
kubectl -n validio get ingress validio-surface -o
jsonpath='{.status.loadBalancer.ingress[0].hostname}'
Example of Azure ingress
surface:
ingress:
enabled: true
host: validio.example.com
annotations:
cert-manager.io/cluster-issuer: letsencrypt
tls: true
path: /
This requires the AKS Application routing add-on
with a workin external-dns
setup and cert-manager
for TLS . This example creates a Kubernetes Ingress
resource with the specified hostname using TLS.
Example of default memory and cpu resources set
Below is an example configuration using the above suggested resource configuration and some other sensible values. The YAML key
surface.ingress
below are GCP, AWS or Azure specific, please refer to examples above. Validio will provide customer specific configuration when applicable
pigeon:
resources:
limits:
memory: 512Mi
requests:
cpu: 200m
postgres:
resources:
limits:
memory: 4Gi
requests:
cpu: 1000m
storage:
capacity: 50Gi
redis:
resources:
limits:
memory: 1Gi
requests:
cpu: 500m
storage:
capacity: 20Gi
surface:
ingress:
enabled: true
host: validio.example.com
resources:
limits:
memory: 512Mi
requests:
cpu: 200m
ve:
diver:
resources:
limits:
memory: 512Mi
requests:
cpu: 200m
ingress:
resources:
limits:
memory: 1Gi
requests:
cpu: 200m
cache:
size: 10Gi
pipelines:
resources:
limits:
memory: 2Gi
requests:
cpu: 1000m
cache:
size: 10Gi
Updated about 1 year ago