Kinesis
Connect Validio to Amazon Kinesis for real-time data quality monitoring on streaming data.
The Amazon Kinesis integration enables real-time data quality monitoring on streaming data in Validio. To integrate with Kinesis, you will need to add a credential and add a source for the data stream.
Prerequisites for Integrating with Kinesis
Prepare credentials and permission in the AWS Management Console for Validio to validate your data:
- An IAM user with access and permissions to the specified Kinesis stream.
- Credentials for the IAM user. For more information, refer to Actions, resources, and condition keys for Amazon Kinesis Data Streams.
IAM user for Source config
We recommend that you create an IAM user where the Validio platform is granted access to the Kinesis stream you want to read data from.
The following permissions must be assigned to your IAM user:
| Actions | Access level | Required resource type |
|---|---|---|
SubscribeToShard | Read | consumer |
ListShards | List | stream |
DescribeStreamConsumer | Read | consumer |
ListStreamConsumers | Read | stream |
RegisterStreamConsumer | Write | stream |
ListStreams | List | |
EnableEnhancedMonitoring | Write | |
DescribeLimits | Read | |
DisableEnhancedMonitoring | Write |
Access key
You must supply an Access key and secret key to authenticate to the Kinesis stream. For more information, refer to Managing access keys for IAM users.
Add a Kinesis Credential
To add a Kinesis credential in Validio,
- Navigate to Credentials and click + New Credential.
- Under Namespace, select a namespace where the resources will be created.
- For Credential Type, select Amazon Kinesis.
- Fill in the Configuration parameter fields. Refer to the Kinesis Credential Parameters section.
- (Optional) Click Test credential to validate that Validio can successfully access the Kinesis stream. If validation fails, check that you provided the correct parameter values.
- Check Use for catalog and schema checks to automatically discover credentials and add them to the catalog page.
- Click Create.
Kinesis Credential Parameters
| Field | Description |
|---|---|
| Name | Identifier for the credentials. Used when accessing Sources. |
| Access key | Access key with permissions to access the Kinesis stream. |
| Secret key | Secret key for the specified access key. |
Add a Kinesis Source
To add a source for Kinesis,
- Navigate to Sources and click + New Source.
- Under Source type, select Amazon Kinesis.
- Fill in the configuration parameters. For more information, see Kinesis Configuration Parameters.
- Click Continue to create the source.
Kinesis Configuration Parameters
| Field | Description |
|---|---|
| Name | Identifier for the Source. Used when setting up Validators. |
| AWS region | The AWS region that the Kinesis stream is available in. |
| Stream name | Name of the stream to read data from. |
| Message format | JSON, AVRO or PROTOBUF |
| Message Schema | For JSON, Validio automatically infers the schema. For AVRO and PROTOBUF, you can upload schemas. For more information, see Data Stream Sources. |