Data Sampling
Data Sampling displays a tabulated preview of rows from your source or catalog asset, so you can inspect actual values and verify data patterns.

Data sampling configuration and preview
Running a Sample
To sample data from your source or catalog asset:
- Navigate to the Schema & Profiling tab.
- Click Sample data.
- In the dialog window, configure the following settings to define the sample size and fields.
| Setting | Description |
|---|---|
| Sampling Percentage | (Only available for tables and materialized views.) Adjust the percentage of data to include in the sample. Lower values speed up sampling on large tables. |
| Fields | Select specific fields to include in the sample. By default, all fields are included. |
| Row limit | Set the maximum number of rows to include in the sample. Use this to cap execution time on large datasets. |
| Partition field | (Advanced) Select a partition field to filter the data before sampling. This limits the sample to specific partitions rather than scanning the full dataset. |
| Partition lookback | (Advanced) Set the number of partitions to look back from the most recent partition. Use this with Partition field to sample only recent data. |
The dialog includes a SQL query that is auto-generated from your settings. You can edit this query directly for more control, but note that manual edits are not synced back to the settings above. Click Reset query to discard any manual edits and regenerate the query from the current settings.
- Click Load samples to display the results.
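As a rough illustration of how the dialog settings could map to the auto-generated SQL query, here is a minimal Python sketch. The `SampleSettings` class, the `build_sample_query` function, and the BigQuery-style `TABLESAMPLE` and `DATE_SUB` syntax are assumptions made for this example. The query the dialog actually generates depends on your source and may look different. The sketch also assumes that Partition lookback counts daily partitions.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SampleSettings:
    """Hypothetical mirror of the dialog settings (names are illustrative)."""
    table: str
    fields: list[str] = field(default_factory=list)  # empty list means all fields
    row_limit: int = 100
    sampling_percentage: Optional[float] = None  # tables and materialized views only
    partition_field: Optional[str] = None        # advanced
    partition_lookback: Optional[int] = None     # advanced, assumed to be in days


def build_sample_query(s: SampleSettings) -> str:
    """Assemble a sampling query from the settings.

    Identifiers are interpolated as-is, so this is for illustration only
    and should not be used with untrusted input.
    """
    columns = ", ".join(s.fields) if s.fields else "*"
    from_clause = f"FROM {s.table}"
    if s.sampling_percentage is not None:
        # Assumed BigQuery-style sampling clause; other engines use other syntax.
        from_clause += f" TABLESAMPLE SYSTEM ({s.sampling_percentage} PERCENT)"

    parts = [f"SELECT {columns}", from_clause]
    if s.partition_field and s.partition_lookback is not None:
        # Restrict the scan to recent partitions before the row limit applies.
        parts.append(
            f"WHERE {s.partition_field} >= "
            f"DATE_SUB(CURRENT_DATE(), INTERVAL {s.partition_lookback} DAY)"
        )
    parts.append(f"LIMIT {s.row_limit}")
    return "\n".join(parts)


settings = SampleSettings(
    table="analytics.orders",
    fields=["order_id", "amount"],
    row_limit=50,
    sampling_percentage=10,
    partition_field="order_date",
    partition_lookback=7,
)
print(build_sample_query(settings))
```

In this sketch the partition filter and the sampling percentage reduce how much data is scanned, while the row limit only caps how many rows come back, which is why the table above recommends combining them on large datasets.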
Use Cases
Data Sampling is useful for:
- Spot-checking that actual values match the detected schema and data types.
- Verifying data patterns before configuring validators.
- Investigating unexpected values or formats flagged during profiling or validation.
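The first use case, spot-checking values against the detected schema, can also be scripted once sampled rows are available outside the dialog. The following Python sketch assumes the rows have been copied out as a list of dictionaries. The `find_type_mismatches` helper and the example field names are hypothetical and not part of the product.

```python
from typing import Any


def find_type_mismatches(
    rows: list[dict[str, Any]],
    expected: dict[str, type],
) -> list[tuple[int, str, Any]]:
    """Return (row index, field name, value) for every value whose Python
    type differs from the expected type. None is treated as a missing
    value rather than a mismatch."""
    mismatches = []
    for index, row in enumerate(rows):
        for name, expected_type in expected.items():
            value = row.get(name)
            if value is None:
                continue
            # bool is a subclass of int in Python, so reject it explicitly
            # unless a bool is what the schema expects.
            is_stray_bool = isinstance(value, bool) and expected_type is not bool
            if is_stray_bool or not isinstance(value, expected_type):
                mismatches.append((index, name, value))
    return mismatches


sample_rows = [
    {"order_id": 1, "amount": 9.5},
    {"order_id": "2", "amount": None},
    {"order_id": True, "amount": 3.0},
]
detected_schema = {"order_id": int, "amount": float}
print(find_type_mismatches(sample_rows, detected_schema))
```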