Schema & Profiling Tab

View and manage your source schema, run data profiling, and preview sample data.

The Schema & Profiling tab on each source and catalog asset gives you a central place to inspect your schema, profile your data, and preview sample rows.


Schema & Profiling tab for a source

Schema List

The table displays the detected schema for your source, including field names, data types, nullability settings, and profiled statistics.

From this section you can:

  • View the full list of fields and their data types.
  • View more information about individual schema fields in a sidepanel.
  • Search for specific fields, filter by data type, and sort the list order.
  • Update nullability settings and data types to better reflect your data structure. (The Update schema button is available for sources, but not catalog assets.)

For details on how Validio automatically derives schemas through metadata reading, inference, and manual configuration, see Schema Detection.

Data Profiling

Data Profiling provides on-demand statistical analysis of the fields in your source table to provide information for setting up validators. You can trigger a profiling run to calculate summary statistics for each field.

Profiling results are specific to the types of data. For example, results may include metrics such as:

  • Null percentages and unique values (for all data types)
  • Minimum, maximum, and mean values (for numeric fields)
  • Maximum and minimum length (for strings)

To profile your source, navigate to the Schema & Profiling tab, click Profile data, and configure the profile settings in the dialog. Results are displayed inline for each field once the run completes. For more information, see Data Profiling.

Data Sampling

Data Sampling returns a quick view of your source rows, letting you verify patterns and inspect actual values before you set up filters and segmentation.

To sample data, open the Schema & Profiling tab, click Sample data, and configure the sample size in the dialog. For more information, see Data Sampling.