Snowflake Data Profiler

The data profiler analyzes a sample of data against selected parameters, such as completeness, validity, and character count, and reports statistics for each of those parameters.
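To make these parameters concrete, the following is a minimal Python sketch of how such statistics could be computed for one column of sampled values. It is an illustration only, not the product's implementation: the function name `profile_column`, the statistic names, and the use of a regular expression for validity are all assumptions.

```python
import re
from statistics import mean


def profile_column(values, pattern=None):
    """Return simple profiling statistics for one column of sampled values.

    Hypothetical helper for illustration; the statistic names are assumptions.
    """
    total = len(values)
    # Treat None and blank strings as missing values.
    present = [v for v in values if v is not None and str(v).strip() != ""]
    lengths = [len(str(v)) for v in present]
    stats = {
        "row_count": total,
        "completeness_pct": round(100 * len(present) / total, 2) if total else 0.0,
        "min_char_count": min(lengths) if lengths else 0,
        "max_char_count": max(lengths) if lengths else 0,
        "avg_char_count": round(mean(lengths), 2) if lengths else 0.0,
    }
    if pattern is not None:
        # Validity: share of present values that match the whole pattern.
        regex = re.compile(pattern)
        valid = sum(1 for v in present if regex.fullmatch(str(v)))
        stats["validity_pct"] = round(100 * valid / len(present), 2) if present else 0.0
    return stats


# Sample column with one missing value, one blank value, and one invalid value.
sample = ["A-100", "A-101", None, "bad", ""]
result = profile_column(sample, pattern=r"[A-Z]-\d{3}")
print(result)
```

In this sketch, completeness is measured against all sampled rows, while validity is measured only against the values that are present. The actual profiler may define these ratios differently.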

  1. Add the data quality stage after the Data Lake stage, add a Data Profiler node to it, and connect the node to and from the data lake.

    DQ Snowflake Data Profiler

  2. Click the Data Profiler node and then click Create Job.

    SF Data Profiler Create Job

  3. Provide the following information to create the data profiler job:

  4. After the Data Profiler job is created, click the Data Profiler node and click Start to initiate the job run.

    Snowflake Data Profile Job run

    The status of the job can be one of the following:

    • Running

    • Completed

    • Failed

  5. After the job completes, click the Profiler Result tab and click View Profiler Results.

    DQ Snowflake Profiler Output

    On the Data Profiler Output screen, you can do the following:

    • View the results of the Data Profiler job.

    • Download the results as a CSV file by clicking the download icon.

    • Provide the pattern for validation and click Validate.

      Note: The pipeline must be in Edit mode for the Validate button to be enabled.

  6. Specify the validation pattern and click Validate to check the data in the selected columns against that pattern. A success message appears: Validation rules updated successfully.

  7. Run the validation job in one of the following ways:

    • Publish the changes, and then click Start.

    • Click the Data Profiler node, and then click Start.

After the job completes, you can view the results of the validation job on the Validated Profiler Result tab and download them as a CSV file.
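As a rough illustration of what pattern-based validation of selected columns involves, the Python sketch below checks each column against a regular expression and serializes a pass/fail summary as CSV. The rule format, the function names `validate_rows` and `to_csv`, the sample columns, and the CSV layout are assumptions made for this example. The pattern syntax that the validation job accepts and the layout of its downloaded results may differ.

```python
import csv
import io
import re


def validate_rows(rows, rules):
    """Check each selected column against its regular expression.

    rows:  list of dicts, one per record
    rules: dict mapping a column name to a regex pattern (hypothetical format)
    """
    compiled = {col: re.compile(pat) for col, pat in rules.items()}
    summary = []
    for col, regex in compiled.items():
        values = [row.get(col) for row in rows]
        # A value passes only if it is present and the whole string matches.
        passed = sum(1 for v in values if v is not None and regex.fullmatch(str(v)))
        summary.append({
            "column": col,
            "checked": len(values),
            "passed": passed,
            "failed": len(values) - passed,
        })
    return summary


def to_csv(summary):
    """Serialize the validation summary to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["column", "checked", "passed", "failed"])
    writer.writeheader()
    writer.writerows(summary)
    return buffer.getvalue()


# Hypothetical sample records and rules.
rows = [
    {"email": "ana@example.com", "zip": "94105"},
    {"email": "not-an-email", "zip": "9410"},
    {"email": None, "zip": "10001"},
]
rules = {"email": r"[^@\s]+@[^@\s]+\.[^@\s]+", "zip": r"\d{5}"}

summary = validate_rows(rows, rules)
print(to_csv(summary))
```

The sketch counts a missing value as a failure. Whether the validation job treats missing values the same way is not stated here, so verify that behavior against your own results.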
