Zum Hauptinhalt springen

Configure a Teradata Vantage connection in DataHub

Overview

This how-to demonstrates how to create a connection to Teradata Vantage with DataHub, and ingest metadata about tables and views, along with usage and lineage information.

Prerequisites

Setup DataHub

  • Install the Teradata plugin for DataHub in the environment where you have DataHub installed
  • Setup a Teradata user and set privileges to allow that user to read the dictionary tables
  • If you want to run profiling, you need to grant select permission on all the tables you want to profile.

  • If you want to extract lineage or usage metadata, query logging must be enabled and it is set to size which will fit for your queries (the default query text size Teradata captures is max 200 chars) An example how you can set it for all users:

Add a Teradata connection to DataHub

With DataHub running, open the DataHub GUI and login. In this example this is running at localhost:9002

  • Start the new connection wizard by clicking on the ingestion plug icon Ingestion Label

    and then selecting "Create new source"

Create New Source

  • Scroll the list of available sources and select Other Select Source

  • A recipe is needed to configure the connection to Teradata and define the options required such as whether to capture table and column lineage, profile the data or retrieve usage statistics. Below is a simple recipe to get you started. The host, username and password should be changed to match your environment.

Pasting the recipe into the window should look like this: New Ingestion Source

  • Click Next and then setup the required schedule. Set Schedule

  • Click Next to Finish Up and give the connection a name. Click Advanced so that the correct CLI version can be set. DataHub support for Teradata became available in CLI 0.12.x. Suggest selecting the most current version to ensure the best compatibility. Finish up

  • Once the new source has been saved, it can be executed manually by clicking Run. Execute

Clicking on "Succeeded" after a sucessful execution will bring up a dialogue similar to this one where you can see the Databases, Tables and Views that have been ingested into DataHub.
Ingestion Result

  • The metadata can now be explored in the GUI by browsing:
    • DataSets provides a list of the datasets (tables and views) loaded datasets
    • Entities captured from the database Entities
    • Schema of an entity showing column/field names, data types and usage if it has been captured Schema display
    • Lineage providing a visual representation of how data is linked between tables and views Lineage picture

Summary

This how-to demonstrated how to create a connection to Teradata Vantage with DataHub in order to capture metadata of tables, views along with lineage and usage statistics.

Further reading

Hinweis

If you have any questions or need further assistance, please visit our community forum where you can get support and interact with other community members.

Auch interessant