Run large bulkloads efficiently with Teradata Parallel Transporter (TPT)
Overview
We often have a need to move large volumes of data into Vantage. Teradata offers Teradata Parallel Transporter (TPT)
utility that can efficiently load large amounts of data into Teradata Vantage. This how-to demonstrates how to use TPT
. In this scenario, we will load over 300k records, over 40MB of data, in a couple of seconds.
Prerequisites
-
Access to a Teradata Vantage instance.
노트If you need a test instance of Vantage, you can provision one for free at https://clearscape.teradata.com
-
Download Teradata Tools and Utilities (TTU) - supported platforms: Windows, MacOS, Linux (requires registration).
Install TTU
- Windows
- MacOS
- Linux
Run in Powershell:
Unzip the downloaded file, go to the unzipped directory and run:
Get Sample data
We will be working with the US tax fillings for nonprofit organizations. Nonprofit tax filings are public data. The US Internal Revenue Service publishes them in S3 bucket. Let's grab a summary of filings for 2020: https://storage.googleapis.com/clearscape_analytics_demo_data/TPT/index_2020.csv
. You can use your browser, wget
or curl
to save the file locally.