TPump

From Wikipedia, the free encyclopedia

TPump is a Teradata utility.

Contents

[edit] Teradata TPump – Continuous Data Loading

Teradata TPump is a highly parallel utility designed to continuously move data from data sources into Teradata tables without locking the affected table. TPump provides near-real-time data into your data warehouse, allowing you to maintain fresh, accurate data for up-to-the-moment decision-making. You can use TPump to insert, update, upsert, and delete data in the Teradata Database, particularly for environments where batch windows are shrinking and warehouse maintenance overlaps normal working hours. And because TPump uses row hash locks, users can run queries even while it’s updating the Teradata Warehouse.

[edit] Features

  • Fast, scalable continuous data loads
  • Row hash lock enables concurrent queries
  • Dynamic throttling feature
  • Best for small data volumes

[edit] Supported Platforms

  • NCR UNIX SVR4 MP-RAS
  • IBM z/OS (MVS and USS)
  • z/OS VM
  • Microsoft Windows 2000, XP, and Server 2003
  • Sun Solaris SPARC
  • IBM
  • HP-UX

[edit] Overview

TPump is a data loading utility that helps you maintain (update, delete, insert, and atomic upsert) the the data in your Teradata Relational Database Management System (Teradata RDBMS). TPump allows you to achieve near real time data in your data warehouse. If your system is too busy to devote a designated batch window to upload data, then you need TPump.

TPump uses standard Teradata SQL to achieve moderate to high data loading rates to the Teradata RDBMS. Multiple sessions and multistatement request are typically used to increase throughput.

TPump provides an alternative to MultiLoad for the low volume batch maintenance of large databases under control of a Teradata system. Instead of updating Teradata databases overnight, or in batches throughout the day, TPump updates information in real time, acquiring every bit of data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the traditional batch updates. Continuous updates results in more accurate, timely data.

And, unlike most load utilities, TPump uses row hash locks rather than table level locks. This allows you to run queries while TPump is running. This also means that TPump can be stopped instantaneously. As a result, businesses can make better decisions that are based on the most current data.

TPump also provides a dynamic throttling feature that enables it to run “all out” during batch windows, but within limits when it may impact other business uses of the Teradata RDBMS. Operators can specify the number of statements run per minute, or may alter throttling minute-by-minute, if necessary.

TPump’s main attributes are:

  • Simple, hassle-free setup – doesn’t require staging of data, intermediary files, or special hardware.
  • High-end portability – supports IBM mainframes; UNIX® MP-RAS; AIX®; HP-UX®; Windows 98®, Windows NT®, Windows 2000®, and Windows XP®; and Solaris® SPARC.
  • Efficient, time-saving operation – jobs can continue running in spite of database restarts, dirty data, and network slow downs. Jobs can restart with absolutely no intervention.
  • Flexible data management – accepts an infinite variety of data forms from an infinite number of data sources, including direct feeds from other databases. TPump is also able to transform that data on the fly before sending it to Teradata. SQL statements and conditional logic are usable within the utilities, making it unnecessary to write wrapper jobs around the utilities.

[edit] External links

[edit] See also

In other languages