Teradata
M.S.Prasad 165916
Contents
Introduction to Teradata
Teradata Architecture
Data Distribution
PI Characteristics
Data Access
Teradata's Scalability
Data Protection Features
Introduction to Teradata
Teradata is a Relational Database Management System (RDBMS):
1. Designed to run the world's largest commercial databases
2. Preferred solution for enterprise data warehousing (OLAP)
3. Executes on UNIX MP-RAS or NT-based system platforms
4. Compliant with ANSI industry standards
5. Runs on single (SMP) or multiple (MPP) nodes
6. Acts as a database server to client applications throughout the enterprise
7. Uses parallelism to manage terabytes of data
8. Shared-Nothing architecture
Advantage Teradata
1. Unlimited, proven scalability
2. Unlimited parallelism - parallel sorts/aggregations, temporary tables, Shared-Nothing architecture
3. Mature optimizer - complex queries, many joins per query, ad-hoc processing; it is a cost-based optimizer
4. Model the business - 3NF, robust view processing, star schema
5. Lowest TCO - ease of setup and maintenance, robust parallel utilities, no re-orgs, lowest disk-to-data ratio, robust expansion utility
6. High availability - no single point of failure, scalable data loading, parallel load utilities

Note: If the table demographics are well defined, the optimizer will choose the best plan for the query execution.
Advantage Teradata
7. Enormous capacity - billions of rows, terabytes of data
8. High-performance parallel processing
9. Single database server for multiple clients - a single version of the truth
10. Network and mainframe connectivity
11. Industry-standard access language (SQL)
12. Manageable growth via modularity
13. Fault tolerance at all levels of hardware and software
14. Data integrity and reliability
Advantage Teradata DBA
Things Teradata DBAs NEVER have to do:
1. Reorganize data or index space
2. Pre-prepare data for loading (convert, sort, split, etc.)
3. Ensure that queries run in parallel
4. Unload/reload data spaces due to expansion
5. Design, implement, and support partition schemes
6. Write programs to figure out how to divide data into partitions
7. Write or run programs to split the input data into partitions for loading
They know that if data doubles, the system can expand easily to accommodate it. The workload for creating a table of 100,000 rows is the same as creating 1,000,000,000 rows!
Advantage Teradata Warehouse
[Diagram: operational data sources (ATM, MVS, POS) feed the Teradata data warehouse, which serves end users through access tools such as Cognos and BusinessObjects.]
Architecture
[Diagram: a channel-attached client application reaches Teradata through CLI and the TDP over the channel, arriving at the node via the Channel Driver. A network-attached client application reaches Teradata through CLI, MTDP, and MOSI over the LAN, arriving via the Teradata Gateway. Inside the node, the TPA runs on PDE over the operating system (UNIX/NT); Parsing Engines communicate over the BYNET with the AMPs, each of which manages its own VDisk.]
Architecture In Detail
Node
1. The basic building block of a Teradata system; the node is where database processing occurs.
2. A node is a general-purpose processing unit under the control of a single operating system.
3. A Teradata system contains one or more nodes:
   - Single node - Symmetric Multi-Processing (SMP)
   - Multiple nodes - Massively Parallel Processing (MPP)
Node Components:
1. Parsing Engine
2. BYNET
3. AMP
Understanding Node
[Diagram: within a node, the Parsing Engine sends work over the BYNET to four AMPs, each managing its own Vdisk.]
Node Components
Parsing Engine:
1. Manages individual sessions (up to 120)
2. Parses and optimizes your SQL requests
3. Dispatches the optimized plan to the AMPs
4. Performs ASCII/EBCDIC conversion (if necessary)
5. Sends the answer set response back to the requesting client

AMP:
1. Stores and retrieves rows to and from the disks
2. Lock management
3. Sorting rows and aggregating columns
4. Join processing
5. Output conversion and formatting
6. Creating answer sets for clients
7. Disk space management and accounting
8. Special utility protocols
9. Recovery processing

Vdisk:
A vdisk (pronounced "VEE-disk") is the logical disk space managed by an AMP.
Node - Other Components
Channel Driver:
Channel Driver software is the means of communication between the PEs and applications running on channel-attached (mainframe) clients.

Gateway:
The Teradata Gateway software is the means of communication between the PEs and applications running on:
1. LAN-attached clients
2. A node in the system

PDE:
The PDE (Parallel Database Extensions) software layer runs on the operating system of each node. It was created by NCR to support the parallel environment.

TPA:
A Trusted Parallel Application (TPA) uses PDE to implement virtual processors (vprocs). The Teradata RDBMS is classified as a TPA.
Other Components (Continued)

CLI (Call-Level Interface):
1. A library of routines for blocking/unblocking requests and responses to/from the RDBMS
2. Performs logon and logoff functions

TDP (Teradata Director Program):
1. Used by the mainframe host to communicate with the Teradata system.
2. Manages all traffic between the CLI and the Teradata system. Its functions include session initiation and termination, logging, verification, recovery, and restart.

MTDP (Micro Teradata Director Program):
Performs many of the TDP functions, including session management, but not session balancing.

MOSI (Micro Operating System Interface):
Provides an operating-system- and network-protocol-independent interface.
The Parsing Engine
[Diagram: an SQL request enters the Parsing Engine, whose components - Session Control, Parser, Optimizer, and Dispatcher - process it in turn; the optimized steps travel over the BYNET to the AMPs, and the answer set response returns to the client.]
Data Distribution
Hashing Algorithm

1. The Parsing Engine uses the hashing algorithm to distribute data across the AMPs. Distribution depends on the hash value of the Primary Index (PI). The hashing algorithm acts like a mathematical "blender": it takes up to 16 columns of mixed data as input and generates a single 32-bit binary value called a Row Hash.
2. Input to the algorithm is the Primary Index (PI) value of a row.
3. Row Hash uniqueness depends directly on PI uniqueness.
4. Good data distribution depends directly on Row Hash uniqueness.
5. The algorithm produces random, but consistent, Row Hashes.
6. The same PI value and data type combination always hashes identically.
7. Rows with the same Row Hash always go to the same AMP.
8. Different PI values rarely produce the same Row Hash (collisions).
Row Hash
1. A 32-bit binary value.
2. The logical storage location of the row.
3. Used to identify the AMP of the row.
4. Table ID + Row Hash is used to locate the cylinder and data block.
5. Used for distribution, placement, and retrieval of the row.
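The Row Hash behavior described above can be illustrated with a minimal Python sketch. Teradata's actual hashing algorithm is proprietary; MD5 is used here purely as a stand-in that shows the same properties (a single 32-bit value, consistent for the same input, almost never colliding for different inputs):

```python
import hashlib

def row_hash(pi_value):
    """Toy stand-in for Teradata's hashing algorithm: a PI value in,
    a single 32-bit Row Hash out. (MD5 here is only illustrative.)"""
    digest = hashlib.md5(str(pi_value).encode()).digest()
    return int.from_bytes(digest[:4], "big")   # 32-bit binary value

# The same PI value always hashes identically...
assert row_hash(19) == row_hash(19)
# ...and different PI values almost always hash differently.
assert row_hash(19) != row_hash(20)
```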
Primary Index (PI) Hash Mapping
[Diagram: the Primary Index value of a row is fed through the hashing algorithm to produce the 32-bit Row Hash. The first 16 bits form the Destination Selection Word (DSW), which indexes a memory-resident hash map of 65,536 entries; the selected entry names the AMP (AMP 0-9 in the diagram), and the Message Passing Layer (PDE and BYNET) delivers the row there.]
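The DSW lookup can be sketched as follows. The hash function and the round-robin bucket assignment are invented simplifications (Teradata's real hash map is built differently); only the mechanism is the point: 16 high-order bits of the Row Hash index a 65,536-entry map whose entries name AMPs.

```python
import hashlib

NUM_AMPS = 10   # matches the 10 AMPs (0-9) in the diagram above

# The hash map has 65,536 entries; entry i names the AMP that owns
# hash bucket i. Here buckets are dealt out round-robin (a simplification).
hash_map = [i % NUM_AMPS for i in range(65536)]

def amp_for(pi_value):
    rh = int.from_bytes(hashlib.md5(str(pi_value).encode()).digest()[:4], "big")
    dsw = rh >> 16            # DSW = the first (high-order) 16 bits
    return hash_map[dsw]

# Rows with the same Row Hash always land on the same AMP.
assert amp_for(19) == amp_for(19)
assert 0 <= amp_for(42) < NUM_AMPS
```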
Data Distribution
[Diagram: records arrive from the client in random sequence. Host data in EBCDIC is converted to ASCII by the Parsing Engine, each row is hashed, the Message Passing Layer distributes the rows across AMPs 1-4, and each AMP formats and stores its share of the rows.]
PI Characteristics
Primary Indexes (UPI and NUPI)
1. A Primary Index may be different from the Primary Key.
2. Every table has one, and only one, Primary Index.
3. A Primary Index may contain nulls.
4. Single-value access uses ONE AMP and, typically, one I/O.
Unique Primary Index (UPI)
1. Involves a single base table row at most. 2. No spool file is ever required. 3. The system automatically enforces uniqueness on the index value.
Non-Unique Primary Index (NUPI)
1. May involve multiple base table rows.
2. A spool file is created when needed.
3. Duplicate values go to the same AMP and the same data block.
4. Only one I/O is needed if all the rows fit in a single data block.
5. A duplicate-row check for a SET table is required if there is no USI on the table.
PI Considerations
ACCESS - Maximize one-AMP operations: choose the column most frequently used for access; consider both join and value access.
DISTRIBUTION - Optimize parallel processing: choose a column that provides good distribution.
VOLATILITY - Reduce maintenance resource overhead (I/O): choose a column with stable data values.

The column chosen for the PI should be unique or nearly unique to achieve good distribution of data: the better the distribution, the higher the parallelism.
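The distribution consideration can be demonstrated with a small experiment. The hash function is again an illustrative stand-in, and the column values are invented; the point is that a nearly unique PI spreads rows over all AMPs, while a low-cardinality PI leaves most AMPs idle:

```python
from collections import Counter
import hashlib

def amp_for(value, num_amps=4):
    rh = int.from_bytes(hashlib.md5(str(value).encode()).digest()[:4], "big")
    return rh % num_amps

# A nearly unique column (e.g. a customer id) reaches all four AMPs...
unique_pi = Counter(amp_for(i) for i in range(1000))
assert len(unique_pi) == 4

# ...while a two-valued column ("M"/"F") can use at most two AMPs,
# so at least half the system sits idle during that table's work.
skewed_pi = Counter(amp_for(g) for g in ["M", "F"] * 500)
assert len(skewed_pi) <= 2
```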
AMP Operations
Single AMP operation (Typical UPI access)
Multi-AMP operation
All AMP operation
Single AMP operation - Illustration
The SAMPLE table (NUMBER is the UPI):

NUMBER: 1  2  3  4  5  6  7  8  9  10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
LETTER: P  U  Y  T  R  E  W  Q  A  S  D  F  G  H  J  K  L  M  N  B  V  C  X  Z

SELECT LETTER
FROM   SAMPLE
WHERE  NUMBER = 19;

ANSWER: N
Single-AMP operation
Application to PE
[Diagram: two applications (APPL 1, APPL 2) connect through two PEs (PE 1, PE 2) to eight AMPs; the SAMPLE table's 24 rows are hash-distributed across the AMPs, three rows per AMP. The request is:

SELECT LETTER
FROM   SAMPLE
WHERE  NUMBER = 19;]
1. APPL 1 establishes a user session on PE 1.
2. APPL 1 sends the SQL request to the PE on the forward channel.
3. PE 1 acknowledges the message on the back channel.
4. PE 1 parses and optimizes the request.
PE to AMP
1. PE 1 produces a one-step plan as a message to the BYNET.
2. The BYNET uses the hash map to determine the destination: AMP 3.
3. The BYNET sends the message to AMP 3 on the forward channel.
4. AMP 3 acknowledges the message across the back channel.
AMP to PE
1. AMP 3 sends the answer set to PE 1 on the forward channel.
2. PE 1 acknowledges receipt across the back channel.
PE to Application
Single-AMP Query
1. PE 1 forwards response parcels to APPL 1 on the forward channel.
2. APPL 1 acknowledges the messages on the back channel.
3. APPL 1 processes the response and generates output.
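The single-AMP UPI retrieval can be sketched end to end. The data layout (a dictionary per AMP keyed by Row Hash) is an invented simplification, not Teradata's actual block structure; what it shows is that a UPI lookup hashes the value and touches exactly one AMP:

```python
# Each AMP stores rows keyed by Row Hash; a UPI lookup hashes the
# predicate value and asks only the owning AMP.
def row_hash(v):
    return hash(v) & 0xFFFFFFFF   # toy 32-bit hash of an integer PI

NUM_AMPS = 8
amps = [dict() for _ in range(NUM_AMPS)]

def insert(number, letter):
    rh = row_hash(number)
    amps[rh % NUM_AMPS][rh] = (number, letter)

def select_by_upi(number):
    rh = row_hash(number)
    return amps[rh % NUM_AMPS].get(rh)   # exactly one AMP is touched

for n, l in [(19, "N"), (13, "G"), (4, "T")]:
    insert(n, l)
assert select_by_upi(19) == (19, "N")
```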
All-AMP operation with a Sort
The SAMPLE table (NUMBER is the UPI):

NUMBER: 1  2  3  4  5  6  7  8  9  10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
LETTER: P  U  Y  T  R  E  W  Q  A  S  D  F  G  H  J  K  L  M  N  B  V  C  X  Z

SELECT   NUMBER, LETTER
FROM     SAMPLE
WHERE    NUMBER > 9
ORDER BY LETTER;

ANSWER:
20 B, 22 C, 11 D, 12 F, 13 G, 14 H, 15 J, 16 K, 17 L, 18 M, 19 N, 10 S, 21 V, 23 X, 24 Z
Application to PE
All-AMP Query with Sort
SQL Request
SELECT   NUMBER, LETTER
FROM     SAMPLE
WHERE    NUMBER > 9
ORDER BY LETTER;
1. APPL 1 establishes a user session on PE 1. 2. APPL 1 sends the SQL request to the PE on the forward channel. 3. PE 1 acknowledges the message on the back channel. 4. PE 1 parses and optimizes the request.
PE to AMPs
All-AMP Query with Sort
1. PE 1 produces a three-step plan.
2. PE 1 gives the first step to the BYNET to send to all AMPs.
3. The BYNET sends the step over the forward channel to all AMPs.
4. All AMPs acknowledge receipt over the back channel.
PE to AMPs
All-AMP Query with Sort
1. PE1 sends out step 2 over the BYNET. 2. BYNET sends step to all AMPs.
AMPs to Merge Process
All-AMP Query with Sort
Each AMP sends its first block of sorted data to BYNET merge process.
AMP to Merge
All-AMP Query with Sort
Plan
1. GET NUMBER, LETTER WHERE NUMBER > 9 2. SORT ON LETTER 3. MERGE ON LETTER
1. The merge process continues to request sorted blocks from the AMPs until all AMPs have exhausted their spool supply.
2. When the merge process has an EOF from each AMP, the answer set is complete. Note: Spool is a temporary space used by the AMPs to store the intermediate results.
PE to Application
All-AMP Query with Sort
PE1 then sends the answer set to the requesting application.
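The three-step plan (select, local sort into spool, global merge) can be sketched in Python. The data values are invented, and `heapq.merge` stands in for the BYNET merge process; the point is that each AMP sorts only its own spool, and a single merge of the already-sorted streams yields the globally ordered answer set:

```python
import heapq, random

# Build a toy table of (NUMBER, LETTER) rows and spread it over 4 AMPs.
rows = [(n, chr(ord("A") + (n * 7) % 26)) for n in range(1, 25)]
random.shuffle(rows)
amps = [rows[i::4] for i in range(4)]

# Steps 1-2, per AMP: select NUMBER > 9 and sort the spool on LETTER.
spools = [sorted((letter, n) for n, letter in amp if n > 9) for amp in amps]

# Step 3: the merge process combines the sorted spool streams.
answer = list(heapq.merge(*spools))

assert answer == sorted(answer)          # globally ordered by LETTER
assert all(n > 9 for _, n in answer)     # predicate applied on every AMP
```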
Linear Growth and Expandability
[Diagram: as nodes are added, Parsing Engines, AMPs, and disk space grow in proportion across the nodes.]
Components may be added as requirements grow, without loss of performance:
- Double the number of AMPs while the number of users remains the same - performance will double.
- Double the number of AMPs and double the number of users - performance will stay the same.
Teradata is linearly expandable
Data Protection
Teradata provides the following data protection features:

Protection Method      Type
Locks                  Software
Fallback               Software
RAID Protection        Software
Cliques                Hardware
Transient Journal      Software
Permanent Journal      Software
Archive and Restore    Software
Locks
There are four types of locks:
- Exclusive - prevents any other type of concurrent access
- Write - prevents other Read, Write, and Exclusive locks
- Read - prevents Write and Exclusive locks
- Access - prevents Exclusive locks only

Locks may be applied at three database levels:
- Database - applies to all tables/views in the database
- Table/View - applies to all rows in the table/view
- Row Hash - applies to all rows with the same row hash

Lock types are automatically applied based on the SQL command:
- SELECT applies a Read lock
- UPDATE applies a Write lock
- CREATE TABLE applies an Exclusive lock
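The lock rules above amount to a compatibility table, which can be sketched and checked like this (a minimal model, not Teradata's lock manager):

```python
# For each held lock type, the set of request types it does NOT block,
# per the rules above: Exclusive blocks everything, Write allows only
# Access, Read allows Access and Read, Access blocks only Exclusive.
COMPATIBLE = {
    "Access":    {"Access", "Read", "Write"},
    "Read":      {"Access", "Read"},
    "Write":     {"Access"},
    "Exclusive": set(),
}

def can_grant(requested, held):
    """A request is granted only if every held lock is compatible with it."""
    return all(requested in COMPATIBLE[h] for h in held)

assert can_grant("Access", ["Write"])         # Access blocked only by Exclusive
assert not can_grant("Read", ["Write"])       # Write blocks a Read request
assert not can_grant("Access", ["Exclusive"]) # Exclusive blocks everything
```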
Access Locks
Advantages of Access locks:
1. Permit quicker access to a table in a multi-user environment.
2. Have minimal blocking effect on other queries.
3. Very useful for aggregating large numbers of rows.

Disadvantage of Access locks:
1. May produce erroneous results if performed during table maintenance; for this reason an Access lock is sometimes called a "Dirty-Read" or "Stale-Read" lock.

Rule: Lock requests are queued behind all outstanding incompatible lock requests for the same object. A new ACCESS lock request, however, is granted immediately.
Fallback
Fallback is a software mechanism. The fallback row is a copy of a primary row stored on a different AMP. A fallback table is fully available in the event of an unavailable AMP.
[Diagram: two PEs connect over the BYNET to AMPs 1-4. Each AMP holds primary rows plus fallback copies of rows whose primary copies live on other AMPs, so every row exists on two different AMPs.]
Benefits of Fallback:
1. Permits access to table data during an AMP off-line period
2. Adds a level of data protection beyond disk array RAID
3. Automatically restores data changed during the AMP off-line period
4. Critical for high-availability applications

Costs of Fallback:
1. Twice the disk space for table storage
2. Twice the I/O for INSERTs, UPDATEs, and DELETEs
Fallback Cluster
1. A defined number of AMPs treated as a fault-tolerant unit.
2. Fallback rows for AMPs in a cluster reside within the cluster.
3. Loss of one AMP in the cluster permits continued table access.
4. Loss of two AMPs in the same cluster causes the RDBMS to halt.
Two Clusters of Four AMPs Each

[Diagram: two clusters of four AMPs each (AMPs 1-4 and AMPs 5-8). Each AMP stores its own primary rows plus fallback copies of rows from the other AMPs in its cluster.]
- Lose AMP 3 from the first cluster: AMPs 1, 2, and 4 experience a 33% increase in workload.
- Lose AMP 6 from the second cluster: AMPs 5, 7, and 8 experience a 33% increase in workload.
- Then lose AMP 7 as well (a second AMP down in the same cluster): the system halts.

System performance can be adversely affected when any AMP carries a disproportionate burden.
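The cluster availability rule can be sketched as a small check: with fallback, the system survives any number of single-AMP failures spread across clusters, but halts as soon as two AMPs in the same cluster are down.

```python
# Two clusters of four AMPs each, as in the diagram above.
clusters = [{1, 2, 3, 4}, {5, 6, 7, 8}]

def system_state(down_amps):
    """Fallback keeps data available unless a cluster loses two AMPs."""
    for cluster in clusters:
        if len(cluster & down_amps) >= 2:
            return "halt"
    return "available"

assert system_state({3}) == "available"      # AMP 3 down: its cluster absorbs it
assert system_state({3, 6}) == "available"   # one AMP down per cluster: still up
assert system_state({6, 7}) == "halt"        # two down in the same cluster
```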
Fallback vs. Non-Fallback Tables
FALLBACK TABLES
ONE AMP DOWN: data fully available.

TWO OR MORE AMPs DOWN:
- In different clusters: data fully available
- In the same cluster: system halts
NON-FALLBACK TABLES
ONE AMP DOWN: data partially available; queries avoiding the down AMP succeed.

TWO OR MORE AMPs DOWN:
- In different clusters: data partially available; queries avoiding the down AMPs succeed
- In the same cluster: system halts
Raid Protection
Two types of disk array protection:

RAID-1 (Mirroring):
1. Each physical disk in the array has an exact copy in the same array.
2. The array controller can read from either disk and write to both.
3. When one disk of the pair fails, there is no change in performance.
4. Mirroring reduces available disk space by 50%.
5. The array controller reconstructs failed disks quickly.

RAID-5 (Parity):
1. For every 3 blocks of data, there is a parity block on a 4th disk.
2. A parity algorithm is applied to determine the parity block.
3. If a disk fails, any missing block may be reconstructed using the other three disks.
4. Parity reduces available disk space by 25% in a 4-disk rank.
5. Array controller reconstruction of failed disks takes longer than with RAID-1.
Summary
RAID-1: good performance with disk failures; higher cost in terms of disk space.
RAID-5: reduced performance with disk failures; lower cost in terms of disk space.
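The RAID-5 reconstruction described above rests on XOR parity, which a few lines of Python can demonstrate (toy 4-byte blocks; real arrays work on whole disk stripes):

```python
# RAID-5 sketch: the parity block is the XOR of the three data blocks,
# so any one lost block equals the XOR of the three survivors.
def xor_blocks(*blocks):
    return bytes(a ^ b ^ c for a, b, c in zip(*blocks))

d0, d1, d2 = b"AAAA", b"BBBB", b"CCCC"
parity = xor_blocks(d0, d1, d2)

# The disk holding d1 fails: rebuild it from the other two plus parity.
rebuilt = xor_blocks(d0, d2, parity)
assert rebuilt == d1
```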
Recovery Journal For Down AMPs
The Recovery Journal is:
- Automatically activated when an AMP is taken off-line
- Maintained by the other AMPs in the cluster
- Totally transparent to users of the system

While the AMP is off-line:
- The journal is active
- Table updates continue as normal
- The journal logs the Row IDs of rows changed on behalf of the down AMP

When the AMP is back on-line:
- The journal restores rows on the recovered AMP to current status
- The journal is discarded when recovery is complete
[Diagram: with AMP 1 off-line, AMPs 2-4 continue to apply updates, and each records in its Recovery Journal (RJ) the Row IDs of rows changed on behalf of the down AMP.]
Cliques
A clique (pronounced "cleek") is a grouping of a set of nodes: two or more TPA nodes having access to the same disk arrays are called a clique.
[Diagram: four SMP nodes (SMP 1-4) host AMPs 1-8 and share access to the same disk array cabinets.]
AMP vprocs can run on any node within the clique and still have full access to their disk array space. If a node fails, AMPs migrate to another node in the clique.
[Diagram: after one node fails, its AMPs migrate to the surviving nodes in the clique, which retain full access to the shared disk arrays.]
Note: Failure of a Node within a Clique increases the workload for the other Nodes within the clique
Transient Journal

1. Consists of a journal of transaction "before images."
2. Provides rollback in the event of transaction failure.
3. Is automatic and transparent.
4. Before images are reapplied to the table if the transaction fails.
5. Before images are discarded upon transaction completion.

Successful transaction:
BEGIN TRANSACTION
UPDATE Row A (add $100 to checking)       -> before image of Row A recorded
UPDATE Row B (subtract $100 from savings) -> before image of Row B recorded
END TRANSACTION                           -> before images discarded

Failed transaction:
BEGIN TRANSACTION
UPDATE Row A                              -> before image of Row A recorded
UPDATE Row B                              -> before image of Row B recorded
(Failure occurs)
(Rollback occurs)                         -> before images reapplied
(Terminate TXN)                           -> before images discarded
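The before-image mechanism can be sketched in a few lines. The table and row names are invented; the point is the protocol: journal the old value before every update, reapply the journal on failure, discard it on commit.

```python
# Transient-journal sketch: before each UPDATE the old value is journaled;
# on failure the before-images are reapplied, on END TRANSACTION discarded.
table = {"checking": 500, "savings": 800}
journal = []

def update(row, delta):
    journal.append((row, table[row]))   # record the before-image first
    table[row] += delta

def end_transaction():
    journal.clear()                     # success: discard before-images

def rollback():
    for row, before in reversed(journal):
        table[row] = before             # reapply before-images, newest first
    journal.clear()

update("checking", +100)                # add $100 to checking
update("savings", -100)                 # subtract $100 from savings
rollback()                              # simulate a mid-transaction failure
assert table == {"checking": 500, "savings": 800}
```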
The Permanent Journal
An optional, user-specified, system-maintained journal used for database recovery to a specified point in time.

1. Used for recovery from unexpected hardware or software disasters. May be specified for one or more tables, or one or more databases.
2. Permits capture of BEFORE images for database rollback.
3. Permits capture of AFTER images for database rollforward.
4. Permits archiving of change images during table maintenance.
5. Reduces the need for full-table backups.
6. Provides a means of recovering NO FALLBACK tables.
7. Requires additional disk space for change images.
8. Requires user intervention for archive and recovery activity.

Note: The user cannot directly query the permanent journal table. The Permanent Journal occupies permanent space and hence needs to be cleaned up periodically.
Archiving and Recovering Data
ARC Utility
1. The Archive/Restore utility
2. Runs on IBM, UNIX, and NT
3. Archives data from the RDBMS
4. Restores data from archive media
5. Permits data recovery to a specified checkpoint
Common uses of ARC
1. Dump database objects for backup or disaster recovery
2. Restore non-fallback tables after disk failure. 3. Restore tables after corruption from failed batch processes.
4. Recover accidentally dropped tables, views, or macros.
5. Recover from miscellaneous user errors.
Summary
In today's session we have learnt about:

1. The Teradata architecture and how it achieves parallelism and scalability.
2. The concept of the Shared-Nothing architecture.
3. The way data is distributed using the hashing algorithm.
4. The significance of the PI in row distribution.
5. How data rows are fetched.
6. The various protection features in Teradata.
References
Teradata Basics - official curriculum, published by the NCR Teradata Solutions Group