Components & Runtime Behaviour

The document outlines various components of AbInitio, categorized into folders such as Sort, Transform, Departition, Partition, Datasets, Database, Miscellaneous, and Validate. Each component is described with its parameters and functionalities, detailing operations like sorting, filtering, joining, and partitioning data records. Additionally, it includes examples of string functions and programming constructs used within the AbInitio environment.

Uploaded by

lakshmikrishnappa513

AbInitio Components:

==============================
Sort Folder
--------------
Sort
Sort Within Groups
Partition by key and sort

Transform Folder
-------------------
Dedup Sorted
Filter By Expression
Fuse
Join
Normalize
ReFormat
RollUp
Scan

Departition
----------------
Concatenate
Gather
InterLeave
Merge

Partition
-------------
Partition By Round-Robin
Partition By Key
Partition By Expression
Partition By Range
Partition By Percentage
Partition with Load Balance
Broadcast

Datasets
--------------
Input File
Output File
LookUp File & Dynamic LookUp File
Intermediate File
Input Table
Output Table

Database
---------------
Input Table
Output Table
Run SQL
Update Table
Truncate Table

Miscellaneous
-------------------
Gather Logs
Meta Pivot
Redefine Format
Replicate
Run Program
Trash

Validate
-----------
Generate Records
Validate Records
Check Order
Compare Records
Compute Checksums
Compare Checksum

AbInitio Components
----------------------
Runtime Behaviour
Parameters

=============
Sort
=============
Sorts the records arriving on its in port by the specified key and writes the
sorted records to its out port.
Parameters:
Key: The field (column) or fields to sort on. Records are sorted on this key and
sent to the out port.
Max_Core: 10MB
The maximum memory the component may use to perform the entire operation.

===================
Sort within group
====================
The data should be already sorted based on one key
Parameters:
MajorKey
MinorKey
Max_Core:10MB
Allow unsorted:false
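
To make the major/minor key idea concrete, here is a minimal Python sketch (an emulation, not Ab Initio code). The field names `dept` and `emp` and the function name are made up for illustration. It assumes the input is already grouped by the major key.

```python
from itertools import groupby
from operator import itemgetter

def sort_within_groups(recs, major_key, minor_key):
    """Sort each run of equal major-key records by the minor key.

    Assumes recs is already ordered (or at least grouped) by major_key.
    """
    out = []
    for _, group in groupby(recs, key=itemgetter(major_key)):
        out.extend(sorted(group, key=itemgetter(minor_key)))
    return out

# Hypothetical records, already sorted on the major key "dept".
records = [
    {"dept": 10, "emp": "raj"},
    {"dept": 10, "emp": "anu"},
    {"dept": 20, "emp": "zed"},
    {"dept": 20, "emp": "bob"},
]
result = sort_within_groups(records, "dept", "emp")
```

Only one group is held in memory at a time, which is why this can be cheaper than a full sort on both keys.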

==========================
Partition by key and sort
==========================
Repartitions the data records by key values and then sorts the records within each
partition. The number of input and output partitions can be different.
Parameters:
Key
InputLayout
MaxCore
OutPutLayout

====================
Dedup Sorted:
====================
Dedup Sorted separates one specified data record in each group of data records from
the rest of the records in the group.

Dedup Sorted requires grouped input.

input port --> output port, duplicate port

Optional ports: reject, error, log
Parameters:
Key: Name(s) of the key field(s) you want Dedup Sorted to use when determining
groups of data records.
Select: Filter applied to records before Dedup Sorted separates duplicates.
Keep: first, last, unique-only
first keeps the first record of a group. This is the default.
last keeps the last record of a group.
unique-only keeps only records with unique key values.
logging
reject-threshold
-----------------
Abort on first reject: the component stops the execution of the graph at the
first reject event it generates.
Never abort: the component does not stop the execution of the graph, no matter how
many reject events it generates.
Use ramp/limit: the component uses the settings in the ramp and limit parameters
to determine how many reject events to allow before it stops the execution of the
graph.
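
The three keep options can be sketched in Python as follows. This is an emulation under assumptions, not the component itself: the function name and record fields are hypothetical, and it assumes that records not kept are the ones routed to the duplicate port.

```python
from itertools import groupby
from operator import itemgetter

def dedup_sorted(recs, key, keep="first"):
    """Split grouped records into (out, dup) lists according to keep."""
    out, dup = [], []
    for _, group in groupby(recs, key=itemgetter(key)):
        group = list(group)
        if keep == "first":
            out.append(group[0])
            dup.extend(group[1:])
        elif keep == "last":
            out.append(group[-1])
            dup.extend(group[:-1])
        elif keep == "unique-only":
            # Assumption: every record of a multi-record group is a duplicate.
            if len(group) == 1:
                out.append(group[0])
            else:
                dup.extend(group)
        else:
            raise ValueError("unknown keep value: " + keep)
    return out, dup

# Hypothetical input, already grouped by key "k".
records = [{"k": 1, "v": "a"}, {"k": 1, "v": "b"}, {"k": 2, "v": "c"}]
first_out, first_dup = dedup_sorted(records, "k", "first")
last_out, last_dup = dedup_sorted(records, "k", "last")
uniq_out, uniq_dup = dedup_sorted(records, "k", "unique-only")
```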

=========================
Filter By Expression
=========================
Filter by Expression filters data records according to a specified DML expression.
It is comparable to the WHERE clause of a SQL SELECT statement.
DML functions, including lookup functions, can be used in the select expression.
input port
output port, deselect port
Optional ports: reject, error, log
Parameters:
Select expr: The condition used to filter data records.
reject-threshold
logging
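
As a rough Python analogy (not DML), the select expression acts as a predicate that routes each record either to the output port or to the deselect port. The function name and the `amount` field are hypothetical.

```python
def filter_by_expression(recs, select):
    """Route records to (out, deselect) depending on the select predicate."""
    out, deselect = [], []
    for rec in recs:
        (out if select(rec) else deselect).append(rec)
    return out, deselect

# Hypothetical records; the predicate plays the role of the select expression.
records = [{"id": 1, "amount": 3}, {"id": 2, "amount": 9}, {"id": 3, "amount": 12}]
selected, deselected = filter_by_expression(records, lambda r: r["amount"] > 5)
```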

=============
ReFormat
=============
Reformat changes the record format of data records by dropping fields, or by using
DML expressions to add fields, combine fields, or transform the data in the
records.

Example of a lookup expression used in a transform:
lookup("lkp_file", in.ProductID).Category
Parameters:
Count
select
transform()
reject-threshold
logging
Output_index
out::output_index(in)=
begin

end;
Output_indexes

=================
LookUp File
=================
A lookup file is used as a reference file. Input records are matched to records in
the lookup file on a key, so the output can contain the required columns from both.
Key: The column(s) on which records are matched.
RecordFormat: Specifies the columns in the lookup file.

==============
Join
==============
2 input ports(default)
1 output port
2 unused ports(default)
2 reject ports(default)
2 error ports(default)
1 log port(default)
Parameters:
count
sorted-input
key
transform
join-type
record-required0
record-required1
dedup0
dedup1
select0
select1
override-key0
override-key1
driving
maintain-order
max-core
reject-threshold
logging
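
The effect of the key and join-type parameters can be sketched in Python. This is a simplified emulation under assumptions: only an inner join and a full outer join are shown, the `join_type` strings are hypothetical labels rather than the component's actual parameter values, and the transform is reduced to returning record pairs.

```python
from collections import defaultdict

def join(in0, in1, key, join_type="inner"):
    """Pair records from in0 and in1 that share the same key value.

    With join_type="outer", unmatched records are paired with None.
    """
    index1 = defaultdict(list)
    for r1 in in1:
        index1[r1[key]].append(r1)

    out, matched = [], set()
    for r0 in in0:
        matches = index1.get(r0[key], [])
        if matches:
            matched.add(r0[key])
            out.extend((r0, r1) for r1 in matches)
        elif join_type == "outer":
            out.append((r0, None))
    if join_type == "outer":
        for k, rs in index1.items():
            if k not in matched:
                out.extend((None, r1) for r1 in rs)
    return out

# Hypothetical inputs keyed on "dn".
emps = [{"dn": 10, "name": "a"}, {"dn": 30, "name": "c"}]
depts = [{"dn": 10, "dname": "HR"}, {"dn": 20, "dname": "IT"}]
inner = join(emps, depts, "dn", "inner")
outer = join(emps, depts, "dn", "outer")
```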

===================
Checkpoint Sort
===================
Parameters:
Key
Max_Core:100 MB

========
Fuse
========
Fuse combines multiple input flows into a single output flow by applying a
transform function to corresponding records of each flow.
2 input ports, 1 output port
Optional ports: reject, log, error
Parameters:
Count
Transform
Reject-Threshold
Logging
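
In Python terms, fusing resembles zipping flows together and applying a function to each tuple of corresponding records. This sketch assumes both flows have the same length; the function and field names are hypothetical.

```python
def fuse(flow0, flow1, transform):
    """Apply transform to corresponding records of two flows."""
    return [transform(a, b) for a, b in zip(flow0, flow1)]

# Hypothetical flows whose records line up position by position.
names = [{"name": "a"}, {"name": "b"}]
scores = [{"score": 1}, {"score": 2}]
fused = fuse(names, scores, lambda a, b: {"name": a["name"], "score": b["score"]})
```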

=============
Normalize
=============
Generates multiple output data records from each input data record.
Normalize can separate a data record with a vector field into several individual
records, each containing one element of the vector.
Parameters:
transform
reject-threshold
Logging
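
A minimal Python sketch of the vector case described above: one input record with a vector field becomes one output record per element. The field names `phones` and `phone` and the function name are made up for illustration.

```python
def normalize(rec, vector_field, element_field):
    """Emit one record per element of rec[vector_field]."""
    rest = {k: v for k, v in rec.items() if k != vector_field}
    return [{**rest, element_field: element} for element in rec[vector_field]]

# Hypothetical record holding a vector of phone numbers.
customer = {"id": 1, "phones": ["5432167898", "1234567654"]}
rows = normalize(customer, "phones", "phone")
```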

=========
RollUp
=========
Generates data records that summarize groups of data records. Rollup in Memory
maximizes performance by keeping intermediate results in main memory.
Parameters:
sorted-input
key-method: key specifier or key change function
key
transform
reject-threshold
logging
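
The summarizing behaviour can be illustrated with a small Python sketch. It is an emulation that assumes sorted (grouped) input, and the output field names `count` and `total` are hypothetical choices standing in for whatever the transform computes.

```python
from itertools import groupby
from operator import itemgetter

def rollup(recs, key, value):
    """Emit one summary record (count and sum) per group of equal key."""
    out = []
    for k, group in groupby(recs, key=itemgetter(key)):
        values = [r[value] for r in group]
        out.append({key: k, "count": len(values), "total": sum(values)})
    return out

# Hypothetical input, already grouped by "region".
sales = [
    {"region": "east", "amount": 10},
    {"region": "east", "amount": 5},
    {"region": "west", "amount": 7},
]
summary = rollup(sales, "region", "amount")
```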

=======
Scan
=======
Generates a series of cumulative summary records--such as year-to-date totals--for
groups of data records. Scan Sorted requires grouped input.
Parameters:
sorted-input
key-method: key specifier or key change function
key
transform
reject-threshold
logging
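
Where a rollup emits one record per group, a scan emits one record per input record carrying the running value so far. The following Python sketch emulates that for a running total; it assumes grouped input, and the field name `running_total` is hypothetical.

```python
from itertools import groupby
from operator import itemgetter

def scan(recs, key, value):
    """Emit every record with a running total that restarts for each group."""
    out = []
    for _, group in groupby(recs, key=itemgetter(key)):
        running = 0
        for rec in group:
            running += rec[value]
            out.append({**rec, "running_total": running})
    return out

# Hypothetical input, already grouped by "year".
sales = [
    {"year": 2000, "amount": 10},
    {"year": 2000, "amount": 5},
    {"year": 2001, "amount": 7},
]
cumulative = scan(sales, "year", "amount")
```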

===========
Concatenate
============
Appends multiple flow partitions of data records one after another.

========
Gather
=========
Combines data records from multiple flow partitions arbitrarily.

===========
InterLeave
===========
Combines blocks of data records from multiple flow partitions in round-robin
fashion.
Parameters:
Blocksize

=========
Merge
=========
Combines data records from multiple flow partitions that have been sorted according
to the key specifier, and maintains the sort order.
Parameters:
key
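
Merging sorted partitions while keeping the order is the classic k-way merge. A Python sketch using the standard library follows; the partition contents and the function name are hypothetical.

```python
import heapq

def merge(partitions, key):
    """Merge partitions that are each sorted on key, preserving the order."""
    return list(heapq.merge(*partitions, key=lambda rec: rec[key]))

# Hypothetical partitions, each already sorted on "id".
p0 = [{"id": 1}, {"id": 4}, {"id": 6}]
p1 = [{"id": 2}, {"id": 3}, {"id": 5}]
merged = merge([p0, p1], "id")
```

If the partitions were not sorted on the key, the result would not be sorted either, which is why the sorted-input requirement matters.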

=========================
Partition By Round-Robin
=========================
Distributes data records evenly to each output flow in round-robin fashion.

Use the Interleave component to reverse the effects of Partition by Round-robin.

Parameters:
Blocksize
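
The relationship between round-robin partitioning and interleaving can be checked with a short Python emulation. The function names are hypothetical, and `blocksize` here is the number of consecutive records written to (or read from) a partition before moving to the next.

```python
def partition_round_robin(recs, n_partitions, blocksize=1):
    """Deal blocks of records to partitions in round-robin order."""
    parts = [[] for _ in range(n_partitions)]
    for i, rec in enumerate(recs):
        parts[(i // blocksize) % n_partitions].append(rec)
    return parts

def interleave(parts, blocksize=1):
    """Read blocks from each partition in turn until all are exhausted."""
    out = []
    pos = [0] * len(parts)
    while any(p < len(part) for p, part in zip(pos, parts)):
        for i, part in enumerate(parts):
            out.extend(part[pos[i]:pos[i] + blocksize])
            pos[i] += blocksize
    return out

records = list(range(7))
parts = partition_round_robin(records, 3, blocksize=2)
restored = interleave(parts, blocksize=2)
```

With the same block size on both sides, interleaving restores the original record order.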

================
Partition By Key
================
Distributes data records to its output flow partitions according to key values.
Parameters:
key
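
The useful property of key-based partitioning is that records with the same key value always land in the same partition. A Python sketch follows, assuming a hash of the key picks the partition. The CRC32 hash is an illustrative choice, not the component's actual hashing method.

```python
import zlib

def partition_by_key(recs, key, n_partitions):
    """Send each record to the partition chosen by hashing its key value."""
    parts = [[] for _ in range(n_partitions)]
    for rec in recs:
        h = zlib.crc32(str(rec[key]).encode("utf-8"))
        parts[h % n_partitions].append(rec)
    return parts

# Hypothetical records; "cust" is the partitioning key.
records = [{"cust": c, "n": i} for i, c in enumerate(["a", "b", "a", "c", "b"])]
parts = partition_by_key(records, "cust", 3)
```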

========================
Partition By Expression
========================
Distributes data records to its output flow partitions according to a specified DML
expression.
Parameters:
Function

==================
Partition By Range
==================
Distributes data records to its output flow partitions according to ranges of key
values specified for each partition.
Parameters:
key
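
Range partitioning can be pictured as a search over sorted split points. In this Python sketch the split values are hypothetical, and it assumes a key equal to a split value belongs to the lower partition.

```python
from bisect import bisect_left

def partition_by_range(recs, key, split_points):
    """Place each record in the partition whose key range contains it.

    split_points must be sorted; they define len(split_points) + 1 ranges.
    """
    parts = [[] for _ in range(len(split_points) + 1)]
    for rec in recs:
        parts[bisect_left(split_points, rec[key])].append(rec)
    return parts

# Hypothetical keys and split points: <=10, 11..20, >20.
records = [{"k": v} for v in [5, 10, 11, 20, 25]]
parts = partition_by_range(records, "k", [10, 20])
```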

========================
Partition By Percentage
========================
Distributes a specified percentage of the total number of input data records to
each output flow.
Parameters:
Percentages

=============================
Partition with Load Balance
=============================
Distributes data records to output flow partitions, writing more records to the
flow partitions that consume records faster.

===========
Broadcast
===========
Distributes data by combining input data records into a single flow and writing a
copy of that flow to each output flow partition.

===========
Gather Logs
===========
Collects the output from log ports of components for analysis of a graph after
execution.
Parameters:
LogFile
StartText
EndText

============
Meta Pivot
============
Pivots around one or more fields in the input
Parameters:
name_field
value_field
pivot1
pivot2
pivot3

=============
Redefine Format
=============
Copies data records from its input to its output without changing the values. Use
Redefine Format to change a record format or rename fields.

=============
Replicate
=============
Arbitrarily combines all the data records it receives into a single flow and writes
a copy of that flow to each of its output flows.

===============
Run Program
===============
Executes a standard UNIX or Windows NT program.
Parameters:
commandline

=======
Trash
=======
Ends a flow by discarding all input data records.

===========
LookUp File
===========
Lookup Files are components containing shared data. Use lookup files with the DML
lookup functions to access records according to a key.
Parameters:
key
RecordFormat

String functions
===================
string_length("abc def")->7
string_length("")->0
string_compare("aaa","bbb")-> -1
string_compare("bbb","aaa")->1

string_index("ABCD,FG,HJ,KL",",")->5
string_index("to be late be","be")->4
string_index("abc","x")->0
string_index("abc","")->1

string_substring("abcdefgh",3,4)->cdef
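
The examples above use 1-based positions, with 0 meaning "not found". The following Python emulation reproduces those results. It is a sketch of the behaviour shown in these notes, not the DML implementation.

```python
def string_index(s, sub):
    """Return the 1-based position of the first occurrence of sub, or 0."""
    return s.find(sub) + 1

def string_substring(s, start, length):
    """Return length characters of s beginning at 1-based position start."""
    return s[start - 1:start - 1 + length]

first_comma = string_index("ABCD,FG,HJ,KL", ",")
piece = string_substring("abcdefgh", 3, 4)
```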

string("|") str = "ABCD,FG,HJ,KL";
integer(4) l = string_length(str);-->13
integer(4) n = string_index(str,",");-->5
string_substring(str,1,n-1)-->ABCD
l=l-n;-->8
string_substring(str,n+1,l);-->"FG,HJ,KL"

string_split("ABCD,FG,HJ,KL",",")-->[vector "ABCD","FG","HJ","KL"];
string_split("Rini Jain"," ")-->[vector "Rini","Jain"];
first_name=string_split(in.FULLNAME," ")[0];
last_name=string_split(in.FULLNAME," ")[1];

string(16) str = "abc";

string(16)[2] str = [vector "abc","def"];

string_rindex(s,"n")--->9

string_filter("ABC","ABC")-->ABC
string_filter("AxByCz","ABC")-->ABC

string_like("abcdef","abc%")-->1
string_like("abcdef","abc_")-->0
string_like("abcdef","abc_ef")-->1
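
The three results above follow from SQL LIKE-style matching, where `%` matches any run of characters and `_` matches exactly one. The Python sketch below emulates that with a regular expression. It assumes the whole string must match the pattern, and it does not handle escape characters.

```python
import re

def string_like(s, pattern):
    """Return 1 if s matches the LIKE-style pattern, else 0."""
    regex = "".join(
        ".*" if ch == "%" else "." if ch == "_" else re.escape(ch)
        for ch in pattern
    )
    return 1 if re.fullmatch(regex, s, flags=re.DOTALL) else 0

prefix_match = string_like("abcdef", "abc%")
```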

decimal(",") phone = "9870651233";

decimal(",")[2] phone = [vector "5432167898","1234567654"];

type my_rec=
record
string(",") s;
decimal(",") d;
end;

my_rec r=[record s "abc" d 3000];

====================================
out::function(c,n)=
begin

end;
dt is the same record type as lookup file
lookup("dept",in.key);
========================================
while(i<9)
begin
end
===================
integer(4) i=0;
for(i,i<4)
begin
end
=====================
integer(4) i = 2000;
string(16) s = (string(16))(i);

date("YYYY-DD-MM") dt = "2000-02-03";
date("MMDDYYYY") dt1 = (date("MMDDYYYY"))(dt);
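
For comparison, the same two conversions can be written in Python. This is an analogy only: it assumes the source date is in year-day-month order as declared above, and the mapping from DML format strings to `strptime` codes is an illustrative assumption.

```python
from datetime import datetime

def convert_date(text, src_fmt, dst_fmt):
    """Parse text using src_fmt and render it again using dst_fmt."""
    return datetime.strptime(text, src_fmt).strftime(dst_fmt)

# Integer to string, like (string(16))(i).
s = str(2000)

# Year-day-month text to month-day-year text, like the date cast above.
dt1 = convert_date("2000-02-03", "%Y-%d-%m", "%m%d%Y")
```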
======================================
Topics covered: record types; syntax of functions, loops, and type conversion;
vector basics; return value of the lookup function.
==========================================
string(",") FULLNAME="Shri Ganeshaya Namah";
string(",")[3] s=string_split(FULLNAME," ");
s[0]="Shri"
s[1]="Ganeshaya"
s[2]="Namah"

firstName=string_split(FULLNAME," ")[0];
middleName=string_split(FULLNAME," ")[1];
lastName=string_split(FULLNAME," ")[2];

out::reformat(in) =
begin
let string("|") s ="";
let integer(4) i =0;
let integer(4) c =lookup_count("DEPT",in.dn);

for(i,i<c)
begin
s=string_concat(s,"|",lookup_next("DEPT").dname);
end

out.id :: in.id;
out.name :: in.name;
out.dn :: in.dn;
out.dname :: lookup("DEPT",in.dn).dname;
out.lkp_count :: c;
out.lkp_s :: s;
end;
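
The transform above counts the lookup matches for a key and concatenates a field from each match. A Python emulation of that logic follows. The dictionary index stands in for the lookup file, and the sample departments are hypothetical.

```python
from collections import defaultdict

def build_lookup(recs, key):
    """Index lookup records by key so that all matches can be retrieved."""
    index = defaultdict(list)
    for rec in recs:
        index[rec[key]].append(rec)
    return index

def reformat(rec, dept_index):
    """Mirror the transform: count matches and join their dname values."""
    matches = dept_index.get(rec["dn"], [])
    count = len(matches)              # plays the role of lookup_count
    s = ""
    for match in matches:             # plays the role of repeated lookup_next
        s = s + "|" + match["dname"]
    return {
        "id": rec["id"],
        "name": rec["name"],
        "dn": rec["dn"],
        "dname": matches[0]["dname"] if matches else None,
        "lkp_count": count,
        "lkp_s": s,
    }

# Hypothetical lookup data with two departments sharing dn 10.
dept = build_lookup(
    [{"dn": 10, "dname": "HR"}, {"dn": 10, "dname": "Payroll"}, {"dn": 20, "dname": "IT"}],
    "dn",
)
row = reformat({"id": 1, "name": "a", "dn": 10}, dept)
```

Because `s` starts empty and each step prepends the separator, the joined string begins with "|", just as it would in the transform as written.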

Data warehouse class

SQL
Unix
