Module 1: Introduction to Ab Initio
What is Ab Initio?
Ab Initio architecture overview
Components of Ab Initio
Ab Initio vs other ETL tools
Ab Initio GDE (Graphical Development Environment)
Introduction to Co>Operating System
Module 2: Ab Initio Architecture & Components
GDE (Graphical Development Environment)
Co>Operating System
EME (Enterprise Meta Environment)
Data Profiler
Conduct>It (now called Control Center)
Component Library Overview
Module 3: Working with GDE
Installing and navigating GDE
Creating, saving, and running graphs
Port configuration and metadata
Record formats
Parameterization
Basic components: Input File, Output File, Filter, Sort, Join
Module 4: File Handling in Ab Initio
File formats: .dat, .dml, .out
Working with flat files
Multi-file systems (MFS)
Partitioning and De-partitioning techniques
Reformatting and Rollup components
Module 5: Transformations
Lookup, Join, Merge
Rollup, Scan, Normalize, Denormalize
Aggregation techniques
Working with expressions and variables
Using Functions and Conditional Statements
Module 6: Advanced Components
Input Table, Output Table (Database components)
Run Program
Redefine Format
Broadcast, Round Robin
Gather, Merge, Interleave
Replicate, Partition by Key
Lookup File and Lookup Table
Module 7: Error Handling & Debugging
Error trapping and handling strategies
Using reject ports
Debugging graphs
Log and trace files
Component-level error handling
Module 8: Database Integration
Connecting to databases (Oracle, Teradata, etc.)
Using SQL components
Database input/output
Working with truncate, insert, update
Module 9: Meta Programming & Parameterization
What is meta programming?
Creating reusable graphs
Using parameters and variables
Graph level vs component level parameters
Module 10: Performance Tuning
Performance bottlenecks
Optimizing graphs
Best practices for high-volume data
Partitioning strategies for performance
Module 11: EME and Version Control
Introduction to EME
Import/export graphs to/from EME
Check-in/check-out
Versioning and metadata management
Locking and unlocking
Module 12: Conduct>It / Control Center
Job scheduling concepts
Creating and managing plans
Dependency management
Monitoring and alerts
Module 13: Real-time Scenarios & Project Work
Design sample ETL processes
Implement data pipelines with multiple stages
Error handling and logging setup
Mini project or case study with end-to-end flow
Bonus Topics (Optional/Advanced)
Continuous flows (for real-time)
Integration with Hadoop or Big Data platforms
Ab Initio with Cloud platforms
Scripting with UNIX shell for Ab Initio jobs