ChatGPT: A Technical
Perspective
Presented by TeamX
Introduction to Large Language
Models (LLMs)
1. What are LLMs?
Large Language Models (LLMs) are AI systems trained on
vast amounts of text data to understand and generate
human-like language.
2. Why are they important?
They power applications like chatbots, search engines, and
code generation tools.
3. Popular LLMs:
GPT-4 (OpenAI), Gemini (Google), Claude (Anthropic), LLaMA
(Meta), Falcon (TII).
Working and Training Processes
of an LLM
1. Key Components of LLMs:
Data: Text from books, websites, etc.
Architecture: Transformer model processes and understands text.
Training: Learns patterns using deep learning.
2. How LLMs Work:
- Uses tokens to process text.
- Context vector: keeps track of conversation flow.
- Self-attention: helps understand word relationships.
3. Training Processes:
- Pre-training: Learns general language.
- Fine-tuning: Optimized for specific tasks.
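The token-based processing above can be sketched with a toy whitespace tokenizer. Real LLMs use learned subword vocabularies (e.g. byte-pair encoding); the names `build_vocab` and `encode` here are illustrative, not a real API:

```python
# Toy whitespace tokenizer: text -> token ids. Real LLMs use learned
# subword vocabularies (e.g. BPE), but the text -> id pipeline is similar.

def build_vocab(corpus):
    """Assign an integer id to every distinct whitespace token."""
    tokens = sorted({tok for line in corpus for tok in line.split()})
    return {tok: i for i, tok in enumerate(tokens)}

def encode(text, vocab):
    """Map text to a list of token ids; unknown tokens get id -1."""
    return [vocab.get(tok, -1) for tok in text.split()]

vocab = build_vocab(["the cat sat", "the dog sat"])
ids = encode("the cat sat", vocab)
```

The model then learns patterns over these id sequences rather than raw characters.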
Popular LLM
Architectures
1. GPT (Generative Pretrained Transformer)
- Autoregressive (predicts next word based on previous context).
- Used in ChatGPT, Copilot, Jasper AI.
2. BERT (Bidirectional Encoder Representations from
Transformers)
- Trained bidirectionally (looks at both left and right context).
- Used for search engines, question answering, and text classification.
3. T5 (Text-to-Text Transfer Transformer)
- Converts all tasks into text-to-text format.
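GPT's autoregressive loop (predict the next token, append it, repeat) can be illustrated with a toy bigram table standing in for the neural network's next-token distribution. This is a sketch of the decoding loop only, not of a real model:

```python
from collections import Counter, defaultdict

# Toy autoregressive generation: a bigram count table replaces the
# transformer's learned next-token distribution. GPT-style models run
# the same loop, but predict with a neural network instead of counts.

def train_bigrams(corpus):
    table = defaultdict(Counter)
    for line in corpus:
        toks = line.split()
        for prev, nxt in zip(toks, toks[1:]):
            table[prev][nxt] += 1
    return table

def generate(table, start, steps):
    out = [start]
    for _ in range(steps):
        best = table[out[-1]].most_common(1)  # greedy: most likely next token
        if not best:
            break
        out.append(best[0][0])
    return out

table = train_bigrams(["the cat sat on a mat"])
seq = generate(table, "the", 3)
```

Each generated token is fed back as context for the next prediction, which is exactly what "predicts next word based on previous context" means above.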
Transformer Architecture
Overview
1. Introduction to Transformers
- Introduced in the paper Attention Is All You Need (2017).
- Revolutionized NLP by replacing recurrent networks (RNNs, LSTMs).
- Uses self-attention to process entire input sequences in parallel.
2. Key Features
- Parallelization: Faster training compared to RNNs.
- Scalability: Works well for large datasets.
- Context Awareness: Better understanding of long-range
dependencies.
Transformer Components
1. Input Embedding & Positional Encoding
- Converts words into high-dimensional vectors.
- Positional encoding adds word order information.
2. Multi-Head Self-Attention Mechanism
- Calculates relationships between all words in a sequence.
- Multiple attention heads capture different aspects of meaning.
- Helps in context understanding and relevance.
3. Feedforward Neural Networks & Layer Normalization
- Applies transformations after attention layers.
- Ensures stability with layer normalization.
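The positional encoding in point 1 can be written out directly. This follows the sinusoidal scheme from "Attention Is All You Need"; a minimal NumPy sketch:

```python
import numpy as np

# Sinusoidal positional encoding ("Attention Is All You Need"):
#   PE[pos, 2i]   = sin(pos / 10000^(2i/d_model))
#   PE[pos, 2i+1] = cos(pos / 10000^(2i/d_model))
# Each position gets a unique vector, added to the word embedding.

def positional_encoding(seq_len, d_model):
    pos = np.arange(seq_len)[:, None]        # (seq_len, 1)
    i = np.arange(d_model // 2)[None, :]     # (1, d_model/2)
    angles = pos / np.power(10000.0, (2 * i) / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)             # even dimensions
    pe[:, 1::2] = np.cos(angles)             # odd dimensions
    return pe

pe = positional_encoding(seq_len=50, d_model=16)
```

Because attention itself is order-agnostic, these added vectors are what tell the model where each word sits in the sequence.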
Attention Mechanism in
Transformers
1. Scaled Dot-Product Attention
- Computes attention scores using query (Q), key (K), and value (V) vectors.
- Equation: Attention(Q, K, V) = softmax(QKᵀ / √d_k) V
- Determines how much focus each word should have on others in a
sequence.
2. Multi-Head Attention
- Uses multiple attention heads to capture different types of relationships.
- Each head learns unique attention patterns for better understanding of
context.
3. Self-Attention Process
- Each word in a sequence attends to every other word, creating context-
rich embeddings.
- Enables capturing of long-range dependencies that RNNs struggle with.
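The scaled dot-product attention equation above translates almost line-for-line into NumPy; a minimal single-head sketch (multi-head attention runs several of these in parallel with separate projections):

```python
import numpy as np

# Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
# Shapes: Q (n, d_k), K (m, d_k), V (m, d_v).

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # how relevant each key is to each query
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ V, weights

rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(4, 8)), rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
out, w = attention(Q, K, V)
```

The √d_k scaling keeps the dot products from growing with dimension, which would otherwise push the softmax into regions with vanishing gradients.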
Issue – Hallucination and Reliability
What is Hallucination in LLMs?
- LLMs generate false, misleading, or fabricated information that appears factual.
- Occurs due to pattern-matching without real-world verification.
Examples:
- Generating fake references in research papers.
- Producing incorrect legal case rulings.
- Fabricating product details in customer support.
Why Does It Happen?
- Lack of external fact-checking.
- Overgeneralization from training data.
- No real-world context awareness.
Impact:
- Trust Issues: Users may lose confidence in AI-generated content.
- Misinformation Spread: Critical in healthcare, law, and finance.
- AI Safety Risks: Potential for misleading or biased responses.
Mitigation Efforts:
- Fact-checking integrations with external databases.
- Prompt engineering to guide responses.
- Human-in-the-loop validation for sensitive applications.
Issue – Data Bias & Ethical
Concerns
What is Data Bias?
- LLMs inherit biases from training data, leading to unfair outputs.
- AI may reinforce historical and social inequalities.
Types of Bias in LLMs:
- Gender Bias – AI-generated job ads preferring men for tech roles.
- Racial Bias – Discriminatory content in law enforcement predictions.
- Political Bias – Favoring specific viewpoints in discussions.
Ethical Concerns:
- Fairness & Accountability – Who is responsible for AI decisions?
- Harmful Stereotypes – Reinforces discrimination.
- Regulatory Issues – Aligning AI with ethical guidelines.
Mitigation Efforts:
- Diverse training datasets to reduce bias.
- Bias detection tools to audit AI outputs.
- Ethical AI principles to guide model development.
Solution – Reinforcement
Learning and Hybrid Models
1. Reinforcement Learning from Human Feedback (RLHF)
- Uses human reviewers to refine model behavior.
- Reduces harmful or unethical outputs.
- Enhances alignment with human values and fairness.
2. Hybrid Models with Retrieval Augmentation
- Combining LLMs with search engines or databases for factual accuracy.
- Minimizes hallucination risks by verifying information.
- Reduces dependency on purely generated text.
3. Transfer Learning for Efficiency
- Using smaller, pre-trained models to cut computational costs.
- Increases efficiency and accessibility of AI technology.
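The retrieval-augmentation idea in point 2 can be sketched with a toy keyword-overlap retriever. Production systems use vector embeddings and a real index; `retrieve` and `build_prompt` are illustrative names, not a real API:

```python
# Toy retrieval augmentation: pick the document with the highest word
# overlap with the query and prepend it to the prompt, so the model
# answers from retrieved text instead of purely generated text.

DOCS = [
    "The Transformer architecture was introduced in 2017.",
    "RLHF uses human feedback to align model behavior.",
]

def retrieve(query, docs):
    q = set(query.lower().split())
    return max(docs, key=lambda d: len(q & set(d.lower().split())))

def build_prompt(query, docs):
    context = retrieve(query, docs)
    return f"Context: {context}\nQuestion: {query}\nAnswer:"

prompt = build_prompt("When was the Transformer introduced?", DOCS)
```

Grounding the prompt in retrieved documents is what reduces hallucination: the model is asked to answer from verifiable text rather than from memory alone.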
Where Do Current LLMs Fail?
1. Energy & Computational Costs
- Training state-of-the-art LLMs is expensive and unsustainable.
- Example: GPT-3 required thousands of GPUs for weeks.
2. Limited Multimodal Understanding
- Struggles with deep cross-modal reasoning.
- Example: Text-to-image models lack fine-grained scene comprehension.
3. Long-Term Memory Deficiency
- Cannot retain context across sessions.
- Current solutions (e.g., embeddings) are inefficient.
4. Lack of Real-Time Adaptability
- Struggles with dynamic updates in fast-changing contexts.
- Example: Difficulty in real-time document editing with multiple AI agents.
Future Work Directions – Key Innovations
1. Collaborative Multi-Agent AI
- Enable seamless interaction between multiple LLMs.
- Example: AI-powered collaborative coding and content creation.
2. Enhanced Multimodal Reasoning
- Enable seamless fusion of text, images, video, and audio.
- Example: AI understanding spoken instructions while analyzing visual content.
3. Persistent & Context-Aware Memory
- Implement memory mechanisms for long-term interactions.
- Example: AI remembering user preferences over months.
4. Real-Time Model Adaptation
- Enable AI to refine outputs dynamically based on user feedback.
- Example: AI adjusting its writing style in response to real-time corrections.
Efficient & Scalable Architectures
1. Sparse & Modular Architectures
- Use mixture-of-experts to optimize processing power.
- Example: Activating only relevant parts of a model for specific tasks.
2. Quantization & Pruning
- Reduce model size while maintaining performance.
- Example: Deploying AI efficiently on edge devices like smartphones.
3. Decentralized & Adaptive Training
- Distribute training across multiple smaller systems.
- Example: Federated learning to enhance scalability without massive compute resources.
4. Hybrid AI Systems
- Combine symbolic reasoning with neural models for better reliability.
- Example: AI-assisted medical diagnosis integrating expert rule-based logic.
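The quantization idea in point 2 can be shown concretely. A minimal sketch of symmetric int8 quantization with a single scale factor; real frameworks add per-channel scales, zero points, and calibration:

```python
import numpy as np

# Toy symmetric int8 quantization: map float weights to 8-bit integers
# with one scale factor, then dequantize for use. Storage drops 4x
# versus float32 at the cost of small rounding error.

def quantize_int8(w):
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 0.9], dtype=np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)          # reconstruction, close to w
```

The rounding error per weight is at most half the scale, which is why aggressive quantization can shrink models for edge deployment with little accuracy loss.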
Conclusion
1. Key Challenges
- High computational costs and environmental concerns.
- Lack of persistent memory and real-time adaptability.
- Limitations in multimodal understanding and collaboration.
2. Future Directions
- Development of energy-efficient architectures (e.g., sparse models, quantization).
- Enhancing memory retention for long-term contextual awareness.
- Seamless multimodal integration to improve cross-domain reasoning.
- More interactive, scalable, and adaptable AI systems.
3. Impact of Next-Gen LLMs
- AI will become more cost-effective and environmentally sustainable.
- Persistent memory will improve user experience in long-term interactions.
- Multimodal reasoning will enable AI to understand and process complex real-world scenarios.
- Adaptive AI systems will enhance real-time interactivity across applications.
TeamX Members
• Sayantan Choudhury –
2251231
• Rahul Mondal – 2251230
• Uddipto Jana – 2251232
• Gourav Dey – 2251219