0% found this document useful (0 votes)

6 views4 pages

Deep Learning

Deep learning

Uploaded by

sanjeeva jayasuriya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views4 pages

Deep Learning

Deep learning

Uploaded by

sanjeeva jayasuriya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Deep learning

Deep learning employs artificial neural networks with many layers—hence “deep”—rather than
the explicitly designed algorithms of traditional machine learning. Though neural networks were
introduced early in the history of machine learning, it wasn’t until the late 2000s and early 2010s,
enabled in part by advancements in GPUs, that they became dominant in most subfields of AI.

Loosely inspired by the human brain, neural networks comprise interconnected layers of
“neurons” (or nodes), each of which performs its own mathematical operation (called an
“activation function”). The output of each node’s activation function serves as input to each of
the nodes of the following layer and so on until the final layer, where the network’s final output
is computed. Crucially, the activation functions performed at each node are nonlinear, enabling
neural networks to model complex patterns and dependencies.

Each connection between two neurons is assigned a unique weight: a multiplier that increases or
decreases one neuron’s contribution to a neuron in the following layer. These weights, along
with bias terms between layers, are the parameters to be optimized through machine learning.

The backpropagation algorithm enables the computation of how each individual node contributes
to the overall output of the loss function, allowing even millions or billions of model weights to
be individually optimized through gradient descent algorithms. Because of the volume and
granularity of updates required to achieve optimal results, deep learning requires very large
amounts of data and computational resources compared to traditional ML.

That distributed structure affords deep learning models their incredible power and versatility.
Imagine training data as data points scattered on a 2-dimensional graph. Essentially, traditional
machine learning aims to find a single curve that runs through every one of those data points;
deep learning pieces together an arbitrary number of smaller, individually adjustable lines to
form the desired shape. Neural networks are universal approximators: it has been theoretically
proven that for any function, there exists a neural network arrangement that can reproduce it.

Having said that, just because something is theoretically possible doesn’t mean it’s practically
achievable through existing training methods. For many years, adequate performance on certain
tasks remained out of reach even for deep learning models—but over time, modifications to the
standard neural network architecture have unlocked new capabilities for ML models.

Convolutional neural networks (CNNs)

Convolutional neural networks (CNNs) add convolutional layers to neural networks. In

mathematics, a convolution is an operation where one function modifies (or convolves) the shape
of another. In CNNs, convolutional layers are used to extract important features from data
by applying weighted “filters”. CNNs are primarily associated with computer vision models and
image data, but have a number of other important use cases.

Recurrent neural networks (RNNs)

Recurrent neural networks (RNNs) are designed to work on sequential data. Whereas
conventional feedforward neural networks map a single input to a single output, RNNs map
a sequence of inputs to an output by operating in a recurrent loop in which the output for a given
step in the input sequence serves as input to the computation for the following step. In effect this
creates an internal “memory,” called the hidden state, that allows RNNs to understand context
and order.

Transformers

Transformer models, first introduced in 2017, are largely responsible for the advent of LLMs and
other pillars of generative AI, achieving state-of-the-art results across most subdomains of
machine learning. Like RNNs, transformers are ostensibly designed for sequential data, but
clever workarounds have enabled most data modalities to be processed by transformers. The
unique strength of transformer models comes from their innovative attention mechanism, which
enables the models to selectively focus on the parts of the input data most relevant at a specific
moment in a sequence.

Mamba models

Mamba models are a relatively new neural network architecture, first introduced in 2023, based
on a unique variation of state space models (SSMs). Like transformers, Mamba models provide
an innovative means of selectively prioritizing the most relevant information at a given moment.
Mamba has recently emerged as a rival to the transformer architecture, particularly for LLMs.

Machine learning use cases

Most applications of machine learning fall into one or more of the following categories, which
are defined primarily by their use cases and the data modalities they operate upon.

Computer vision

Computer vision is the subdomain of AI concerned with image data, video data other data
modalities that require a model or machine to “see,” from healthcare diagnostics to facial
recognition to self-driving cars. Notable subfields of computer vision include image
classification, object detection, image segmentation and optical character recognition (OCR).
Natural language processing (NLP)

The field of natural language processing (NLP) spans a diverse array of tasks concerning text,
speech and other language data. Notable subdomains of NLP include chatbots, speech
recognition, language translation, sentiment analysis, text generation, summarization and AI
agents. In modern NLP, large language models continue to advance the state of the art at an
unprecedented pace.

Time series analysis

Time series models are applied anomaly detection, market analysis and related pattern
recognition or prediction tasks. They use machine learning on historical data for a variety of
forecasting use cases.

Image generation

Diffusion models, variational autoencoders (VAEs) and generative adversarial networks

(GANs) can be used to generate original images that apply pixel patterns learned from training
data.

Machine learning operations (MLOps)

Machine learning operations (MLOps) is a set of practices for implementing an assembly line
approach to building, deploying and maintaining machine learning models.

Careful curation and preprocessing of training data, as well as appropriate model selection, are
crucial steps in the MLOps pipeline. Thoughtful post-training validation, from the design of
benchmark datasets to the prioritization of particular performance metrics, is necessary to ensure
that a model generalizes well (and isn’t just overfitting the training data).

Following deployment, models must be monitored for model drift, inference efficiency issues
and other adverse developments. A well-defined practice of model governance is essential to
continued efficacy, especially in regulated or fast-changing industries.

Machine learning libraries

A number of open source tools, libraries and frameworks exist for building, training and testing
machine learning projects. While such libraries offer an array of pre-configured modules and
abstractions to streamline the process of building ML-based models and workflows, practitioners
will need to familiarize themselves with commonly used programming languages—
particularly Python—to make full use of them.
Prominent open source libraries, particularly for building deep learning models,
include PyTorch, TensorFlow, Keras and the Hugging Face Transformers library.

Notable open source machine learning libraries and toolkits focused on traditional ML include
Pandas, Scikit-learn, XGBoost, Matplotlib, SciPy and NumPy among many others.

IBM itself maintains and updates a significant library of tutorials for beginners and advanced ML
practitioners alike.

NVIDIA GEN AI Cheat Sheet
No ratings yet
NVIDIA GEN AI Cheat Sheet
97 pages
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
No ratings yet
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
23 pages
Deep Learning Lab Manual
100% (1)
Deep Learning Lab Manual
19 pages
Fundamental Components of AI
No ratings yet
Fundamental Components of AI
5 pages
Fundamentals of ML and AI - AWS
No ratings yet
Fundamentals of ML and AI - AWS
16 pages
Deep Learning UNIT 5
No ratings yet
Deep Learning UNIT 5
182 pages
Deep Learning
No ratings yet
Deep Learning
50 pages
TensorFlow Regression
No ratings yet
TensorFlow Regression
445 pages
Unveiling Deep Learning A Beginners Guide
No ratings yet
Unveiling Deep Learning A Beginners Guide
10 pages
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-1
No ratings yet
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-1
24 pages
Fundamentals of AI Hide01.Ir
No ratings yet
Fundamentals of AI Hide01.Ir
114 pages
Reinforcement Learning: B.Tech., Last Year, Semester-Viii
No ratings yet
Reinforcement Learning: B.Tech., Last Year, Semester-Viii
49 pages
Deep Learning Fundamentals
No ratings yet
Deep Learning Fundamentals
19 pages
Module 3
No ratings yet
Module 3
97 pages
DL Unit 1
No ratings yet
DL Unit 1
199 pages
Unit-I Deep Learning Techniques
No ratings yet
Unit-I Deep Learning Techniques
20 pages
DL Unit I & II
No ratings yet
DL Unit I & II
51 pages
Lecture 2
No ratings yet
Lecture 2
71 pages
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
No ratings yet
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
15 pages
Deep Learning
No ratings yet
Deep Learning
7 pages
Deep Learning
100% (3)
Deep Learning
32 pages
DLT Unit 1
No ratings yet
DLT Unit 1
4 pages
Ai Edx
No ratings yet
Ai Edx
9 pages
Deep Learning Unit-II
No ratings yet
Deep Learning Unit-II
19 pages
UNIT I Part 1 Notes
No ratings yet
UNIT I Part 1 Notes
28 pages
What Is Artificial Intelligence
No ratings yet
What Is Artificial Intelligence
3 pages
Unit 4 Notes New
No ratings yet
Unit 4 Notes New
49 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
4 pages
Unit I
No ratings yet
Unit I
48 pages
Deep Learning
No ratings yet
Deep Learning
20 pages
Main Dataset
No ratings yet
Main Dataset
300 pages
ET Assign Deep Learning
No ratings yet
ET Assign Deep Learning
3 pages
DL Unit 1
No ratings yet
DL Unit 1
200 pages
CP4252 ML Unit - V
No ratings yet
CP4252 ML Unit - V
17 pages
Unit 3
No ratings yet
Unit 3
29 pages
Lecture 2 Deep Learning
No ratings yet
Lecture 2 Deep Learning
24 pages
Data Science Vs Machine Learning and Artificial Intelligence
No ratings yet
Data Science Vs Machine Learning and Artificial Intelligence
12 pages
PP&DS 5
No ratings yet
PP&DS 5
31 pages
ML Unit 4
No ratings yet
ML Unit 4
16 pages
UNIT - 5 Lecture 2
No ratings yet
UNIT - 5 Lecture 2
26 pages
Deep Learning
No ratings yet
Deep Learning
22 pages
Deep Learning Trial Lecture
No ratings yet
Deep Learning Trial Lecture
12 pages
Machine Learning Semester Paper
No ratings yet
Machine Learning Semester Paper
31 pages
Deep Learning File
No ratings yet
Deep Learning File
58 pages
Deep Learning File
No ratings yet
Deep Learning File
60 pages
Deep Learning for Beginners
No ratings yet
Deep Learning for Beginners
28 pages
Advancements and Applications of Deep Learning
No ratings yet
Advancements and Applications of Deep Learning
4 pages
Expanded Deep Learning Document-1
No ratings yet
Expanded Deep Learning Document-1
11 pages
Will AI Replace Some Tedious Job
No ratings yet
Will AI Replace Some Tedious Job
3 pages
Mlnov 2024
No ratings yet
Mlnov 2024
2 pages
DeepLearning Introduction
No ratings yet
DeepLearning Introduction
19 pages
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-2
No ratings yet
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-2
12 pages
What Is The Difference Between Machine Learning and Deep Learning
No ratings yet
What Is The Difference Between Machine Learning and Deep Learning
3 pages
AI & Machine Learning Essentials
No ratings yet
AI & Machine Learning Essentials
44 pages
Nueral Network Mcqs
No ratings yet
Nueral Network Mcqs
6 pages
Report On Neural Networks
No ratings yet
Report On Neural Networks
15 pages
Unit - 1 Deep Learning Techniques
No ratings yet
Unit - 1 Deep Learning Techniques
18 pages
Machine Learning Overview & Applications
No ratings yet
Machine Learning Overview & Applications
8 pages
A Guide To Deep Learning and Neural Networks
No ratings yet
A Guide To Deep Learning and Neural Networks
15 pages
Reading+10+ +Introduction+to+Deep+Learning
No ratings yet
Reading+10+ +Introduction+to+Deep+Learning
21 pages
Ann Book
No ratings yet
Ann Book
16 pages
Unit I
No ratings yet
Unit I
10 pages
Introduction To Convolutional Neural Networks
No ratings yet
Introduction To Convolutional Neural Networks
4 pages
Artificial Intelligence (AI) vs. Machine Learning (ML)
No ratings yet
Artificial Intelligence (AI) vs. Machine Learning (ML)
5 pages
Cognitive Psychology History
No ratings yet
Cognitive Psychology History
8 pages
AI and Computer Vision
No ratings yet
AI and Computer Vision
6 pages
Backpropagation Algorithm
No ratings yet
Backpropagation Algorithm
3 pages
Btaic601
No ratings yet
Btaic601
2 pages
B.E Syllabus For DL
No ratings yet
B.E Syllabus For DL
4 pages
R20!63!20ITC27 Deep Learning Lab Manual (Minor Proj 2) Dr.K.ramu
No ratings yet
R20!63!20ITC27 Deep Learning Lab Manual (Minor Proj 2) Dr.K.ramu
47 pages
Machine Learning Insights Unveiled
No ratings yet
Machine Learning Insights Unveiled
18 pages
Question Text: Clear My Choice
0% (2)
Question Text: Clear My Choice
10 pages
Boltzmann Machine
No ratings yet
Boltzmann Machine
47 pages
Ai - Ds - Ad3501-Dl GMT 3 QP and Key
No ratings yet
Ai - Ds - Ad3501-Dl GMT 3 QP and Key
10 pages
Building Neural Networks - A Hands-On Journey From Scratch With Python - by Long Nguyen - Medium
No ratings yet
Building Neural Networks - A Hands-On Journey From Scratch With Python - by Long Nguyen - Medium
21 pages
Tech-Enhanced Language Teaching
No ratings yet
Tech-Enhanced Language Teaching
5 pages
Module 5
No ratings yet
Module 5
27 pages
Stanford Deep Learning Exam
No ratings yet
Stanford Deep Learning Exam
14 pages
AI Course Assignment Questions
No ratings yet
AI Course Assignment Questions
3 pages
What Is The History of AI
No ratings yet
What Is The History of AI
6 pages
TEC in Language Learning
No ratings yet
TEC in Language Learning
21 pages
Module 3 - Convolutional Neural Networks: History
No ratings yet
Module 3 - Convolutional Neural Networks: History
3 pages
Deep Learning for Data Scientists
No ratings yet
Deep Learning for Data Scientists
21 pages
Deep Learning Interview Prep Guide
No ratings yet
Deep Learning Interview Prep Guide
15 pages
chp3 Hebb Network
No ratings yet
chp3 Hebb Network
4 pages
RNN Basics
No ratings yet
RNN Basics
17 pages
7856
No ratings yet
7856
7 pages
4 DL Deep Neural Nets
No ratings yet
4 DL Deep Neural Nets
56 pages
Jurnal Sistem Pendeteksi Pejalan Kaki
No ratings yet
Jurnal Sistem Pendeteksi Pejalan Kaki
12 pages
AAFU Volume 49 Issue 6 Page 371 395
No ratings yet
AAFU Volume 49 Issue 6 Page 371 395
25 pages
ResNet & VGGNet Deep Learning Guide
No ratings yet
ResNet & VGGNet Deep Learning Guide
44 pages
Dropout Improves Recurrent Neural Networks For Handwriting Recognition
No ratings yet
Dropout Improves Recurrent Neural Networks For Handwriting Recognition
6 pages
TPLS, 14
No ratings yet
TPLS, 14
5 pages
Multilayer Perceptron Algorithm
No ratings yet
Multilayer Perceptron Algorithm
3 pages
BE Comp - Deep Learning
No ratings yet
BE Comp - Deep Learning
1 page
Deep Learning
No ratings yet
Deep Learning
1 page
Keras Guide for Deep Learning Beginners
No ratings yet
Keras Guide for Deep Learning Beginners
1 page

Deep Learning

Uploaded by

Deep Learning

Uploaded by

Deep learning

Convolutional neural networks (CNNs)

Convolutional neural networks (CNNs) add convolutional layers to neural networks. In

Recurrent neural networks (RNNs)

Machine learning use cases

Time series analysis

Diffusion models, variational autoencoders (VAEs) and generative adversarial networks

Machine learning operations (MLOps)

Machine learning libraries

You might also like