Project

This document discusses developing an end-to-end neural network approach for captioning complex images. The core problem areas are computer vision and generating detailed descriptions of entire scenes from images. Existing methods struggle with detecting multiple overlapping objects and labeling complex, dense images. The proposed approach uses a convolutional network to localize regions of interest, with a recurrent neural network language model to generate natural language captions describing each region and the full scene. The goal is an architecture that jointly performs localization and generation of descriptive label sequences for complex images.

Uploaded by

Shreyas Kash Prince

AN END-TO-END TRAINING BASED APPROACH FOR CAPTIONING COMPLEX IMAGES USING NEURAL NETWORKS

Shreyas V Kashyap - 1BG13CS097

Swaroop Gupta B.A - 1BG13CS113
Spandana Rao B.A - 1BG13CS103

Under the Guidance of

Smt. Jebah Jaykumar

AREA OF RESEARCH

We have chosen our area of research to be a combination of the following research-intensive areas:

Machine Learning

Image Processing

CORE AREA OF PROBLEM

Our core problem area is

Computer Vision

The aim is to program a computer to "understand" a scene or the features in an image.

Computer vision is concerned with the automatic extraction, analysis, and understanding of useful information from a single image or a sequence of images.

RELEVANCE OF THE PROBLEM

Object detection

Existing state-of-the-art methods can detect a single object or multiple non-overlapping objects.

This makes such detection of little use for analysing an entire scene.

Object labelling

Existing methods label only a single prominent object in an image.

They are unable to describe the scene in a complex, dense, detailed image.
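The notion of "overlapping objects" above can be made concrete with a small sketch. The slides do not name an overlap measure; the example below assumes intersection over union (IoU), a commonly used measure of how much two bounding boxes overlap. The `Box` type and the coordinates are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned bounding box with corners (x1, y1) and (x2, y2)."""
    x1: float
    y1: float
    x2: float
    y2: float


def area(b: Box) -> float:
    # A box with non-positive width or height has zero area.
    return max(0.0, b.x2 - b.x1) * max(0.0, b.y2 - b.y1)


def iou(a: Box, b: Box) -> float:
    """Intersection over union: 0.0 for disjoint boxes, 1.0 for identical ones."""
    inter = Box(max(a.x1, b.x1), max(a.y1, b.y1),
                min(a.x2, b.x2), min(a.y2, b.y2))
    inter_area = area(inter)
    union = area(a) + area(b) - inter_area
    return inter_area / union if union > 0 else 0.0


# Hypothetical boxes: two that overlap partially and one far away.
dog = Box(0, 0, 2, 2)
frisbee = Box(1, 1, 3, 3)
tree = Box(10, 10, 12, 12)

print(iou(dog, frisbee))  # greater than 0: the objects overlap
print(iou(dog, tree))     # 0.0: the objects do not overlap
```

A scene in which many pairs of boxes have a non-zero IoU is the dense, overlapping case that the slides describe as difficult for existing methods.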

APPLICATION OF THE PROBLEM

The problem currently persists in the following applications of computer vision:

Image search

Existing image search relies on the file name of an image rather than on the details of the scene.
Search should instead be based on what is actually in the image, not on the file name.

Image scene detection

Existing image detection picks out a single prominent part of the image and cannot handle variations in viewpoint.
Instead, multiple objects need to be detected and the whole scene has to be described.
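As a rough illustration of searching on image content rather than file names, the sketch below builds a simple inverted index over region captions. It assumes captions have already been generated for each image; the file names, the captions, and the helpers `build_index` and `search` are hypothetical and are not part of the proposed system.

```python
from collections import defaultdict
from typing import Dict, List, Set


def build_index(captions_by_image: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """Map each caption word to the set of images whose captions contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for image_id, captions in captions_by_image.items():
        for caption in captions:
            for word in caption.lower().split():
                index[word].add(image_id)
    return dict(index)


def search(index: Dict[str, Set[str]], query: str) -> Set[str]:
    """Return the images whose captions contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


# Hypothetical captions; the file names say nothing about the content.
captions_by_image = {
    "IMG_0001.jpg": ["a dog lying on grass", "a red frisbee"],
    "IMG_0002.jpg": ["a man riding a bicycle", "a dog on a leash"],
}

index = build_index(captions_by_image)
print(sorted(search(index, "dog")))          # both images mention a dog
print(sorted(search(index, "red frisbee")))  # only the first image
```

The query matches on words in the scene descriptions, so an image can be found even when its file name is uninformative.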

PROBLEM STATEMENT

Our ability to effortlessly describe all aspects of an image relies on a strong semantic understanding of a visual scene and all of its elements. However, despite numerous potential applications, this ability remains a challenge for state-of-the-art visual recognition systems.

Our goal is to design an architecture that jointly localizes regions of interest and describes each with natural language.

The architecture is composed of a convolutional network, an efficient dense localization layer, and a recurrent neural network language model that generates the label sequences for complex images.
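The data flow through the three components can be sketched as follows. This is only an interface sketch under stated assumptions: `caption_image`, `RegionCaption`, and the `toy_*` functions are hypothetical stand-ins, and none of them is a real convolutional network, localization layer, or recurrent language model. The sketch shows how the stages compose, not how they are trained.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]        # (x1, y1, x2, y2) in pixels
Features = List[float]
Image = Sequence[Sequence[float]]      # rows of pixel intensities in [0, 1]


@dataclass
class RegionCaption:
    box: Box
    caption: str


def caption_image(
    image: Image,
    encode: Callable[[Image], Features],
    localize: Callable[[Features], List[Tuple[Box, Features]]],
    describe: Callable[[Features], str],
) -> List[RegionCaption]:
    """Compose the three stages: encode, localize regions, describe each region."""
    feature_map = encode(image)            # role of the convolutional network
    regions = localize(feature_map)        # role of the dense localization layer
    return [RegionCaption(box, describe(feats))  # role of the language model
            for box, feats in regions]


# Toy stand-ins (assumptions for illustration only, not real models).
def toy_encode(image: Image) -> Features:
    # Flatten the image rows into a single feature list.
    return [float(p) for row in image for p in row]


def toy_localize(features: Features) -> List[Tuple[Box, Features]]:
    # Split the features into two fixed regions: top row and bottom row.
    half = len(features) // 2
    return [((0, 0, 2, 1), features[:half]), ((0, 1, 2, 2), features[half:])]


def toy_describe(region_features: Features) -> str:
    # Label a region by its mean intensity.
    mean = sum(region_features) / max(len(region_features), 1)
    return "bright region" if mean > 0.5 else "dark region"


image = [[0.9, 0.8], [0.1, 0.2]]
results = caption_image(image, toy_encode, toy_localize, toy_describe)
for r in results:
    print(r.box, r.caption)
```

Passing the stages in as functions keeps the composition explicit. In an end-to-end trained system the three stages would be differentiable modules optimized jointly rather than independent functions.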

INPUT / OUTPUT EXAMPLE

Sample input:

Output of existing system:

Expected output:

THANK YOU
