0% found this document useful (0 votes)

78 views8 pages

Project

This document discusses developing an end-to-end neural network approach for captioning complex images. The core problem areas are computer vision and generating detailed descriptions of entire scenes from images. Existing methods struggle with detecting multiple overlapping objects and labeling complex, dense images. The proposed approach uses a convolutional network to localize regions of interest, with a recurrent neural network language model to generate natural language captions describing each region and the full scene. The goal is an architecture that jointly performs localization and generation of descriptive label sequences for complex images.

Uploaded by

Shreyas Kash Prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

78 views8 pages

Project

Uploaded by

Shreyas Kash Prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 8

AN END-TO-END TRAINING BASED APPROACH FOR CAPTIONING

COMPLEX IMAGES USING NEURAL NETWORKS

Shreyas V Kashyap - 1BG13CS097

Swaroop Gupta B.A -1BG13CS113
Spandana Rao B.A - 1BG13CS103

Under the Guidance of

Smt.Jebah Jaykumar
AREA OF RESEARCH

We have hereby chose the area of research to be a combination of the following

research extensive areas.

Machine Learning

Image Processing
CORE AREA OF PROBLEM

Our core area of problem is

Computer Vision

Program a computer to "understand" a scene or features in an image.

Concerned with the automatic extraction, analysis and understanding of useful

information from a single image or a sequence of images
RELEVANCE OF THE PROBLEM

Object detection

Currently the existing state of the art methods can detect a single object or
multiple non overlapping objects

This makes the detection useless for any analysis of an entire scene

Object labelling

The labelling of single prominent object in an image

Unable to describe a scene in a complex dense detailed image

APPLICATION OF THE PROBLEM

Currently the problem persists in the following applications of Computer Vision

Image search

The existing image search works only on the file name of an image, and not on the
details of the scene
This should be overcome and the search should happen only on the basis of what is
there in the image, instead of the filename

Image scene detection

The existing Image detection, detects a single prominent part of the image and
cannot detect if there are variations of viewpoint.
This should be overcome and multiple objects need to be detected and the whole
scene has to be described.
PROBLEM STATEMENT

Our ability to effortlessly describe all aspects of an image relies on a strong semantic
understanding of a visual scene and all of its elements. However, despite numerous
potential applications, this ability remains a challenge for our state of the art visual
recognition systems

Our goal is to design an architecture that jointly localizes regions of interest and the
describes each with natural language

Architecture is composed of a Convolutional Network, an efficient dense localization

layer, and Recurrent Neural Network language model that generates the label
sequences for the complex images.
INPUT / OUTPUT EXAMPLE

Sample Input :

Output of existing System :

Output Expected :
THANK YOU

AI Image Captioning for CSE Students
No ratings yet
AI Image Captioning for CSE Students
17 pages
Image Summarizer: Seeing Through Machine Using Deep Learning Algorithm
No ratings yet
Image Summarizer: Seeing Through Machine Using Deep Learning Algorithm
7 pages
Implementation of Simple and Efficient P
No ratings yet
Implementation of Simple and Efficient P
8 pages
Minor
No ratings yet
Minor
14 pages
Generating Caption From Images Using Flickr Image Dataset
No ratings yet
Generating Caption From Images Using Flickr Image Dataset
7 pages
Complex Image Captioning Guide
No ratings yet
Complex Image Captioning Guide
1 page
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
No ratings yet
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
15 pages
Review 2
No ratings yet
Review 2
34 pages
Image Captioning Using CNN & RNN
No ratings yet
Image Captioning Using CNN & RNN
4 pages
Sample Project doc-REC
No ratings yet
Sample Project doc-REC
66 pages
Project Report
No ratings yet
Project Report
35 pages
Building A Voice Based Image Caption Generator With Deep Learning
No ratings yet
Building A Voice Based Image Caption Generator With Deep Learning
6 pages
Final - Done (1) 2.0
No ratings yet
Final - Done (1) 2.0
16 pages
Automatic Image Captioning Using Neural Networks
No ratings yet
Automatic Image Captioning Using Neural Networks
9 pages
Image To Caption Generator
No ratings yet
Image To Caption Generator
7 pages
Internship Report (Sanjay Final)
No ratings yet
Internship Report (Sanjay Final)
45 pages
CNN-Based Semantic Image Segmentation
No ratings yet
CNN-Based Semantic Image Segmentation
10 pages
Research Paper Final
No ratings yet
Research Paper Final
5 pages
Project Report
No ratings yet
Project Report
53 pages
Image Captioning Research Paper
No ratings yet
Image Captioning Research Paper
59 pages
Computer Vision Based Object Detection and Recognition System For Image Searching
No ratings yet
Computer Vision Based Object Detection and Recognition System For Image Searching
4 pages
Final Copy For Gireesh
No ratings yet
Final Copy For Gireesh
61 pages
Batch 17 Paper
No ratings yet
Batch 17 Paper
10 pages
Caption Credits
No ratings yet
Caption Credits
25 pages
Automatic Image Caption Generation System
No ratings yet
Automatic Image Caption Generation System
4 pages
Conference
No ratings yet
Conference
11 pages
DenseCap - Fully Convolutional Localization Networks For Dense Captioning
No ratings yet
DenseCap - Fully Convolutional Localization Networks For Dense Captioning
10 pages
Pami Im2Show and Tell: Lessons Learned From The 2015 MSCOCO Image Captioning Challenge
No ratings yet
Pami Im2Show and Tell: Lessons Learned From The 2015 MSCOCO Image Captioning Challenge
12 pages
Materials Today: Proceedings: K. Loganathan, R. Sarath Kumar, V. Nagaraj, Tegil J. John
No ratings yet
Materials Today: Proceedings: K. Loganathan, R. Sarath Kumar, V. Nagaraj, Tegil J. John
5 pages
Pblsynopsis
No ratings yet
Pblsynopsis
27 pages
ROHAN PRASAD FinalProjectReport - Rohan Gamer
No ratings yet
ROHAN PRASAD FinalProjectReport - Rohan Gamer
39 pages
CNN and RNN
No ratings yet
CNN and RNN
82 pages
Image Recognition System Project
No ratings yet
Image Recognition System Project
13 pages
Image Caption Generator Report
No ratings yet
Image Caption Generator Report
27 pages
Image Recognition System: Project Report
No ratings yet
Image Recognition System: Project Report
19 pages
10 1109icsccc 2018 8703316
No ratings yet
10 1109icsccc 2018 8703316
6 pages
BTP Report
No ratings yet
BTP Report
27 pages
Ijariie 26613
No ratings yet
Ijariie 26613
5 pages
Automated Neural Image Caption Generator For Visually Impaired People
No ratings yet
Automated Neural Image Caption Generator For Visually Impaired People
6 pages
Semantic Aware Scene Recognition
No ratings yet
Semantic Aware Scene Recognition
47 pages
Deep Learning for Image Captioning
No ratings yet
Deep Learning for Image Captioning
2 pages
Deep Learning for Image Captioning
No ratings yet
Deep Learning for Image Captioning
6 pages
Major Report 1
No ratings yet
Major Report 1
25 pages
FYP CSEB Batch37 First Review (Final)
No ratings yet
FYP CSEB Batch37 First Review (Final)
13 pages
Deep Learning Image Captioning
No ratings yet
Deep Learning Image Captioning
7 pages
Image Caption Generator Using CNN and LSTM
No ratings yet
Image Caption Generator Using CNN and LSTM
8 pages
Image Captioning Generator Using Deep Machine Learning
No ratings yet
Image Captioning Generator Using Deep Machine Learning
3 pages
Image Captioning with Deep Learning
No ratings yet
Image Captioning with Deep Learning
5 pages
Image Captioning: - A Deep Learning Approach
No ratings yet
Image Captioning: - A Deep Learning Approach
14 pages
(Ankitveer)
No ratings yet
(Ankitveer)
18 pages
CNNs for Image Detection & Recognition
No ratings yet
CNNs for Image Detection & Recognition
6 pages
DL Group 6 Rep
No ratings yet
DL Group 6 Rep
11 pages
Facial Recognition Using Deep Learning
No ratings yet
Facial Recognition Using Deep Learning
6 pages
Fin Irjmets1689950550
No ratings yet
Fin Irjmets1689950550
5 pages
A Novel Approach of Image Caption Generator Using Deep Learning
No ratings yet
A Novel Approach of Image Caption Generator Using Deep Learning
6 pages
Le y Yang - Tiny ImageNet Visual Recognition Challenge
No ratings yet
Le y Yang - Tiny ImageNet Visual Recognition Challenge
6 pages
Deep Learning in Image Processing
No ratings yet
Deep Learning in Image Processing
9 pages
DL U-III Computer Vision
100% (1)
DL U-III Computer Vision
30 pages
Wa0000.
No ratings yet
Wa0000.
9 pages
Free RPA UiPath Certification Exam Dumps
100% (1)
Free RPA UiPath Certification Exam Dumps
9 pages
Shreyas V Kashyap - Amusement Park
43% (7)
Shreyas V Kashyap - Amusement Park
47 pages
2011-12 CSE Batch Placement Report
No ratings yet
2011-12 CSE Batch Placement Report
3 pages
COMEDK UGET PCM Rank List PDF
0% (1)
COMEDK UGET PCM Rank List PDF
1,097 pages
2011-12 CSE Batch Placement Report
No ratings yet
2011-12 CSE Batch Placement Report
3 pages
2h Thursday
No ratings yet
2h Thursday
2 pages
Various Interface Styles
No ratings yet
Various Interface Styles
45 pages
Greatest and Least Integer Functions
No ratings yet
Greatest and Least Integer Functions
11 pages
The Business of English Dave Boss Clubhouse: When Do Americans Pronounce A T As A D Sound?
No ratings yet
The Business of English Dave Boss Clubhouse: When Do Americans Pronounce A T As A D Sound?
9 pages
2018 Summer Model Answer Paper
No ratings yet
2018 Summer Model Answer Paper
15 pages
Education Session III
No ratings yet
Education Session III
61 pages
7th Grade Unpacked
100% (1)
7th Grade Unpacked
45 pages
500-PG-8700!2!7 - Design of Space Flight Field Programmable Gate Arrays
No ratings yet
500-PG-8700!2!7 - Design of Space Flight Field Programmable Gate Arrays
34 pages
IELTS INFORMATION TO CANDIDATES IDP YOGYA at LBUSD
No ratings yet
IELTS INFORMATION TO CANDIDATES IDP YOGYA at LBUSD
2 pages
Outline PDF
No ratings yet
Outline PDF
3 pages
Ieltsspeakingpart2 181010031744
No ratings yet
Ieltsspeakingpart2 181010031744
20 pages
Dsat Rew
No ratings yet
Dsat Rew
61 pages
Unit 6 Test Study Guide
No ratings yet
Unit 6 Test Study Guide
6 pages
Hobbies and Sport 3
No ratings yet
Hobbies and Sport 3
10 pages
Nature of Hebrew - Elaiza Naniong
No ratings yet
Nature of Hebrew - Elaiza Naniong
9 pages
English Grammar Exercises
No ratings yet
English Grammar Exercises
3 pages
Holliday - 2006 - Native-Speakerism
No ratings yet
Holliday - 2006 - Native-Speakerism
3 pages
Hector Taipe Quispe Inglés (Estadounidense) Nivel 2
No ratings yet
Hector Taipe Quispe Inglés (Estadounidense) Nivel 2
4 pages
1 Complete The Sentences With The Correct Form of These Verbs. Some Verbs Are Used More Than Once
No ratings yet
1 Complete The Sentences With The Correct Form of These Verbs. Some Verbs Are Used More Than Once
4 pages
CAE Use of English Practice Test
0% (1)
CAE Use of English Practice Test
11 pages
KV Class 9 ENGLISH Annual Exam Sample Question Paper
No ratings yet
KV Class 9 ENGLISH Annual Exam Sample Question Paper
3 pages
Jeff 108
No ratings yet
Jeff 108
14 pages
Islamic Music
No ratings yet
Islamic Music
20 pages
Hanauer 2012 Growing Up in The Unseen Shadow of The Kindertransport A Poetic Narrative Autoethnography
No ratings yet
Hanauer 2012 Growing Up in The Unseen Shadow of The Kindertransport A Poetic Narrative Autoethnography
7 pages
Pipiwharauroa Te Rawhiti Newsletter Volume 2 Issue 2
No ratings yet
Pipiwharauroa Te Rawhiti Newsletter Volume 2 Issue 2
12 pages
B2 First Speaking Part 2
No ratings yet
B2 First Speaking Part 2
7 pages
Artificial Intelligence Research Paper
No ratings yet
Artificial Intelligence Research Paper
13 pages
History of English Literature (1896)
100% (1)
History of English Literature (1896)
334 pages
NCERT Solutions Class 11 Computer Science Strings
No ratings yet
NCERT Solutions Class 11 Computer Science Strings
18 pages
Sb-A3 Final Exam Individual Presentation Rubric
No ratings yet
Sb-A3 Final Exam Individual Presentation Rubric
1 page

Project

Uploaded by

Project

Uploaded by

AN END-TO-END TRAINING BASED APPROACH FOR CAPTIONING

COMPLEX IMAGES USING NEURAL NETWORKS

Shreyas V Kashyap - 1BG13CS097

Under the Guidance of

We have hereby chose the area of research to be a combination of the following

Our core area of problem is

Program a computer to "understand" a scene or features in an image.

Concerned with the automatic extraction, analysis and understanding of useful

The labelling of single prominent object in an image

Unable to describe a scene in a complex dense detailed image

Currently the problem persists in the following applications of Computer Vision

Image scene detection

Architecture is composed of a Convolutional Network, an efficient dense localization

Output of existing System :

You might also like