Silent sound technology
A
TECHNICAL SEMINAR REPORT ON
“SILENT SOUND TECHNOLOGY”
Submitted in the partial fulfillment of the requirement for the award of
degree of
BACHELOR OF TECHNOLOGY
IN
COMPUTER SCIENCE AND ENGINEERING
“SAI SPURTHI INSTITUTE OF TECHNOLOGY” TELANGANA
SUBMITTED
BY
Y.LIKHITHA [16C51A0559]
UNDER THE STEEMED GUIDANCE OF
Mr. DASTHGIRI [M.TECH]
Assistant Professor
SAI SPURTHI INSTITUTE OF TECHNOLOGY
1
Silent sound technology
DEPARTMENT OF COMPUTER SCIENCE ENGINEERING
(Accredited by NACC, Approved by AICTE, New Delhi & Affiliated to
JNTU-H)
B.GANGARAM (v), SATHUPALLY (M), KHAMMAM (DIST),
TELANGANA (ST)
B.GANGARAM (v), SATHUPALLY (M), KHAMMAM (DIST),
TELANGANA (ST)
CERTIFICATE
This is certify that the mini project report on “SILENT SOUND
TECHNOLOGY” has been
Y.LIKHITHA [16C51A0559]
In the partial fulfillment of the requirement for the award of the B.TECH
Degree in COMPUTER SCIENCE ENGINEERING from the Jawaharlal
Nehru Technology University is a bonafide work carried out by
Them during the academic year 2018-2019
2
Silent sound technology
Mr. N. VENKATESWAR RAO Mr.
DASTHGIRI [M.TECH]
Head of the department Project Guide
Dr.Ch. VIJAYA KUMAR EXAMINER
Principal
ACKNOWLEDGEMENT
A seminar report is a golden opportunity for learning about
upcoming new technologies and self-development. I consider myself
very lucky and honored to have so many wonderful people lead me
through in completion of this report. I respect and thank Mr.
PARDHASARADHI REDDY (Director of SSIT) for giving me an
opportunity to do the seminar report .And I heartily thank Mr. N.
VENKATESWAR RAO (Assistant Professor & Head of Department of
3
Silent sound technology
Computer Science and Engineering), for his guidance and suggestions
during this seminar report.
I own my profound gratitude to our seminar report guide Mr.
DASTHGIRI (Assistant Professor) who took keen interest on our project
work and guided us all along, till the completion of our report work by
providing all the necessary information for developing a good system.
THANKS AGAIN TO ALL WHO HAVE HELPED
A humble Thanks’
SILENT SOUND TECHNOLOGY
ABSTRACT
Everybody has the experience of talking aloud in the cell
phone in the midst of the disturbance while travelling in trains or buses.
There is no need of shouting anymore for this purpose. ‘Silent sound
technology’ is the answer for this problem.
4
Silent sound technology
The Silent sound technology is an amazing solution for
those who had lost their voice but wish to speak over phone. It is
developed at the Karlsruhe Institute of Technology and you can expect
to see it in the near future. When demonstrated, it seems to detect every
lip movement and internally converts the electrical pulses into sounds
signals and sends them neglecting all other surrounding noise. It is
definitely going to be a good solution for those feeling annoyed when
other speak loud over phone. ‘Silent Sound’ technology aims to notice
every movement of the lips and transform them into sounds, which
could help people who lose voices to speak, and allow people to make
silent calls without bothering others. Rather than making any sounds,
your handset would decipher the movements your mouth makes by
measuring muscle activity, then convert this into speech that the person
on the other end of the call can hear. So, basically, it reads your lips.
This new technology will be very helpful whenever a person loses his
voice while speaking or allow people to make silent calls without
disturbing others, even we can tell our PIN number to a trusted friend or
relative without eaves dropping.
Signature of guide Student signature
Y. Likhitha
16C51A0559
5
Silent sound technology
LIST OF CONTENTS
CHAPTER NAME PAGE NO
CERTIFICATE FROM COLLEGE
I
ACKNOWLEDGEMENT II
ABSTRACT III
LIST OF CONTENTS IV
LIST OF FIGURE V
ABSTRACT 3
1. INTRODUCTION 4
2. WHAT IS SILENT SOUND TECHNOLOGY?
5
3. NEED FOR SILENT SOUND TECHNOLOGY
6
4. CONCEPT OF SILENT SOUND TECHNOLOGY
7
5. ELECTROMYOGRAPHY
9
6
Silent sound technology
5.1 ELECTRICAL CHARACTERSTICS
10
5.2 HISTORY 10
5.3 PROCEDURE
11
5.4 NORMAL RESULT
14
5.5 ABNORMAL RESULT
14
5.6 EMG SIGNAL DECOMPOSITION
15
5.7 APPLICATION OF EMG 15
6. BLOCK DIAGRAM OF ELECTROMYOGRAPHY
16
7. WORKING OF ELECTROMYOGRAPHY
17
8.IMAGE PROCESSING
8. FEATURES OF SILENT SOUND TECHNOLOGY
18
7
Silent sound technology
9. RESEARCH 19
10. APPLICATIONS 20
11. ADVANTAGES 21
12. DISADVANTAGES 22
13. CONCLUSION 23
14. REFERENCES 24
LIST OF FIGURES:
CHAPTER 3:
3.1 Best result getting area 6
CHAPTER 4:
4.1Common people talking at same place without disturbance
7
CHAPTER 5:
5.1 Electromorphography signal generation
9
5.2 Electromyography instruments
12
5.3 Interfacing with electromyographer and body
13
8
Silent sound technology
CHAPTER 6:
6.1 diagram of electromyography 16
CHAPTER 7:
7.1 working of electromyography 17
ABSTRACT
Everybody has the experience of talking aloud in the cell phone in
the midst of the disturbance while travelling in trains or buses. There is
no need of shouting anymore for this purpose. ‘Silent sound technology’
is the answer for this problem.
The Silent sound technology is an amazing solution for those who
had lost their voice but wish to speak over phone. It is developed at the
Karlsruhe Institute of Technology and you can expect to see it in the
near future. When demonstrated, it seems to detect every lip movement
and internally converts the electrical pulses into sounds signals and
sends them neglecting all other surrounding noise. It is definitely going
9
Silent sound technology
to be a good solution for those feeling annoyed when other speak loud
over phone.
‘Silent Sound’ technology aims to notice every movement of the
lips and transform them into sounds, which could help people who lose
voices to speak, and allow people to make silent calls without bothering
others. Rather than making any sounds, your handset would decipher the
movements your mouth makes by measuring muscle activity, then
convert this into speech that the person on the other end of the call can
hear. So, basically, it reads your lips. This new technology will be very
helpful whenever a person loses his voice while speaking or allow
people to make silent calls without disturbing others, even we can tell
our PIN number to a trusted friend or relative without eavesdropping. At
the other end, the listener can hear a clear voice. the awesome feature
added to this technology is that "it is an instant polyglot" I.E,
movements can be immediately transformed into the language of the
user's choice. This translation works for languages like English, French
& German. But, for the languages like Chinese, different tones can hold
many different meanings. This poses Problem said Wand. he also said
that in five or may be in ten years this will Be used in everyday's
technology.
10
Silent sound technology
1. INTRODUCTION:
“Silence is the best answer for all the situations …even your mobile
understands!”
The word Cell Phone has become greatest buzz word in Cellular
Communication industry.
There are lots and lots of technology that tries to reduce the Noise
pollution and make the environment a better place to live in.
I will tell about a new technology known as Silent Sound
Technology that will put an end to Noise pollution
1.1. INVENTION:
The idea of interpreting silent sound electronically or with a
computer has been around for a long time and was popularized in 1968
Stanley Kubrick science fiction film-2001 A Space Odessey’
This device is presently being developed by Karlsruhe Institute
Of Technology (KIT), Germany using electromyography. In the year
2010 at CEBIT, one of the largest trade fair silent sound technology was
developed.
11
Silent sound technology
The device, developed by the Karlsruhe Institute of Technology
(KIT), uses electromyography, monitoring tiny muscular movements
that occur when we speak and converting them into electrical pulses that
can then be turned into speech, without a sound uttered.
2. WHAT IS SILENT SOUND TECHNOLOGY?
It is a technology that helps you to transmit information without
using your vocal cords. This technology aims to notice lip movements &
transform them into a computer generated sound that can be transmitted
over a phone. Hence person on other end of phone receives the
information in audio.
Think of a situation where there is an urgent need of talking over a
mobile in a noisy environment, silent sound technology provides us a
simple solution for this problem. This technology enables the users to
communicate through a mobile in a noisy environment which require
simple lip movements.
12
Silent sound technology
People suffering disorders of vocal chords cannot answer any
phone call so they may not communicate over a phone, hence silent
sound technology is of a great help for those people. Those people who
cannot speak can simply give the lip movements according to the words
they required to speak, this technology converts the silent lip movements
into voice and transmits through the mobile phone.
This technology helps in saying secret pin numbers or any credit
card numbers over the mobile phone which required to be conversed
secretly.
This technology can even be useful to astronauts for
communication in space (vacuum) where sound waves cannot be
transmitted.
Hence silent sound technology has a great scope in future as it has
a wide range of applications and more over it is a real time application.
13
Silent sound technology
3. NEED FOR SILENT SOUND TECHNOLOGY:
Silent Sound Technology will put an end to embarrassed situation such
as
An person answering his silent, but vibrating cell phone in a
meeting, lecture or performance, and Whispering loudly, I can’t
talk to you right now”
In the case of an urgent call, apologetically rush out of the room in
order to answer or call the person back.
Fig 3.1 Best result getting area
This new technology will be very helpful whenever a person loses
his voice while speaking or allow people to make silent calls without
disturbing others, even we can tell our pin number to a trusted friend or
relative without eaves DROPPING. At the other end, the listener can
hear a clear voice. The awesome feature added to this technology is that
"it is an instant polyglot" i.e., movements can be immediately
transformed into the language of the user's choice. This translation
14
Silent sound technology
works for languages like English, French & German. But, for the
languages like Chinese, different tones can hold many different
meanings. This poses problem said wand. He also said that in five or
may be in ten years this will be used in everyday's technology.
4. CONCEPT OF SILENT SOUND TECHNOLOGY:
You are in a movie theater or noisy restaurant or a bus etc where
there is lot of noise around is big issue while talking on a mobile phone.
But in the future this problem is eliminated with ”silent sounds”, a new
technology that detect very lip movement and internally converts the
electrical pulses into sound signals and sends them neglecting all other
surrounding noise
15
Silent sound technology
Fig 4.1common people talking at same place without disturbance
“SILENT SOUND TECHNOLOGY‟ aims to notice every
movement of the lips and transform them into sounds, which could help
people who lose voices to speak, and allow people to make silent calls
without bothering others. Rather than making any sounds, your handset
would decipher the movements your mouth makes by measuring muscle
activity, then convert this into Speech that the person on the other end of
the call can hear. So, basically, it reads your lips.
16
Silent sound technology
“We currently use electrodes which are glued to the skin”
Now imagine a technology that could allow everyone to make
phone calls, and talk, without making a sound screne, peaceful &
heavenly. Recently, it’s proved that the articulator muscle become active
irrespective of whether air passes through them or not. Even by saying
words without producing sound, weak electric currents are sent from
your brain to the speech muscle. These electrical signals are known as
“ELECTROMYOGRAMS”.
These electric signals play a major role in interpreting silent
sounds from silent lip movements as these electromyographic signals are
analyzed and converted to respective voice signals.
17
Silent sound technology
5. ELECTROMYOGRAPHY:
Electromyography (EMG) is a technique for evaluating and recording
the electrical activity produced by skeletal muscles. EMG is performed
using an instrument called an electromyograph, to produce a record
called an electromyogram. An electromyograph detects the electrical
potential generated by muscle cells when these cells are electrically or
neurologically activated. The signals can be analyzed to detect medical
abnormalities, activation level, and recruitment order or to analyze the
biomechanics of human or animal movement.
The Silent Sound Technology uses electromyography, monitoring
tiny muscular movements that occur when we speak.
Monitored signals are converted into electrical pulses that can then
be turned into speech, without a sound uttered.
Electromyography (EMG) is a technique for evaluating and
recording the electrical activity produced by skeletal muscles.
18
Silent sound technology
An electromyography detects the electrical potential generated by
muscle cells, when these cells are electrically or neurologically
activated.
Fig: 5.1 Electromorphography signal generation
5.1. ELECTRICAL CHARACTERSTICS:
The electrical source is the muscle membrane potential of about
-90 mV. Measured EMG potentials range between less than 50 μV and
up to 20 to 30 mV, depending on the muscle under observation.
Typical repetition rate of muscle motor unit firing is about 7–
20 Hz, depending on the size of the muscle (eye muscles versus seat
(gluteal) muscles), previous axonal damage and other factors. Damage to
motor units can be expected at ranges between 450 and 780 mV.
5.2. HISTORY:
The first documented experiments dealing with EMG started with
Francesco Redi’s works in 1666. Redi discovered a highly specialized
19
Silent sound technology
muscle of the electric ray fish (Electric Eel) generated electricity. By
1773, Walsh had been able to demonstrate that the Eel fish’s muscle
tissue could generate a spark of electricity. In 1792, a publication
entitled De Viribus Electricitatis in Motu Musculari Commentarius
appeared, written by Luigi Galvani, in which the author demonstrated
that electricity could initiate muscle contractions. Six decades later, in
1849, Dubois-Raymond discovered that it was also possible to record
electrical activity during a voluntary muscle contraction. The first actual
recording of this activity was made by Marey in 1890, who also
introduced the term electromyography. In 1922, Gasser and Erlanger
used an oscilloscope to show the electrical signals from muscles.
Because of the stochastic nature of the myoelectric signal, only rough
information could be obtained from its observation. The capability of
detecting electromyographic signals improved steadily from the 1930s
through the 1950s, and researchers began to use improved electrodes
more widely for the study of muscles. Clinical use of surface EMG
(sEMG) for the treatment of more specific disorders began in the 1960s.
Hardyck and his researchers were the first (1966) practitioners to use
sEMG. In the early 1980s, Cram and Steger introduced a clinical method
for scanning a variety of muscles using an EMG sensing device.
20
Silent sound technology
It is not until the middle of the 1980s that integration techniques in
electrodes had sufficiently advanced to allow batch production of the
required small and lightweight instrumentation and
amplifiers. At present, a number of suitable amplifiers are commercially
available. In the early 1980s, cables that produced signals in the desired
microvolt range became available. Recent research has resulted in a
better understanding of the properties of surface EMG recording.
Surface electromyography is increasingly used for recording from
superficial muscles in clinical or kinesiological protocols, where
intramuscular electrodes are used for investigating deep muscles or
localized muscle activity.
There are many applications for the use of EMG. EMG is used
clinically for the diagnosis of neurological and neuromuscular problems.
It is used diagnostically by gait laboratories and by clinicians trained in
the use of biofeedback or ergonomic assessment. EMG is also used in
many types of research laboratories, including those involved in
biomechanics, motor control, neuromuscular physiology, movement
disorders, postural control, and physical therapy.
5.3. PROCEDURE:
21
Silent sound technology
There are two kinds of EMG in widespread use: surface EMG and
intramuscular (needle and fine-wire) EMG. To perform intramuscular
EMG, a needle electrode or a needle containing two fine-wire electrodes
is inserted through the skin into the muscle tissue. A trained professional
(such as a neurologist, physiatrist, or physical therapist) observes the
electrical activity while inserting the electrode. The insertional activity
provides valuable information about the state of the muscle and its
innervating nerve. Normal muscles at rest make certain, normal
electrical signals when the needle is inserted into them. Then the
electrical activity when the muscle is at rest is studied. Abnormal
spontaneous activity might indicate some nerve and/or muscle damage.
Then the patient is asked to contract the muscle smoothly. The shape,
size, and frequency of the resulting motor unit potentials are judged.
Then the electrode is retracted a few millimeters, and again the activity
is analyzed until at least 10–20 units have been collected. Each electrode
track gives only a very local picture of the activity of the whole muscle.
Because skeletal muscles differ in the inner structure, the electrode has
to be placed at various locations to obtain an accurate study.
22
Silent sound technology
Fig: 5.2 Electromyography instruments
Intramuscular EMG may be considered too invasive or
unnecessary in some cases. Instead, a surface electrode may be used to
monitor the general picture of muscle activation, as opposed to the
activity of only a few fibres as observed using an intramuscular EMG.
This technique is used in a number of settings; for example, in the
physiotherapy clinic, muscle activation is monitored Using surface EMG
and patients have an auditory or visual stimulus to help them know when
they are activating the muscle (biofeedback)
23
Silent sound technology
Fig 5.3 Interfacing with electromyographer and body
A motor unit is defined as one motor neuron and all of the muscle
fibers it innervates. When a motor unit fires, the impulse (called an
action potential) is carried down the motor neuron to the muscle. The
area where the nerve contacts the muscle is called the neuromuscular
junction, or the motor end plate. After the action potential is transmitted
across the neuromuscular junction, an action potential is elicited in all of
24
Silent sound technology
the innervated muscle fibers of that particular motor unit. The sum of all
this electrical activity is known as a motor unit action potential (MUAP).
Thiselectrophysiologic activity from multiple motor units is the signal
typically evaluated during an EMG. The composition of the motor unit,
the number of muscle fibres per motor unit, the metabolic type of muscle
fibres and many other factors affect the shape of the motor unit
potentials in the myogram.
Nerve conduction testing is also often done at the same time as an
EMG to diagnose neurological diseases.Some patients can find the
procedure somewhat painful, whereas others experience only a small
amount of discomfort when the needle is inserted. The muscle or
muscles being tested may be slightly sore for a day or two after the
procedure.
5.4. NORMAL RESULTS:
Muscle tissue at rest is normally electrically inactive. After the
electrical activity caused by the irritation of needle insertion subsides,
the electromyograph should detect no abnormal spontaneous activity
(i.e., a muscle at rest should be electrically silent, with the exception of
the area of the neuromuscular junction, which is, under normal
circumstances, very spontaneously active). When the muscle is
25
Silent sound technology
voluntarily contracted, action potentials begin to appear. As the strength
of the muscle contraction is increased, more and more muscle fibers
produce action potentials. When the muscle is fully contracted, there
should appear a disorderly group of action potentials of varying rates
and amplitudes (a complete recruitment and interference pattern).
5.5. ABNORMAL RESULTS:
EMG is used to diagnose diseases that generally may be
classified into one of the following categories: neuropathies,
neuromuscular junction diseases and myopathies.
Neuropathic disease has the following defining EMG
characteristics:
An action potential amplitude that is twice normal due to the
increased number of fibres per motor unit because of
reinnervation of denervated fibres
An increase in duration of the action potential
A decrease in the number of motor units in the muscle (as
found using motor unit number estimation techniques)
Myopathic disease has these defining EMG characteristics:
A decrease in duration of the action potential
26
Silent sound technology
A reduction in the area to amplitude ratio of the action
potential
A decrease in the number of motor units in the muscle (in
extremely severe cases only)
Because of the individuality of each patient and disease, some
of these characteristics may not appear in every case
5.6 .EMG SIGNAL DECOMPOSITION:
EMG signals are essentially made up of superimposed motor unit
action potentials (MUAPs) from several motor units. For a thorough
analysis, the measured EMG signals can be decomposed into their
constituent MUAPs. MUAPs from different motor units tend to have
different characteristic shapes, while MUAPs recorded by the same
electrode from the same motor unit are typically similar. Notably MUAP
size and shape depend on where the electrode is located with respect to
the fibers and so can appear to be different if the electrode moves
position. EMG decomposition is non-trivial, although many methods
have been proposed
5.7 .APPLICATIONS OF EMG:
27
Silent sound technology
EMG signals are used in many clinical and biomedical
applications. EMG is used as a diagnostics tool for identifying
neuromuscular diseases, assessing low-back pain, kinesiology, and
disorders of motor control. EMG signals are also used as a control signal
for prosthetic devices such as prosthetic hands, arms, and lower limbs.
EMG can be used to sense isometric muscular activity where no
movement is produced. This enables definition of a class of subtle
motionless gestures to control interfaces without being noticed and
without disrupting the surrounding environment. These signals can be
used to control a prosthesis or as a control signal for an electronic device
such as a mobile phone or PDA.
EMG signals have been targeted as control for flight systems. The
Human Senses Group at the NASA Ames Research Center at Moffett
Field, CA seeks to advance man-machine interfaces by directly
connecting a person to a computer. In this project, an EMG signal is
used to substitute for mechanical joysticks and keyboards. EMG has also
been used in research towards a "wearable cockpit," which employs
EMG-based gestures to manipulate switches and control sticks necessary
for flight in conjunction with a goggle-based display.
Unvoiced speech recognition recognizes speech by observing the EMG
activity of muscles associated with speech. It is targeted for use in noisy
28
Silent sound technology
environments, and may be helpful for people without vocal cords and
people with aphasia.
7. WORKING OF ELECTROMYOGRAPHY:
A needle containing two fine-wired electrodes is inserted through
the skin into the muscle tissue.
Normal muscles at rest make certain, normal electrical sounds
when the needle is inserted into them. Then the electrical activity
when the muscle is at rest is observed.
Each electrode track gives only a very local picture of the activity
of the whole muscle.
Because skeletal muscles differ in the inner structure, the electrode
has to be placed at various locations to obtain an accurate signal
29
Silent sound technology
Fig: 7.1 working of electromyography
Electromyographic sensors attached to the face records the electric
signals produced by the facial Muscles compare them with pre recorded
signal pattern of spoken words When there is a match that sound is
transmitted on to the other end of the line and person at the other end
listen to the spoken words
8.IMAGE PROCESSING:
In computer science, image processing is any form of signal
processing for which the input is an image, such as a photograph or
30
Silent sound technology
video frame; the output of image processing may be either an image
or, a set of characteristics or parameters related to the image. In the
silent sound technology the output of this image processing is an
audio record.
Image processing is a physical process used to convert an image signal
into a physical image. The image signal can be either digital or analog.
The actual output itself can be an actual physical image or the
characteristics of an image.
The most common type of image processing is photography. In this
process, an image is captured using a camera to create a digital or
analog image. In order to produce a physical picture, the image is
processed using the appropriate technology based on the input source
type.
In digital photography, the image is stored as a computer file. This file
is translated using photographic software to generate an actual image.
The colors and shadings are all captured at the time the photograph is
taken the software translates this information into an image.
When creating images using analog photography, the image is burned
into a film using a chemical reaction triggered by controlled exposure to
31
Silent sound technology
light. The image is processed in a darkroom, using special chemicals to
create the actual image.
In addition to photography, there are a wide range of other image
processing operations. The field of digital imaging has created a whole
range of new applications and tools that were previously impossible.
Face recognition software, medical image processing and remote
sensing are all possible due to the development of digital image
processing. Specialized computer programs are used to enhance and
correct images. These programs apply algorithms to the actual data and
are able to reduce signal distortion, clarify fuzzy images and add light to
an underexposed image.
Image processing techniques were first developed in 1960 through the
collaboration of a wide range of scientists and academics. The main
focus of their work was to develop medical imaging, character
recognition and create high quality images at the microscopic level.
During this period, equipment and processing costs were prohibitively
high.
There are three major benefits to digital image processing. The
consistent high quality of the image, the low cost of processing and the
ability to manipulate all aspects of the process are all great benefits. As
long as computer processing speed continues to increase while the cost
32
Silent sound technology
of storage memory continues to drop, the field of image processing will
grow.
9. BLOCK DIAGRAM OF IMAGE PROCESSING:
The block diagram clearly explains the implementation of silent
sound technology through electromyography.
From the diagram it is clear that the silent lip movements are
captured through sensor probes or it may be through image processing
technique and these movements are converted to speech through a silent
33
Silent sound technology
vocoder. These speech signals are transmitted through a mobile and
received as normal voice signal at the other end.
Here the recorded electromyographic signals are converted to
speech signals by comparing them with pre recorded voice signals and
transmitting them over the phone if there is a match else signal is not
transmitted
IMAGE PROCESSING TECHNIQUE :
Analysis of remotely sensed data is done using various image
processing techniques and methods that includes:
Analog image processing
Digital image processing
ANALOG IMAGE PROCESSING
Analog processing techniques is applied to hard copy data such as
photographs or printouts.
It adopts certain elements of interpretation, such as primary
element, spatial arrangement etc.,
With the combination of multi-concept of examining remotely
sensed data in multispectral, multitemporal, multiscales and in
34
Silent sound technology
conjunction with multidisciplinary, allows us to make a verdict not
only as to what an object is but also its importance.
Apart from these it also includes optical photogrammetric
techniques allowing for precise measurement of the height,
width, location, etc. of an object.
35
Silent sound technology
Figure5.1-Element of image intrepretation
DIGITAL IMAGE PROCESSING:
Digital Image Processing involves a collection of techniques for the
manipulation of digital images by computers. Digital image processing is
the use of computer algorithms to perform image processing on digital
images. As a subcategory or field of digital signal processing, digital
image processing has many advantages over analog image processing. It
allows a much wider range of algorithms to be applied to the input data
and can avoid problems such as the build-up of noise and signal
distortion during processing. Since images are defined over two
dimensions (perhaps more) digital image processing may be modeled in
the form of Multidimensional Systems.
Images are available in two forms - photographic film form and
digital form. Variations in the scene characteristics are represented as
variations in brightness on photographic films. A particular part of scene
reflecting more energy will appear bright while a different part of the
same scene that reflecting less energy will appear black. Digital image
consists of discrete picture elements called pixels. Associated with each
pixel is a number represented as DN (Digital Number), that depicts the
36
Silent sound technology
average radiance of relatively small area within a scene. The size of this
area effects the reproduction of details within the scene. As the pixel
size is reduced more scene detail is preserved in digital representation.
Digital processing is used in a variety of applications. The different types
of digital processing include image processing, audio processing, video
processing, signal processing, and data processing. In the most basic
terms, digital processing refers to any manipulation of electronic data to
produce a specific effect.
In a most generalized way, a digital image is an array of numbers
depicting spatial distribution of a certain field parameters (such as
reflectivity of EM radiation, emissivity, temperature or some
geophysical or topographical elevation. Digital image consists of
discrete picture elements called pixels. Associated with each pixel is a
number represented as DN (Digital Number), that depicts the average
radiance of relatively small area within a scene. The range of DN values
being normally 0 to 255. The size of this area effects the reproduction of
details within the scene. As the pixel size is reduced more scene detail is
preserved in digital representation.
Remote sensing images are recorded in digital forms and then
processed by the computers to produce images for interpretation
37
Silent sound technology
purposes. Images are available in two forms - photographic film form
and digital form. Variations in the scene characteristics are represented
as variations in brightness on photographic films. A particular part of
scene reflecting more energy will appear bright while a different part of
the same scene that reflecting less energy will appear black. Digital
image consists of discrete picture elements called pixels. Associated
with each pixel is a number represented as DN (Digital Number), that
depicts the average radiance of relatively small area within a scene. The
size of this area effects the reproduction of details within the scene. As
the pixel size is reduced more scene detail is preserved in digital
representation.
Data Formats For Digital Satellite Imagery
Digital data from the various satellite systems supplied to the user in
the form of computer readable tapes or CD-ROM. As no worldwide
standard for the storage and transfer of remotely sensed data has been
agreed upon, though the CEOS (Committee on Earth Observation
Satellites) format is becoming accepted as the standard. Digital remote
sensing data are often organised using one of the three common
formats used to organise image data . For an instance an image
consisting of four spectral channels, which can be visualised as four
superimposed images, with corresponding pixels in one band
38
Silent sound technology
registering exactly to those in the other bands. These common formats
are:
Band Interleaved by Pixel (BIP)
Band Interleaved by Line (BIL)
Band Sequential (BQ)
Digital image analysis is usually conducted using Raster data structures -
each image is treated as an array of values. It offers advantages for
manipulation of pixel values by image processing system, as it is easy to
find and locate pixels and their values. Disadvantages becomes
apparent when one needs to represent the array of pixels as discrete
patches or regions, where as Vector data structures uses polygonal
patches and their boundaries as fundamental units for analysis and
manipulation. Though vector format is not appropriate to for digital
analysis of remotely sensed data.
Image Resolution
Resolution can be defined as "the ability of an imaging system to record
fine details in a distinguishable manner". A working knowledge of
resolution is essential for understanding both practical and conceptual
details of remote sensing. Along with the actual positioning of spectral
bands, they are of paramount importance in determining the suitability
39
Silent sound technology
of remotely sensed data for a given applications. The major
characteristics of imaging remote sensing instrument operating in the
visible and infrared spectral region are described in terms as follow:
Spectral resolution
Radiometric resolution
Spatial resolution
Temporal resolution
Spectral Resolution refers to the width of the spectral bands. As
different material on the earth surface exhibit different spectral
reflectances and emissivities. These spectral characteristics define the
spectral position and spectral sensitivity in order to distinguish
materials. There is a tradeoff between spectral resolution and signal to
noise. The use of well -chosen and sufficiently numerous spectral bands
is a necessity, therefore, if different targets are to be successfully
identified on remotely sensed images.
Radiometric Resolution or radiometric sensitivity refers to the number
of digital levels used to express the data collected by the sensor. It is
commonly expressed as the number of bits (binary digits) needs to
store the maximum level. For example Landsat TM data are quantised
to 256 levels (equivalent to 8 bits). Here also there is a tradeoff
40
Silent sound technology
between radiometric resolution and signal to noise. There is no point in
having a step size less than the noise level in the data. A low-quality
instrument with a high noise level would necessarily, therefore, have a
lower radiometric resolution compared with a high-quality, high signal-
to-noise-ratio instrument. Also higher radiometric resolution may
conflict with data storage and transmission rates.
Spatial Resolution of an imaging system is defines through various
criteria, the geometric properties of the imaging system, the ability to
distinguish between point targets, the ability to measure the periodicity
of repetitive targets ability to measure the spectral properties of small
targets.
The most commonly quoted quantity is the instantaneous field of view
(IFOV), which is the angle subtended by the geometrical projection of
single detector element to the Earth's surface. It may also be given as
the distance, D measured along the ground, in which case, IFOV is
clearly dependent on sensor height, from the relation: D = hb, where h
is the height and b is the angular IFOV in radians. An alternative
measure of the IFOV is based on the PSF, e.g., the width of the PDF at
half its maximum value.
A problem with IFOV definition, however, is that it is a purely geometric
definition and does not take into account spectral properties of the
41
Silent sound technology
target. The effective resolution element (ERE) has been defined as "the
size of an area for which a single radiance value can be assigned with
reasonable assurance that the response is within 5% of the value
representing the actual relative radiance". Being based on actual image
data, this quantity may be more useful in some situations than the IFOV.
Other methods of defining the spatial resolving power of a sensor are
based on the ability of the device to distinguish between specified
targets. Of the concerns the ratio of the modulation of the image to
that of the real target.
It contain some flaws. To overcome the flaws and deficiencies in order
to get the originality of the data, it needs to undergo several steps of
processing.
Digital Image Processing undergoes three general steps:
Pre-processing
Display and enhancement
Information extraction
The Flow diagram that explains the steps that takes place
during the Digital Image Processing is shown below:
42
Silent sound technology
F igure5.2-Digital preprocessing
Pre-Processing:
Pre-processing consists of those operations that prepare data for
subsequent analysis that attempts to correct or compensate for
systematic errors.
Then analyst may use feature extraction to reduce the
dimensionality of the data.
43
Silent sound technology
Thus feature extraction is the process of isolating the most useful
components of the data for further study while discarding the less
useful aspects.
It reduces the number of variables that must be examined,
thereby saving time and resources.
Image Enhancement:
Improves the interpretability of the image by increasing apparent
contrast among various features in the scene
Common enhancements include image reduction, image
rectification, image magnification, contrast adjustments, principal
component analysis texture transformation and so on.
Image Enhancement operations are carried out to improve the
interpretability of the image by increasing apparent contrast
among various features in the scene. The enhancement techniques
depend upon two factors mainly
The digital data (i.e. with spectral bands and resolution)
The objectives of interpretation
As an image enhancement technique often drastically alters the
original numeric data, it is normally used only for visual (manual)
44
Silent sound technology
interpretation and not for further numeric analysis. Common
enhancements include image reduction, image rectification, image
magnification, transect extraction, contrast adjustments, band
ratioing, spatial filtering, Fourier transformations, principal
component analysis and texture transformation.
INFORMATION EXTRACTION:
In Information Extraction the remotely sensed data is subjected to
quantitative analysis to assign individual pixels to specific classes.
It is then classified.
It is necessary to evaluate its accuracy by comparing the
categories on the classified images with the areas of known
identity on the ground.
The final result of the analysis consists of maps (or images), data
and a report. Then these are converted to corresponding signals.
Information Extraction is the last step toward the final output of
the image analysis. After pre-processing and image enhancement
the remotely sensed data is subjected to quantitative analysis to
assign individual pixels to specific classes. Classification of the
45
Silent sound technology
image is based on the known and unknown identity to classify the
remainder of the image consisting of those pixels of unknown
identity. After classification is complete, it is necessary to evaluate
its accuracy by comparing the categories on the classified images
with the areas of known identity on the ground. The final result of
the analysis consists of maps (or images), data and a report.
8. FEATURES OF SILENT SOUND TECHNOLOGY:
Native speakers can silently utter a sentence in their language, and the
receivers can hear the translated sentence in their language. It appears
as if the native speaker
46
Silent sound technology
Produced speech in a foreign language. The translation technology
works for languages like English, French and German, except
Chinese, where different tones can hold many different meanings.
Allow people to make silent calls without bothering others.
The Technology opens up a host of application such as mentioned
below
Helping people who have lost their voice due to illness or accident.
Telling a trusted friend your PIN number over the phone without
anyone eavesdropping — assuming no lip-readers are around.
Silent Sound Techniques is applied in Military for communicating
secret/confidential matters to others.
47
Silent sound technology
9. RESEARCH:
With all of the millions of phones in circulation, there is great
potential for increasing earnings by saving 'lost calls' - telephone calls
that go unanswered or uninitiated because the user is in a situation in
which he or she cannot speak - not just in business meetings, but
everyday situations. According to research, these 'lost calls' are worth
$20 billion per year worldwide. For the cellular operator, these are
potential earnings that are currently being left on the table. When these
'lost calls' become answerable, and can be conducted without making a
sound, there is a tremendous potential for increased profits. Now the
research is going on technology that can be used in Office Environment
too.
48
Silent sound technology
10. APPLICATIONS:
Helping people who have lost their voice due to illness or accident.
Telling a trusted friend your PIN number over the phone without
anyone eavesdropping — assuming no lip-readers are around.
Silent Sound Techniques is applied in Military for communicating
secret/confidential matters to others.
Native speakers can silently utter a sentence in their language, and the
receivers can hear the translated sentence in their language. It appears
as if the native speaker produced speech in a foreign language. The
translation technology works for languages like English, French and
German, except Chinese, where different tones can hold many
different meanings.
Allow people to make silent calls without bothering others.
49
Silent sound technology
11. ADVANTAGES:
Helping people who have lost their voice due to illness or accident.
Since the electrical signals are universal they can be translated into
any language. Native speakers can translate it before sending it to the
other side. Hence it can be converted into any language of choice
currently being German, English & French.
To tell a secret PIN no. , or credit card no. on the phone now be easy
as there is no one eavesdrop anymore.
12. DISADVANTAGES:
Translation into majority of languages but for languages such as
Chinese different tone holds different meaning, facial movements
being the same. Hence this technology is difficult to apply in such
situations
Even differentiating between people and emotions cannot be done.
This means you will always feel you are talking to a robot.
50
Silent sound technology
From security point of view recognising who you are talking to gets
complicated
This device presently needs nine leads to be attached to our face
which is quite impractical to make it usable.
13. CONCLUSION:
Thus Silent Sound Technology, one of the recent trends in the field of
information technology implements”Talking without Talking”.
It will be one of the innovation and useful technology and in mere
future this technology will be use in our day to day life.
‘Silent Sound’ technology aims to notice every movement of the lips
and transform them into sounds, which could help people who lose
voices to speak, and allow people to make silent calls without
bothering others. Rather than making any sounds, your handset would
decipher the movements your mouth makes by measuring muscle
activity, then convert this into speech that the person on the other end
of the call can hear. So, basically, it reads your lips.
51
Silent sound technology
14. REFERENCES:
www.google.com
www.techpark.net
www.telecomspace.com
www.wikipedia.com
http://www.seminarprojects.com
www.slideshare.net
52