Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
43 views8 pages

3D Sound Source Localization Using A Spherical Mic

The paper presents a method for 3D sound source localization using a spherical microphone array (Eigenmike) by correlating synthesized spherical wave spectra with measured impulse responses. The study examines the accuracy of the localization algorithm under various spatial conditions, revealing that the accuracy is influenced by the microphone spacing and the size of the synthesis matrix. Results indicate that the algorithm can effectively determine sound source directions, although frequency resolution is limited due to the microphone arrangement.

Uploaded by

matmore83
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views8 pages

3D Sound Source Localization Using A Spherical Mic

The paper presents a method for 3D sound source localization using a spherical microphone array (Eigenmike) by correlating synthesized spherical wave spectra with measured impulse responses. The study examines the accuracy of the localization algorithm under various spatial conditions, revealing that the accuracy is influenced by the microphone spacing and the size of the synthesis matrix. Results indicate that the algorithm can effectively determine sound source directions, although frequency resolution is limited due to the microphone arrangement.

Uploaded by

matmore83
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/326126874

3D sound source localization using a spherical microphone array

Conference Paper · January 2014

CITATIONS READS

13 322

2 authors:

Manuel Brandner Alois Sontacchi


Kunstuniversität Graz Kunstuniversität Graz
13 PUBLICATIONS 55 CITATIONS 117 PUBLICATIONS 678 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Manuel Brandner on 02 July 2018.

The user has requested enhancement of the downloaded file.


Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

3D sound source localization using a spherical microphone array


M. Brandner1 , A. Sontacchi2
1
University of Music and Performing Arts Graz, Austria, Email: [email protected]
2
University of Music and Performing Arts Graz, Austria, Email: [email protected]

Abstract
Directional detection of sound sources under defined ambience conditions using a spherical microphone array
(Eigenmike) is examined. The used spatial detection algorithm correlates synthesized spherical wave spectra derived
from theory with a set of concrete spherical spectra calculated from measured impulse responses. Thus, measurement
signals were recorded with the 32 microphone equipped Eigenmike microphone array and two measurement sets were
created for spatial sampling positions along an enclosing spherical surface with different radial distances. In order
to simulate free field conditions and to compare the applicability of the proposed algorithm under real conditions the
derived impulse responses are windowed adequately. Based on the Fourier transform of these resulting responses
the calculation of the spherical wave spectra for specific source positions is possible. Under free-field conditions,
the calculation of the synthesized spherical wave spectra of various spatial positions only depends on the structural
properties of the microphone array and the position of the measured omnidirectional sound source. Correlation of
measured and synthesized spherical wave spectra results in a data set with a maximum value for the sought direction
of the sound source. Another aim of investigation is to understand the context between the size of the synthesized
data - which serves as a lookup-table - and the directional accuracy. Within the contribution valuable information
about functionality as well as the boundaries of the directional detection under defined spatial conditions with the
spherical microphone array Eigenmike is given. The results show a limited frequency resolution as expected, due to
the arrangement of the microphones on the sphere. The measurement is carried out at 612 equiangular source positions
for the first measurement set and at 480 source positions for the second one. For the ideal case and a synthesis matrix
including all measured source positions the algorithm yields for the full frequency range from 172 Hz up to the aliasing
frequency (falias = 5.2kHz) an diminishing median deviation error. The accuracy is directly connected to the spatial
sampling (microphone capsule spacing) and the size of the synthesis matrix.

1. Introduction field. If not possible adequate windowing is necessary. If this


conditions are predominant, we can calculate the sound field
Acoustic source localization (ASL) is a common subject in around an microphone array analytically with the spherical
scientific literature and is feasible trough different approaches. base solutions [3]
The angular information estimated by an source localization
n
∞ X
algorithm can be used as a steering indicator for microphone X
p(kr, Θ) = [bnm jn (kr) + cnm h(2)
n (kr)]Ynm (Θ)
array beamforming [1]. A common used method for ASL is
n=0 m=−n
to calculate the sought direction through TDOA (direction ∞ X n
dependent time difference of arrival) which prerequisite is
X
= [bnm Rnm (kr, Θ) + cnm h(2) m
n Sn (kr, Θ)].
a spatially distributed array. Within this method the array n=0 m=−n
spacing is directly related to the magnitude detection of (1)
TDOA and large spacings are common. A different approach
for ASL is to use near-conincident microphone arrays The coefficients bnm and cnm can be called the wave spectrum
(NCMA), where the spacing of the microphone capsules of the incident and radiating field, respectively. Jn (kr) are
(2)
is reduced to a minimum. NCMA design emphasizes on the spherical Bessel functions, and hn (kr) are the spherical
spacing two up to more microphones as close as possible to Hankel functions of the second kind. Rnm (kr, Θ) describes
the acoustic center of the array. The principle of determining the regular incident field solution and Snm (kr, Θ)) is the
the source direction here, is to use level differences between singular radiating field solution. Ynm are the normalized
microphone signals. spherical harmonics depending on the solid/room angle (unit
Common Methods calculate the localization through the vector). The indices n and m stand for the order and
sound intensity vector approach [2] which calculates the degree, respectively with the ranges 0 6 n 6 N , and
sought direction in the frequency domain. This paper presents −n 6 m 6 n. With the spherical base solutions the
an approach, also frequency dependent, but the evaluation of sound pressure distribution of an sound field can be fully
a source direction is made via spherical harmonics. described analytically. On the other hand of course measuring
As a foundation for an ideal measurement setup we want the such a sound field with an microphone array is possible. If
room influences to be zero or at least reduced to a minimum, e.g. the analytically calculated pressure values from a sound
later we want to have a look at the feasibility for an arbitrary source at a specific location are compared with the measured
room situation. The ideal setup means that only the direct pressure values at this very position, they should show the
sound is recorded without any influences of a diffuse sound same pressure contour around the sphere of the microphone
array. The source direction can be determined by a maximum

3
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

search of the correlation function obtained via correlation of a calculation of the coefficients cnm (cf.(7)). The derivation
set of analytically derived wave spectra with measured wave
spectra. i ′ ′
jn (krm )bnm + hn (krm )cnm = 0 (7)
ρ0 c

2. Method yields the coefficients cnm


This paper presents a novel algorithm that correlates a syn- ′
jn (krm )
thesized spherical wave spectrum in order to get the source cnm = − bnm . (8)
h′n (krm )
direction towards an acoustic sound source, which is calcu-
lated analytically with measured wave spectra. The measured Equation (4) will be expandend with the coefficients
wave spectrum of an source position can be calculate via the

spherical harmonic transform (SHT) [4](cf.(2) and (3)). The j (krm )hn (krm )
correlation of a measured spectrum with a lookup-table of Anm,s = jn (krm )bnm − n ′ bnm (9)
hn (krm )
synthesized spectra should yield in the ideal case a matching
profile for all possible source locations with a maximum at the and can be expressed as follows
position for the sought direction.
′ ′
jn (krm )hn (krm ) − jn (krm )hn (krm )
SHTnm {g(ϑ, ϕ)} = γnm (2) Anm,s = bnm . (10)
h′n (krm )
Z 2π Z π
γnm = g(ϑ, ϕ)Ynm (ϑ, ϕ) sin(ϑ)dϑdϕ (3) The denominator (10) can be simplified via 1/i(kr)2 , which
ϕ=0 ϑ=0
is the Wronski determinant. With this simplification and
equation (6) we can state the synthesis-vector.
2.1. Synthesized Spectra
hn (krq )
For the calculation of the pressure distribution from incident Anm,s = − 2 h′ (kr ) Ynm (Θ). (11)
krmic n m
and radiating fields the spherical base solutions are the math-
ematical fundamentals cf.(1). After transforming the equation The first multiplicand in (11) is the radial-filter term with the
the spherical wave spectrum can be stated as follows: paramters, frequency (k = ω/c), radius from the center to
the microphone surface rm and source radius rq . The second
Anm,s = jn (krm )bnm + hn (krm )cnm . (4) multiplicand states the evaluation of the spherical harmonic
decomposition of a source position from direction (ϕ, ϑ). To
If a point source is examined at (rq , Θ) the expression can be get an set of spherical wave spectra for every source direction
expanded: (eq.12), it is necessary to build a Ynm -Matrix (eq.13) with all
angle pairs {ϕl , ϑl }l=1...L at the respective source location,
bnm = −ikhn (krq )Ynm (Θ)
(5) diag {hn (krq )}
Anm,s = − 2 diag {h′ (kr )} Ynm (Θl )l=1...L (12)
Anm,s = −ikjn (krm )hn (krq )Ynm (Θ) + cnm hn (krm ). krmic n m
(6)
 T
The spherical Bessel functions jn and hn the spherical Hankel Y00 (Θ1 ) Y00 (Θ2 ) ··· Y00 (Θl )
functions are dependent on the order of the examined system. Y1−1 (Θ1 ) Y1−1 (Θ2 ) ··· Y1−1 (Θl )
 
Equation (6) describes the synthesized spherical spectrum and  Y10 (Θ1 ) Y10 (Θ2 ) ··· Y10 (Θl ) 
Ynm (Θl ) =  Y11 (Θ1 ) .
 
holds two parts in the formula. The first part is the sound  Y11 (Θ2 ) ··· Y11 (Θl ) 

field of a point source from the source direction (Θ) decom-  .. .. .. .. 
 . . . . 
posed in spherical harmonics. The distance rq is the source
Ynm (Θ1 ) Ynm (Θ2 ) ··· Ynm (Θl )
radius starting from the microphone array center whereas
rm describes the distance from the center to the enclosing (13)
surface of the rigid sphere microphone array. Here only the
far field conditions are taken into account. The second part 2.2. Measured Spectra
in equation (6) describes the reflection of the incident sound
field on the surface of the microphone array. The sound An Investigation of a specific source position is expressed as
field decomposition from the center to the microphone surface in equation (1) for a specific frequency and yields with a trans-
can be analytically denoted via the coefficients cnm and the form a measured spherical spectrum. The vector p in equation
propagation term hn . (14) is already fourier transformed and therefore frequency
The derivation of the sound pressure yields the sound particle dependent. Because we want to investigate the spherical wave
velocity. For the spectral terms the same is true, the derivation spectrum, equation (14) needs to be transformed to (15)
of the spherical wave spectrum of the pressure distribution  
pϕmic1 ,ϑmic1 (f )
yields the spherical wave spectrum of the sound particle ve-  pϕmic2 ,ϑmic2 (f ) 
locity. On a rigid sphere we know that the normal component 
 = Ynm Anm

(14)
 ..
of the sound particle velocity has to be zero. The derivation  . 
of equation (4) will be set to zero. This steps allows us the pϕmic32 ,ϑmic32 (f )

4
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

Anm = pY−1
nm (15) 2.4. Examining the ASL Algorithm
To prevent that the matrix inversion yields a singular solution, Since an analytically feasible acoustic source localization
the Ynm -matrix can be inverted via a pseudo inverse: algorithm for arbitrary source direction is found, the next
step is to examine the feasibility for different scenarios. The
Anm = (YT Y)−1 YT p (16) first empirical evaluation of the algorithm is made with a
measurement setup fulfilling the free field condition. Impulse
Nevertheless, adequate sampling is necessary to yield a con- responses with an exponential sweep are calculated and ade-
dition number as small as possible. The configuration of quate windowing with a window with 20 samples fade in and
the capsules on the sphere of the Eigenmike yield such a 80 samples fade out prevent spatial influences.
small condition number. Equation (16) results in a (N + 1)2 For further investigations the behavior of the algorithm by
long vector and corresponds to the spherical wave spectrum reducing the synthesis matrix - reducing the sets grid of
calculated from the measured sound pressure distribution. possible source locations - to a minimum is examined. After
minimizing the synthesis matrix, which holds the possible
2.3. Matching profile of Spherical Wave Spectra source locations, an vector interpolation approach is made.
In theory, the three strongest components of the matching
The feasibility of the algorithm is examined by the autocor- profile should hold the most valuable directional information.
relation of the measured wave spectra. For this case the Therefore, these components are summed up and weighted to
angular deviation is zero. Further investigations are made gain an improvement for small synthesis sets.
on examining the correlation of a synthesized set of wave Another task is examining the robustness of the algorithm for
spectra (calculated analytically) with a measured spectrum different window lengths, which is directly connected to a
for a specific source position. To get to an matching profile change of the D/R - ratio (ratio of the direct sound field to
(correlation vector) we first have to calculate the correlation reverberant sound field) and similar to changing the distance
coefficient of the Anm -vector and the Anm -matrix [5]. This to the acoustic sound source.
synthesis matrix (Anm -matrix) holds all angle dependent Depending on the radial distance of a sound source to the
source positions, so the correlation coefficient has to be microphone array, a radial filter is necessary to focus the
calculated for all angle pairs {ϕl , ϑl }l=1...L . The result of localization algorithm onto the source. These dependencies
each correlation will be stated as in (17), where Anm,s is a are also subject of interest. Investigations on the performance
(NxM)-Matrix (N source positions and M sampling points) are also in relation to the structural characteristics of the
and denotes the synthesized spectrum. Anm is a vector of microphone array and therefor the limits of the aperture have
length M and denotes the measured spectra. The highest to be taken into account.
correlation value of the AnmM AT CH -vector for a successful
source localization includes the sought angle pair (seen in
Figure 1). 3. Measurement Setup
Both measurements were carried out at the Institute of Elec-
AH nm,s Anm
AnmM AT CH = q (17) tronic Music and Acoustics (IEM) in the ”IEM-CUBE” in
AH H
nm,s Anm,s Anm Anm Graz. The CUBE is an 10.3 x 12 x 4.8 m large room used
as a lab, for concerts and also lectures (reverberation time
RT60 ≈ 0.7s, rH = 1.9m). For the first setup the task was to
minimize the room influences and gain free field conditions.
1
N=4, complex Therefore a radius at rq = 0.7m was chosen and a loudspeaker
0.9 N=4, abs with nearly spherical radiation, one acoustic center and linear
0.8
frequency response was used (see Figure 2, Figure 3 shows
Magnitude − Correlation vector

the measurement chain). The different source positions of the


0.7 loudspeaker were precisely set up with the optitrack tracking
0.6 system. The microphone array was placed on a turntable and
controlled via ethernet to prevent changes in orientation.
0.5
The first measurement was taken in 10◦ steps in azimuth and
0.4 elevation. The microphone was positioned at a height of
1.4m. The measurement was reduced to 17 positions in the
0.3
elevation because of the difficulty to position the loudspeaker
0.2 underneath the array and the fact that the microphone array
0.1
shows identical properties at top and bottom, resulting in a
set of 612 source positions. The second measurement was
0
[0°, 0°] [0°,40°] [0°,80°] [0°,120°] [0°,160°]
taken at the same site but at a distance of 1.5m (rH = 1.9m).
Source directions (phi, theta) The loudspeaker setup this time was a fixed system of 16
loudspeaker on a metallic ring, where no tracking of the
Figure 1: Matching profile AnmM AT CH for 4kHz over all source source position was necessary. This set holds 480 source
positions - red: absolute vector, blue: complex vector, sought positions and was taken in 12◦ steps in azimuth and 11.25◦
direction at phi=0◦ ,theta=90◦ , the x-axis shows all 612 source steps in elevation.
positions starting from the zenith

5
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

4. Performance Evaluation
In Figure 4 we can see the overall performance of the
proposed ASL algorithm, of course for the ideal case, where
the free field condition is fulfilled. In this ideal case the
algorithm still works beyond the aliasing frequency, because
only little dynamic for a correct localization is necessary - no
interferences are predominant. The same phenomenon occurs
for very low frequencies. The matching vector has a low
dynamic, but still enough to separate the sought direction.
In case of low dynamics we included the phase term which
should yield an improvement in the source localization. Due
to the fact that the for low frequencies the amplitudes of
adjacent capsules are a minimum an improvement is desired.
Figure 2: Measurement setup, left: measurement sketch, right: The empirical evaluation shows that including the phase
image of tracking and positioning the loudspeaker
term also leads to an improvement of the algorithm for the
rest of the interested frequency range (cf. Figure 1 and
Figure 7). For the investigation of harsher conditions a second
measurement set is evaluated. The radial distance to the
source positions is more than double the distance for the first
arrangement. This should enable the comparison of the ideal
case with a case at a source radius with higher room influences
and give an overview of the performance of the algorithm (see
Sec.4.4).

40
falias
35

30
angular deviation [°]

25

20

Figure 3: Measurement chain, arrangement of equipment, except


15
the tracking system the noisy equipment was located outside, to
minimize the noise influence of the tracking system fans a mobile
10
soundproofing was used
5
3.1. Limitation due to Measurement Arrange-
ment and Array Structure 172 1378 2584 3790 4996 6202 7407
frequency in Hz
On the one hand we experience an upper frequency limit,
namely the aliasing frequency (can be seen as the spatial Figure 4: boxplot of the angular deviation for all source positions
nyquist frequency), which is depending on the array spac- over frequency for 612pt synthesis matrix
ing. The Eigenmike consists of 32 microphone capsules
and is therefore an oversampled 4th order holographic array
(samplingpoints ≡ (N + 1)2 ). The aliasing frequency can
4.1. Reducing the Synthesis Matrix
be expressed as follows: The idea of reducing the synthesis matrix is to generate a
N ∗c lookup-table for source localization with minimum computa-
falias = = 5.2kHz. (18) tional costs. The microphone array is a 4th order system with
rmic ∗ 2 ∗ π
32 microphone capsules (to be precise, an overdetermined 4h
The lower frequency limitation is for practical reasons of the order system, (N + 1)2 < 32) and it is obvious that dividing
measurement setup. If we consider a loudspeaker at a radial the sphere by the main lobe of a 4th order beam should give
distance of 0.7m, the lower frequency bound can be expressed us the minimal sampling points necessary to still take the
as (free field condition): advantages of the HOA system. The approach leads us to a
2πf minimal synthesis set. If the synthesis matrix is thinned to this
r>2 (19) 25 points, the exact localization of the acoustic source is not
c
possible anymore. The actual position will be automatically
flower > 155.97Hz (20)
mapped to the nearest neighbor in the lookup-table. With
The loudspeaker’s frequency response shows the -3dB mark this knowledge and by knowing the 4th order beam angle at
for low frequencies at 150Hz. -3dB [6](cf.(21)), we can assume that the median in Figure 5

6
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

almost lies at the half of the beam width. Because the 4.2. Dependence of Spherical Harmonics Order
arrangement of the microphones does not correspond to a and Phase Information
platonic solid, a deviation of the mean value away from half
of the beamwidth can be expected. The deviation might be As we can see in Figure 1, including the phase information
approximated via the ratio of the order to the number of actual of the recorded signals yields a much higher dynamic. The
microphone capsules (µapprox = 14.6◦ ). phase information lies in the time difference of arrival at
the capsules. The impulse responses are always calculated
187◦ relatively to each other. In the case of the matching profile for
γbeam = = 37.4◦ . (21) 4 kHz we see a dynamic four times higher than without the
N +1
complex matching approach.
If the synthesis matrix is thinned out to 49 points (γbeam = If we have a look at the spherical harmonics order, we can
26.7◦ ) the median deviation is reduced to 11◦ (cf. Figure 6). see a reduction of ambiguity for higher order calculations
For the ideal case the algorithm is feasible beyond the alias in the matching profile which can not only be seen as an
frequency as long as the magnitudes of the spatial aliasing enhancement of dynamics, but also as an improvement for
beams are smaller than the one in the sought direction. We robust vector interpolation (cf. Figure 7). As already stated
can state that at least the reduction of the synthesis grid is an increase of accuracy for low frequencies is also given. The
feasible and leads also to a reduction of computational costs. higher the order, the smaller the side lobes of the spherical
Nevertheless the intrinsic error increases with reduction of the beam, which means also less ambiguities.
lookup-table.

35
falias 1
N=4, complex
30 0.9 N=1, complex

0.8
Magnitude − Correlation vector

25
angular deviation [°]

0.7
20
0.6

15 0.5

0.4
10
0.3
5
0.2

0 0.1
172 1378 2584 3790 4996
frequency in Hz 0
[0°, 0°] [0°, 40°] [0°, 80°] [0°,120°] [0°,160°]
Source directions (phi, theta)
Figure 5: boxplot of the angular deviation for all source positions
over frequency for 25pt synthesis matrix
Figure 7: Matching profile AnmM AT CH for 4kHz over all source
positions - red: N=1, blue dashed: N=4, sought direction at
phi=0◦ ,theta=90◦
40

35

30 4.3. Vector Interpolation Approach


angular deviation in degree

25 In order to make the algorithm more robust concerning ASL,


an approach of calculating the location via the vectorially
20 summation of the strongest three components of the correla-
tion vector (matching profile) is proposed. Two approaches
15 were made: the first one calculates the sought source direction
via the normalized summation, whereas by the second one the
10 median(Synthset 25pt) value of the component in the correlation vector is used as
max error(Synthset 25pt)
5 median(Synthset 49pt) a weight. The position of the value in the correlation vector
max error(Synthset 49pt) holds the information about the azimuth and elevation angle.
falias
0 Figure 8 depicts the reduction of the angle deviation for a
172 2584 4996 7407 reduced synthesis set of 25 points. A vector interpolation
frequency in Hz
for high density synthesis grids seems due to the minimal
Figure 6: angular deviation and maximum deviation for all source
deviation to the nearest neighbor not necessary at the moment.
positions over frequency for 25pt synthesis matrix and 49pt For realtime applications the interpolation has to be further
evaluated.

7
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

40 40 30
median(single loc value deviation angle measurement 1 28
35 median(vec interp) deviation angle measurement 2 26
median(vector interp norm) D/R−ratio measurement 1 24
f 30 D/R−ratio measurement 2

deviation error in degree


alias 22
30
angular deviation in degree

20

D/R−ratio in dB
25 18
20 16
20 14
12
15 10
10 8
10 6
4
5 2
0 0
128 256 512 1024 2048 4096 8192 16384
0
samples
172 1378 2584 3790 4996 6202 7407 8613
frequency in Hz
Figure 9: direct sound to reverberant sound ratio versus the window
length. For the first measurement set a radial distance of rq =
Figure 8: median angular deviation for all source positions
0.7m toward the acoustic sources was chosen. The course of the
over frequency for 25 point synthesis set in comparison of two
D/R ratio for the first set shows only a 16dB range due to the
vector interpolation approaches. The normalized approach is only
measurement arrangement. Because examining the algorithm in
a vectorially summation of the three highest correlation values,
a free field situation the room influences are a minimum. For
whereas the second interpolation approach uses corresponding
the second measurement set less damping material is used and the
correlation dependent weighting
distance of the radial arranged sources is almost the critical distance,
rH = 1.9m. Higher window lengths are in direct relation with a
4.4. D/R-Ratio - Direct-to-Reverberant Sound higher amount of room influences which is still visible in the plot.
To have a glance at a real performance situation and the
impact on the accuracy of the algorithm, different window radialfilter, which focuses the spectrum at a specific source
lengths were considered. The ratio of the direct sound distance. The higher the order of the spherical harmonics, the
field to the diffuse sound field was chosen as an adequate higher the amplification of the associated frequency compo-
parameter (cf.(22)) to evaluate every source position based nent gets (cf. Figure 10). If we consider different focusing
on a the predominant sound field and not only in relation (also wrong focusing) for one and the same source position, it
to the source distance. The window length of the direct is possible to examine the impact on the source localization.
sound was chosen with 128 samples, based on an evaluation The error angle is in direct relation with focusing on a source
of the impulse responses. Figure 9 depicts that the longer position via the radial filter (cf. Figure 11). Problems occur
the overall window length is chosen, the higher the deviation if the algorithm focuses on a source distance closer than the
error gets and the D/R-ratio decreases. It can be seen as actual radius. If further away focused, no increase in the
similar to changing the distance toward the source position. angular deviation is observed. As a limitation on focusing
If we increase the distance between the microphone array and on a source the amplification of the radial filter has to be
the acoustical source, the influences of disturbance increase, considered. It is not feasible to amplify a signal more than
therefore the D/R-ratio decreases. The evaluation was made 60dB due to the dynamic range of the microphone array.
for both measurement sets (measurement set 1, rq = 0.7m Therefor a maximum distance in regarding of the radial filter
and measurement set 2, rq = 1.5m). As we can see in limits the focus.
Figure 9 the increase of the distance yields a larger angle
deviation for higher window lengths. Only for short windows
(128 and 256 samples) very high accuracy with the ASL
5. Conclusion
algorithm is possible to achieve. Due to the fact that for the First of all, the evaluation showed the feasibility of the
second measurement set the source radius was almost at the algorithm for free field conditions. For this specific conditions
critical distance the angle error increases much faster. The the algorithm yields in the whole frequency range of interest
influences of less damping material on the floor for the second (up to 5.2kHz) excellent results. Especially including the
measurement can not be neglected. phase term in the source localization enhances the dynamic
range extremely. The approach of using a higher order
10log( s2dir )
P
D/R = . (22) ambisonics system yields much more information about the
10log( s2rev )
P
source and direction than a conventional approach. Second
of all, is shown that it is possible to reduce the synthesis
grid in order to cut back computational costs. Nevertheless,
4.5. Dependence of the Radial Filter
an increase of the intrinsic deviation error is the result. If
In Section 2.1 a method to calculate a synthesized spectrum the synthesized matrix is cropped to a minimum, a vector
was treated. Equation (12) consists of two multiplicands, interpolation yields a very decent enhancement, depending
whereas the second one states the spherical harmonics at on the frequency, of up to 10 degree. Of course the best
specific source position. The first fractional term states the result in respect of the deviation error angle yields the source

8
Proceedings of ICSA 2014 in Erlangen --- ISBN 978-3-98 12830-4-4

160
N=0 (rq=0.7) References
N=1 (rq=0.7)
140
N=2 (rq=0.7) [1] Tervo, S.: Direction estimation based on sound intensity
N=3 (rq=0.7)
120 N=4 (rq=0.7)
vectors. 17th European Signal Processing Conference,
N=0 (rq=1.5) Scotland (2009), 700-704
100 N=1 (rq=1.5)
magnitude in dB

N=2 (rq=1.5) [2] Freiberger, K. and Sontacchi, A.: Similarity-based sound


N=3 (rq=1.5)
80
N=4 (rq=1.5) source localization with a coincident microphone array.
Proc. of the 14th Int. Conference on Digital Audio Effects
60
(DAFx-11), France (2011), 185-190
40
[3] Earl G. Williams. Fourier Acoustics: Sound Radiation
20 and Near- field Acoustical Holography. Academic Press,
1999
0
[4] Plessas, P.: Rigid sphere microphone arrays for spatial
−20 2 3 4 recording and holography. Master’s thesis, Institute of
10 10 10
frequency in Hz Electronic Music and Acoustics University of Music and
Performing Arts Graz, Austria (2009)
Figure 10: radial filter for the first measurement set, rq = 0.7m and
rq = 1.5m, the radial filter show extremely high amplification for [5] Hohl, F.: Kugelmikrofonarray zur Abstrahlungsvermes-
higher orders in the lower frequency range. Especially if a noise is sung von Musikinstrumenten. MasterÕs thesis, Institute
added in the signal, the high amplification also enlarges the unwanted of Electronic Music and AcousticsUniversity of Music
signal components. and Performing Arts Graz, Austria (2009)
40 [6] Zotter, F.: Analysis and Synthesis of Sound-Radiation
without radial filter with Spherical Arrays. PhD thesis, Institute of Electronic
rq=0.07m
35
rq=0.7m
Music and Acoustics University of Music and Performing
rq=1.5m Arts, Austria (2009)
30 rq=20m
angular deviation in degree

falias
25

20

15

10

0
172 775 1378 1981 2584 3187 3790 4393 4996 5599
frequency in Hz

Figure 11: angular deviation for one source position in relation to


the focusing of the radial filter, rq = 0.7m. If no radial filter is used,
the source localization is not feasible. For focusing on sources nearer
than the actual source distance a higher deviation can be observed.
On the other hand, if focusing on source positions farther away than
the actual position, the source localization is stable.

localization algorithm if the synthesis matrix is maximized.


Furthermore, it is shown that changing the window size and
also changing the radial distance of the acoustic source is
directly connected to the performance of the localization.
The more influences of the room is omitted, the better the
algorithm works. Especially room reflections with directional
information impair the results. Another treated subject is the
impact of focusing on a specific radial distance and the result
of wrong focusing. If the source is closer focused than actual
located, the deviation error increases for low frequencies
noticeable. On the other hand for focusing on a positions
further away as the actual source position neglectable errors
occur. However, it is not feasible to focus on infinity, because
this would yield to high amplification for all frequency terms.

9
View publication stats

You might also like