FIELD: physics, acoustics.
SUBSTANCE: invention relates to estimation of the location of a sound source using particle filtering, particularly estimation of the location of a sound source for a multimodal audio-visual communication application. A sound source location is estimated by particle filtering where the particles represent a probability density function for a state variable comprising the sound source location. The method includes determining the weight coefficient for a particle in response to correlation between estimated acoustic transfer functions from the sound source to at least two sound recording positions. A weight coefficient update function may specifically be determined deterministically from the correlation and thus the correlation may be used as a pseudo-likelihood function for the measurement function of the particle filtering. The acoustic transfer functions may be determined from an audio beamforming towards the sound source. The audio weight coefficient may be combined with a video weight coefficient to generate a multimodal particle filtering approach.
EFFECT: high adaptability, easier estimation of the location of the sound source while increasing accuracy and improving efficiency.
15 cl, 9 dwg
Title | Year | Author | Number |
---|---|---|---|
ACOUSTIC ECHO CANCELLATION CONTROL FOR DISTRIBUTED AUDIO DEVICES | 2020 |
|
RU2818982C2 |
DETERMINING POSITION OF AUDIO SOURCE | 2011 |
|
RU2565338C2 |
VOICE COMMUNICATION DEVICE, VOICE COMMUNICATION METHOD AND PROGRAM | 2018 |
|
RU2744518C1 |
AUDIO FORMAT TRANSCODER | 2010 |
|
RU2519295C2 |
METHOD AND DEVICE FOR CAPTURING AUDIO INFORMATION USING DIRECTIONAL DIAGRAM FORMATION | 2017 |
|
RU2760097C2 |
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION | 2016 |
|
RU2698153C1 |
MULTICHANNEL ACOUSTIC ECHO CANCELLATION | 2010 |
|
RU2546717C2 |
AUDIO PROCESSING METHOD AND DEVICE | 2014 |
|
RU2664717C2 |
ECHO DETECTION | 2006 |
|
RU2427077C2 |
METHOD FOR CONTACT-DIFFERENCE ACOUSTIC PERSONAL IDENTIFICATION | 2011 |
|
RU2451346C1 |
Authors
Dates
2014-04-10—Published
2009-12-11—Filed