NEURAL NETWORK BASED CLASSIFIER FOR SEPARATING AUDIO SOURCES FROM MONOPHONIC AUDIO SIGNAL

Russian patent published in 2011 - IPC G10L15/16

Abstract RU 2418321 C2

FIELD: physics.

SUBSTANCE: the method breaks the monophonic audio signal into baseline frames (possibly overlapping), windows the frames, extracts a number of descriptive features from each frame, and employs a pre-trained nonlinear neural network as a classifier. Each neural network output indicates the presence of a predetermined type of audio source in each baseline frame of the monophonic audio signal. The classifier outputs can be used as inputs to create multiple audio channels for a source separation algorithm (e.g., ICA) or as parameters in a post-processing algorithm (e.g., to categorise music, track sources, or generate audio indices for navigation, re-mixing, security and surveillance, telephone and wireless communications, and teleconferencing).
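
The frame, window, feature, and classifier stages described above can be sketched roughly as follows. This is only an illustration, not the patented implementation: the frame length, hop size, Hann window, the three features (log energy, spectral centroid, spectral flatness), the single hidden layer, and the randomly initialised weights standing in for a pre-trained network are all assumptions that the abstract does not specify.

```python
import numpy as np


def frame_signal(x, frame_len=1024, hop=512):
    """Split a mono signal into (possibly overlapping) baseline frames."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def extract_features(frames, sample_rate):
    """Window each frame and compute a few illustrative descriptive features."""
    windowed = frames * np.hanning(frames.shape[1])  # assumed window type
    mag = np.abs(np.fft.rfft(windowed, axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sample_rate)
    eps = 1e-12  # guards against division by zero and log(0)
    energy = np.log1p(np.sum(mag ** 2, axis=1))
    centroid = np.sum(mag * freqs, axis=1) / (np.sum(mag, axis=1) + eps)
    flatness = np.exp(np.mean(np.log(mag + eps), axis=1)) / (np.mean(mag, axis=1) + eps)
    # Scale the centroid by the Nyquist frequency so features have similar ranges.
    return np.stack([energy, centroid / (sample_rate / 2), flatness], axis=1)


def classify(features, w1, b1, w2, b2):
    """Nonlinear network: one tanh hidden layer, one sigmoid output per source type."""
    hidden = np.tanh(features @ w1 + b1)
    return 1.0 / (1.0 + np.exp(-(hidden @ w2 + b2)))


rng = np.random.default_rng(0)
sample_rate = 16000
signal = rng.standard_normal(sample_rate)  # placeholder: 1 s of noise, not real audio

frames = frame_signal(signal)
features = extract_features(frames, sample_rate)

# Hypothetical weights; a real system would load weights learned during training.
n_sources = 4
w1, b1 = rng.standard_normal((3, 8)), np.zeros(8)
w2, b2 = rng.standard_normal((8, n_sources)), np.zeros(n_sources)

# presence[i, k] is the network's score in [0, 1] for source type k in frame i.
presence = classify(features, w1, b1, w2, b2)
```

Using one independent sigmoid per output, rather than a softmax, lets several source types be flagged in the same frame, which matches the abstract's description of each output indicating the presence of one source type.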

EFFECT: the neural network classifier is well suited to handling widely varying parameters of the signal and sources, overlap of the sources in the time and frequency domains, and reverberation and occlusions in real-life signals.

28 cl, 14 dwg

Similar patents RU2418321C2

Title Year Author Number
AUDIO PROCESSING SYSTEM 2014
  • Cherling, Kristofer
  • Purnkhagen, Khejko
  • Villemoes, Lars
RU2625444C2
METHOD AND APPARATUS FOR PROCESSING AUDIO SIGNAL FOR SPEECH ENHANCEMENT USING REQUIRED FEATURE EXTRACTION FUNCTION 2009
  • Ule Kristian
  • Khellmut Oliver
  • Grill Bernkhard
  • Ridderbush Falko
RU2507608C2
AUDIO BANDWIDTH EXTENSION BY INSERTION OF TEMPORAL PRE-SHAPED NOISE IN FREQUENCY DOMAIN 2014
  • Dish Sasha
  • Multrus Markus
  • Shubert Benyamin
  • Shnell Markus
RU2666468C2
AUDIO OR VIDEO ENCODER, AUDIO OR VIDEO DECODER AND RELATED METHODS OF PROCESSING MULTI-CHANNEL AUDIO OR VIDEO SIGNALS USING VARIABLE PREDICTION DIRECTION 2011
  • Robijjar Zhjul'En
  • Nojzinger Mattias
  • Khel'Mrikh Kristian
  • Khil'Pert Jokhannes
  • Rettel'Bakh Nikolaus
  • Dish Sasha
  • Ehdler Bernd
RU2541864C2
METHOD AND DEVICE FOR OBTAINING SPECTRAL COEFFICIENTS FOR REPLACEMENT AUDIO FRAME, AUDIO DECODER, AUDIO RECEIVER AND AUDIO SYSTEM FOR AUDIO TRANSMISSION 2014
  • Sukovski Dzhanin
  • Shpershnajder Ralf
  • Markovich Goran
  • Egers Volfgang
  • Khelmrikh Kristian
  • Edler Bernd
  • Gajger Ralf
RU2632585C2
METHODS AND SYSTEMS FOR EFFICIENT RECOVERY OF HIGH FREQUENCY AUDIO CONTENT 2013
  • Tezing Robin
  • Shug Mikhael
RU2601188C2
GENERATION OF SCATTERED SOUND FOR BINAURAL CODING CIRCUITS USING KEY INFORMATION 2005
  • Allamankhe Ehrik
  • Dish Sasha
  • Faller Kristof
  • Kherre Jurgen
RU2384014C2
SYSTEMS AND METHODS OF CHANGING WINDOW WITH FRAME, ASSOCIATED WITH AUDIO SIGNAL 2007
  • Krishnan Venkatesh
  • Kandkhadaj Anantkhapadmanabkhan
RU2418323C2
DEVICE AND METHOD OF GENERATING EXPANDED SIGNAL USING INDEPENDENT NOISE FILLING 2015
  • Disch, Sascha
  • Geiger, Ralf
  • Niedermeier, Andreas
  • Neusinger, Matthias
  • Schmidt, Konstantin
  • Wilde, Stephan
  • Schubert, Benjamin
  • Neukam, Christian
RU2665913C2
SYSTEM AND METHOD FOR EXCHANGING SIGNALS OF AUDIO-VISUAL INFORMATION 2002
  • Rejnol'Dz Dzhodi Linn
  • Ingrehkhem Robert Uolter
RU2282888C2

RU 2 418 321 C2

Authors

Shmunk Dmitrij V.

Dates

2011-05-10 Published

2006-10-03 Filed