RECOGNIZING OF MIXED SPEECH Russian patent published in 2019 - IPC G10L15/06 G10L15/16 G10L15/20 

Abstract RU 2686589 C2

FIELD: data processing.

SUBSTANCE: invention relates to means of recognizing mixed speech. First neural network is trained to recognize a speech signal pronounced by a speaker with a higher level of speech characteristics from a sample of mixed speech. Second neural network is trained to recognize a speech signal pronounced by a speaker with a lower level of speech characteristics from a sample of mixed speech. Mixed speech sample is decoded by a first neural network and a second neural network by optimizing the combined probability of observing said two speech signals, where the combined probability means the probability that a particular frame is a switching point of the speech characteristic. Third neural network is taught to predict switching of the speech characteristic. Mixed speech sample is decoded based on said prediction.

EFFECT: high accuracy of recognizing mixed speech.

15 cl, 5 tbl, 6 dwg

Similar patents RU2686589C2

Title Year Author Number
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION 2016
  • Li, Bo
  • Weiss, Ron J.
  • Bacchiani, Michiel A.U.
  • Sainath, Tara N.
  • Wilson, Kevin William
RU2698153C1
DEVICE, METHOD, OR COMPUTER PROGRAM FOR GENERATING AN EXTENDED-BAND AUDIO SIGNAL USING A NEURAL NETWORK PROCESSOR 2018
  • Schmidt, Konstantin
  • Uhle, Christian
  • Edler, Bernd
RU2745298C1
TRAINING OF DNN-STUDENT BY MEANS OF OUTPUT DISTRIBUTION 2014
  • Chzhao Zhuj
  • Khuan Tszyuji-Tin
  • Li Tszinyuj
  • Gun Ifan
RU2666631C2
SYSTEM FOR VERIFYING THE SPEAKING PERSON IDENTITY 1996
  • Mehmmon Richard Dzh.
  • Farrel Kevin
  • Sharma Mehnish
  • Divehng Nejk
  • Zang Zjaoju
  • Assalekh Khaled
  • Leu Khan-Sheng
RU2161336C2
METHOD AND APPARATUS FOR DEFINING A DEEP FILTER 2020
  • Habets, Emanuel
  • Mack, Wolfgang
RU2788939C1
METHOD AND EQUIPMENT FOR RECOGNIZING EMOTIONS IN SPEECH 2019
  • Chzhan, Yan
  • Li, Tsyan
  • Verkholyak, Oksana
  • Karpov, Aleksej
RU2720359C1
METHOD AND DEVICE FOR INCREASING SPEECH INTELLIGIBILITY USING SEVERAL SENSORS 2004
  • Asero Alekhandro
  • Droppo Dzhejms G.
  • Deng Li
  • Sinkler Majkl Dzh.
  • Khuang Ksuedong Dehvid
  • Chzhehn Janli
  • Zhang Zhenzhiou
  • Liu Zicheng
RU2373584C2
SYSTEM AND METHOD OF CONVERTING VOICE SIGNAL INTO TRANSCRIPT PRESENTATION WITH METADATA 2014
  • Kneller Emmanuil Grigorevich
  • Karaulnykh Denis Vladimirovich
RU2589851C2
METHOD FOR HYBRID GENERATIVE-DISCRIMINATIVE SEGMENTATION OF SPEAKERS IN AUDIO-FLOW 2013
  • Khitrov Mikhail Vasil'Evich
  • Pekhovskij Timur Sakhievich
  • Shulipa Andrej Konstantinovich
RU2530314C1
METHOD AND DEVICE FOR VOICE RECOGNITION 2006
  • Ol'Sen Esper
RU2393549C2

RU 2 686 589 C2

Authors

Yu, Dong

Weng, Chao

Seltzer, Michael L.

Droppo, James

Dates

2019-04-29Published

2015-03-19Filed