RE-SPEECH RECOGNITION WITH EXTERNAL DATA SOURCES Russian patent published in 2019 - IPC G10L15/19 G10L15/02 G10L15/183 

Abstract RU 2688277 C1

FIELD: data processing.

SUBSTANCE: invention relates to means for producing transcription of a speech fragment. An initial version of the transcription of the speech fragment is obtained using an automated speech recognizer. Based on the language model not used by the automated speech recognizer when generating the original transcription, one or more terms which are phonetically similar to one or more terms which are already present in the original transcription version are identified. At the same time determination, whether phonetically terms are similar, provide definition of measure of similarity and comparison of measure with threshold, or determination, whether measure of similarity of measure of similarity relative to other pairs of terms exceeds. One or more additional transcription variants are generated based on the identified one or more terms. Transcription is selected from transcription variants.

EFFECT: technical result consists in improvement of accuracy of transcription of speech fragment.

20 cl, 3 dwg

Similar patents RU2688277C1

Title Year Author Number
SPEECH RECOGNITION METHOD BASED ON TWO-LEVEL MORPHOPHONEMIC PREFIX GRAPH 2015
  • Ronzhin Andrej Leonidovich
  • Karpov Aleksej Anatolevich
RU2597498C1
METHOD AND DEVICE FOR VOICE RECOGNITION 2006
  • Ol'Sen Esper
RU2393549C2
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION 2016
  • Li, Bo
  • Weiss, Ron J.
  • Bacchiani, Michiel A.U.
  • Sainath, Tara N.
  • Wilson, Kevin William
RU2698153C1
SPEECH SYNTHESIS METHOD 2009
  • Khitrov Mikhail Vasil'Evich
RU2421827C2
RECOGNITION ARCHITECTURE FOR GENERATING ASIAN HIEROGLYPHS 2008
  • Ko Shijun'-Tszu
  • Fejg Kevin Eh.
  • Gun Ifan'
  • Miva Taro
  • Chitrapu Arun
RU2477518C2
SYSTEM FOR VERIFYING THE SPEAKING PERSON IDENTITY 1996
  • Mehmmon Richard Dzh.
  • Farrel Kevin
  • Sharma Mehnish
  • Divehng Nejk
  • Zang Zjaoju
  • Assalekh Khaled
  • Leu Khan-Sheng
RU2161336C2
UNIVERSAL ORTHOGRAPHIC SYMBOLIC CIRCUITS 2005
  • Kel'Ba Chiprian I.
  • Chambers Robert L.
  • Movatt David
  • U Tsjan
RU2441287C2
METHOD OF IDENTIFYING SPEAKER FROM ARBITRARY SPEECH PHONOGRAMS BASED ON FORMANT EQUALISATION 2009
  • Koval' Sergej L'Vovich
RU2419890C1
METHOD FOR PRELIMINARY PROCESSING OF TEXT 2007
  • Gusev Mikhail Nikolaevich
  • Egorova Ol'Ga Borisovna
  • Smirnov Valentin Aleksandrovich
RU2386178C2
METHOD AND DEVICE FOR PROVIDING A TEXT MESSAGE 2004
  • Chzhan Jasin'
  • Kheh Sin'
  • Zhehn' Sjao-Lin'
  • Sun' Fan
RU2320082C2

RU 2 688 277 C1

Authors

Strohman, Trevor D.

Schalkwyk, Johan

Skobeltsyn, Gleb

Dates

2019-05-21Published

2016-11-18Filed