RE-SPEECH RECOGNITION WITH EXTERNAL DATA SOURCES Russian patent published in 2019 - IPC G10L15/19 G10L15/02 G10L15/183

Abstract RU 2688277 C1

FIELD: data processing.

SUBSTANCE: invention relates to means for producing transcription of a speech fragment. An initial version of the transcription of the speech fragment is obtained using an automated speech recognizer. Based on the language model not used by the automated speech recognizer when generating the original transcription, one or more terms which are phonetically similar to one or more terms which are already present in the original transcription version are identified. At the same time determination, whether phonetically terms are similar, provide definition of measure of similarity and comparison of measure with threshold, or determination, whether measure of similarity of measure of similarity relative to other pairs of terms exceeds. One or more additional transcription variants are generated based on the identified one or more terms. Transcription is selected from transcription variants.

EFFECT: technical result consists in improvement of accuracy of transcription of speech fragment.

20 cl, 3 dwg

Similar patents RU2688277C1

Title	Year	Author	Number
SPEECH RECOGNITION METHOD BASED ON TWO-LEVEL MORPHOPHONEMIC PREFIX GRAPH	2015	Ronzhin Andrej Leonidovich Karpov Aleksej Anatolevich	RU2597498C1
METHOD AND DEVICE FOR VOICE RECOGNITION	2006	Ol'Sen Esper	RU2393549C2
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION	2016	Li, Bo Weiss, Ron J. Bacchiani, Michiel A.U. Sainath, Tara N. Wilson, Kevin William	RU2698153C1
RECOGNITION ARCHITECTURE FOR GENERATING ASIAN HIEROGLYPHS	2008	Ko Shijun'-Tszu Fejg Kevin Eh. Gun Ifan' Miva Taro Chitrapu Arun	RU2477518C2
SPEECH SYNTHESIS METHOD	2009	Khitrov Mikhail Vasil'Evich	RU2421827C2
SYSTEM FOR VERIFYING THE SPEAKING PERSON IDENTITY	1996	Mehmmon Richard Dzh. Farrel Kevin Sharma Mehnish Divehng Nejk Zang Zjaoju Assalekh Khaled Leu Khan-Sheng	RU2161336C2
UNIVERSAL ORTHOGRAPHIC SYMBOLIC CIRCUITS	2005	Kel'Ba Chiprian I. Chambers Robert L. Movatt David U Tsjan	RU2441287C2
METHOD OF IDENTIFYING SPEAKER FROM ARBITRARY SPEECH PHONOGRAMS BASED ON FORMANT EQUALISATION	2009	Koval' Sergej L'Vovich	RU2419890C1
METHOD FOR PRELIMINARY PROCESSING OF TEXT	2007	Gusev Mikhail Nikolaevich Egorova Ol'Ga Borisovna Smirnov Valentin Aleksandrovich	RU2386178C2
METHOD AND DEVICE FOR PROVIDING A TEXT MESSAGE	2004	Chzhan Jasin' Kheh Sin' Zhehn' Sjao-Lin' Sun' Fan	RU2320082C2

RU 2 688 277 C1

Authors

Strohman, Trevor D.

Schalkwyk, Johan

Skobeltsyn, Gleb

Dates

2019-05-21—Published

2016-11-18—Filed