METHOD AND SERVER FOR SPEECH SYNTHESIS IN TEXT Russian patent published in 2017 - IPC G10L13/08 G10L15/16 G10L15/06

Abstract RU 2632424 C2

FIELD: physics.

SUBSTANCE: acoustic space model is trained on the basis of the training speech attribute data using deep neural networks to determine the interdependence factors between the speech attributes in the training data. The deep neural network creates a single continuous acoustic spatial model based on the interdependence factors. Acoustic spatial model, thus, takes into account many interdependent speech attributes and gives the ability to simulate a continuous spectrum of the interdependent speech attributes. Further, there is a text receipt; receiving selection of one or more speech attributes, wherein each speech attribute has a weight of the selected attribute. The text is converted to the synthesized speech using the acoustic space model, and the synthesized speech has a selected speech attribute. The synthesized speech is output as audio having the selected speech attribute.

EFFECT: increasing the human voice naturalness in the synthesized speech.

14 cl, 4 dwg

Similar patents RU2632424C2

Title	Year	Author	Number
METHOD FOR SPEECH SYNTHESIS WITH TRANSMISSION OF ACCURATE INTONATION OF THE CLONED SAMPLE	2020	Tagunov Petr Vladimirovich Gonta Vladislav Aleksandrovich	RU2754920C1
METHODS AND SERVERS FOR TRAINING MODEL TO DETECT SPEAKER CHANGE	2024	Gritskevich Evgenii Marianovich	RU2841235C1
METHOD AND SYSTEM FOR SPEECH SYNTHESIS FROM TEXT	2017	Kirichenko Vladimir Vladimirovich Luferenko Petr Vladislavovich	RU2692051C1
METHOD AND SYSTEM FOR GENERATING TEXT REPRESENTATION OF USER'S SPEECH FRAGMENT	2019	Galustyan Sergey Surenovich Minkin Fedor Aleksandrovich	RU2731334C1
METHODS AND ELECTRONIC DEVICES FOR DETERMINATION OF INTENT ASSOCIATED WITH UTTERED UTTERANCE OF USER	2018	Karpukhin Ivan Aleksandrovich	RU2711153C2
METHOD FOR TRANSCRIBING SPEECH FROM DIGITAL SIGNALS WITH LOW-RATE CODING	2023	Aladinskij Viktor Alekseevich Kuzminskij Sergej Vladislavovich Pavlov Andrej Petrovich Smirnov Pavel Leonidovich	RU2801621C1
METHOD AND SYSTEM FOR RECOGNIZING USER'S SPEECH FRAGMENT	2021	Ershov Vasily Alekseevich Kuralenok Igor Evgenevich	RU2808582C2
RE-SPEECH RECOGNITION WITH EXTERNAL DATA SOURCES	2016	Strohman, Trevor D. Schalkwyk, Johan Skobeltsyn, Gleb	RU2688277C1
METHOD OF RE-SOUNDING AUDIO MATERIALS AND APPARATUS FOR REALISING SAID METHOD	2012	Bredikhin Aleksandr Jur'Evich	RU2510954C2
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION	2016	Li, Bo Weiss, Ron J. Bacchiani, Michiel A.U. Sainath, Tara N. Wilson, Kevin William	RU2698153C1

RU 2 632 424 C2

Authors

Edrenkin Ilya Vladimirovich

Dates

2017-10-04—Published

2015-09-29—Filed