METHOD AND SERVER FOR SPEECH SYNTHESIS IN TEXT Russian patent published in 2017 - IPC G10L13/08 G10L15/16 G10L15/06 

Abstract RU 2632424 C2

FIELD: physics.

SUBSTANCE: acoustic space model is trained on the basis of the training speech attribute data using deep neural networks to determine the interdependence factors between the speech attributes in the training data. The deep neural network creates a single continuous acoustic spatial model based on the interdependence factors. Acoustic spatial model, thus, takes into account many interdependent speech attributes and gives the ability to simulate a continuous spectrum of the interdependent speech attributes. Further, there is a text receipt; receiving selection of one or more speech attributes, wherein each speech attribute has a weight of the selected attribute. The text is converted to the synthesized speech using the acoustic space model, and the synthesized speech has a selected speech attribute. The synthesized speech is output as audio having the selected speech attribute.

EFFECT: increasing the human voice naturalness in the synthesized speech.

14 cl, 4 dwg

Similar patents RU2632424C2

Title Year Author Number
METHOD FOR SPEECH SYNTHESIS WITH TRANSMISSION OF ACCURATE INTONATION OF THE CLONED SAMPLE 2020
  • Tagunov Petr Vladimirovich
  • Gonta Vladislav Aleksandrovich
RU2754920C1
METHOD AND SYSTEM FOR SPEECH SYNTHESIS FROM TEXT 2017
  • Kirichenko Vladimir Vladimirovich
  • Luferenko Petr Vladislavovich
RU2692051C1
METHOD AND SYSTEM FOR GENERATING TEXT REPRESENTATION OF USER'S SPEECH FRAGMENT 2019
  • Galustyan Sergey Surenovich
  • Minkin Fedor Aleksandrovich
RU2731334C1
METHODS AND ELECTRONIC DEVICES FOR DETERMINATION OF INTENT ASSOCIATED WITH UTTERED UTTERANCE OF USER 2018
  • Karpukhin Ivan Aleksandrovich
RU2711153C2
METHOD FOR TRANSCRIBING SPEECH FROM DIGITAL SIGNALS WITH LOW-RATE CODING 2023
  • Aladinskij Viktor Alekseevich
  • Kuzminskij Sergej Vladislavovich
  • Pavlov Andrej Petrovich
  • Smirnov Pavel Leonidovich
RU2801621C1
METHOD AND SYSTEM FOR RECOGNIZING USER'S SPEECH FRAGMENT 2021
  • Ershov Vasily Alekseevich
  • Kuralenok Igor Evgenevich
RU2808582C2
RE-SPEECH RECOGNITION WITH EXTERNAL DATA SOURCES 2016
  • Strohman, Trevor D.
  • Schalkwyk, Johan
  • Skobeltsyn, Gleb
RU2688277C1
METHOD OF RE-SOUNDING AUDIO MATERIALS AND APPARATUS FOR REALISING SAID METHOD 2012
  • Bredikhin Aleksandr Jur'Evich
RU2510954C2
METHOD FOR ASSESSING VARIABILITY OF A PASS PHRASE (VERSIONS) 2013
  • Khitrov Mikhail Vasilevich
  • Dyrmovskij Dmitrij Viktorovich
RU2598314C2
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION 2016
  • Li, Bo
  • Weiss, Ron J.
  • Bacchiani, Michiel A.U.
  • Sainath, Tara N.
  • Wilson, Kevin William
RU2698153C1

RU 2 632 424 C2

Authors

Edrenkin Ilya Vladimirovich

Dates

2017-10-04Published

2015-09-29Filed