FIELD: physics.
SUBSTANCE: acoustic space model is trained on the basis of the training speech attribute data using deep neural networks to determine the interdependence factors between the speech attributes in the training data. The deep neural network creates a single continuous acoustic spatial model based on the interdependence factors. Acoustic spatial model, thus, takes into account many interdependent speech attributes and gives the ability to simulate a continuous spectrum of the interdependent speech attributes. Further, there is a text receipt; receiving selection of one or more speech attributes, wherein each speech attribute has a weight of the selected attribute. The text is converted to the synthesized speech using the acoustic space model, and the synthesized speech has a selected speech attribute. The synthesized speech is output as audio having the selected speech attribute.
EFFECT: increasing the human voice naturalness in the synthesized speech.
14 cl, 4 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD FOR SPEECH SYNTHESIS WITH TRANSMISSION OF ACCURATE INTONATION OF THE CLONED SAMPLE | 2020 |
|
RU2754920C1 |
METHOD AND SYSTEM FOR SPEECH SYNTHESIS FROM TEXT | 2017 |
|
RU2692051C1 |
METHOD AND SYSTEM FOR GENERATING TEXT REPRESENTATION OF USER'S SPEECH FRAGMENT | 2019 |
|
RU2731334C1 |
METHODS AND ELECTRONIC DEVICES FOR DETERMINATION OF INTENT ASSOCIATED WITH UTTERED UTTERANCE OF USER | 2018 |
|
RU2711153C2 |
METHOD FOR TRANSCRIBING SPEECH FROM DIGITAL SIGNALS WITH LOW-RATE CODING | 2023 |
|
RU2801621C1 |
METHOD AND SYSTEM FOR RECOGNIZING USER'S SPEECH FRAGMENT | 2021 |
|
RU2808582C2 |
RE-SPEECH RECOGNITION WITH EXTERNAL DATA SOURCES | 2016 |
|
RU2688277C1 |
METHOD OF RE-SOUNDING AUDIO MATERIALS AND APPARATUS FOR REALISING SAID METHOD | 2012 |
|
RU2510954C2 |
METHOD FOR ASSESSING VARIABILITY OF A PASS PHRASE (VERSIONS) | 2013 |
|
RU2598314C2 |
ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION | 2016 |
|
RU2698153C1 |
Authors
Dates
2017-10-04—Published
2015-09-29—Filed