Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis

The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions).  A high level of quality has already been ac...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Άλλοι συγγραφείς: Hirose, Keikichi (Επιμελητής έκδοσης), Tao, Jianhua (Επιμελητής έκδοσης)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2015.
Σειρά:Prosody, Phonology and Phonetics,
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 04071nam a22005055i 4500
001 978-3-662-45258-5
003 DE-He213
005 20151030201133.0
007 cr nn 008mamaa
008 150225s2015 gw | s |||| 0|eng d
020 |a 9783662452585  |9 978-3-662-45258-5 
024 7 |a 10.1007/978-3-662-45258-5  |2 doi 
040 |d GrThAP 
050 4 |a P215-240 
072 7 |a CFH  |2 bicssc 
072 7 |a LAN011000  |2 bisacsh 
082 0 4 |a 414  |2 23 
245 1 0 |a Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis  |h [electronic resource] /  |c edited by Keikichi Hirose, Jianhua Tao. 
264 1 |a Berlin, Heidelberg :  |b Springer Berlin Heidelberg :  |b Imprint: Springer,  |c 2015. 
300 |a VIII, 213 p. 60 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Prosody, Phonology and Phonetics,  |x 2197-8700 
505 0 |a Modeling of Prosody -- ProZed: A speech prosody editor for linguists, using analysis-by-synthesis -- On degree of freedom in prosody modeling -- Extraction, analysis and synthesis of Fujisaki model parameters -- Probabilistic modeling of pitch contours towards prosody synthesis and conversion -- Para- and non-linguistic issues of prosody -- Communicative speech synthesis as pan-linguistic prosody control -- Mandarin stress analysis and prediction for speech synthesis -- Expressivity in interactive speech synthesis; some para-linguistic and non-linguistic issues of speech prosody for conversational dialogue systems -- Temporally variable multi-attribute morphing of arbitrarily many voices for exploratory research of speech prosody -- Control of prosody in speech synthesis -- Statistical models for dealing with discontinuity of fundamental frequency -- Use of generation process model for improved control of fundamental frequency contours in HMM-based speech synthesis -- Tone Nucleus Model for Emotional Mandarin Speech Synthesis -- Emphasis, word prominence, and continuous wavelet transform in the control of HMM based synthesis -- Exploiting alternatives for text-to-speech synthesis: from machine to human -- Prosody control and variation enhancement techniques for HMM-based expressive speech synthesis. 
520 |a The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions).  A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech.  Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style. Accordingly, speech conversion techniques are now of growing interest among researchers.  HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis.  . 
650 0 |a Linguistics. 
650 0 |a Phonology. 
650 0 |a Syntax. 
650 0 |a Communication. 
650 1 4 |a Linguistics. 
650 2 4 |a Phonology. 
650 2 4 |a Syntax. 
650 2 4 |a Signal, Image and Speech Processing. 
650 2 4 |a Communication Studies. 
700 1 |a Hirose, Keikichi.  |e editor. 
700 1 |a Tao, Jianhua.  |e editor. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9783662452578 
830 0 |a Prosody, Phonology and Phonetics,  |x 2197-8700 
856 4 0 |u http://dx.doi.org/10.1007/978-3-662-45258-5  |z Full Text via HEAL-Link 
912 |a ZDB-2-SHU 
950 |a Humanities, Social Sciences and Law (Springer-11648)