Quality of Synthetic Speech Perceptual Dimensions, Influencing Factors, and Instrumental Assessment /

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and int...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριος συγγραφέας: Hinterleitner, Florian (Συγγραφέας)
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Singapore : Springer Singapore : Imprint: Springer, 2017.
Σειρά:T-Labs Series in Telecommunication Services,
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 02861nam a22004935i 4500
001 978-981-10-3734-4
003 DE-He213
005 20170408150601.0
007 cr nn 008mamaa
008 170408s2017 si | s |||| 0|eng d
020 |a 9789811037344  |9 978-981-10-3734-4 
024 7 |a 10.1007/978-981-10-3734-4  |2 doi 
040 |d GrThAP 
050 4 |a TK5102.9 
050 4 |a TA1637-1638 
050 4 |a TK7882.S65 
072 7 |a TTBM  |2 bicssc 
072 7 |a UYS  |2 bicssc 
072 7 |a TEC008000  |2 bisacsh 
072 7 |a COM073000  |2 bisacsh 
082 0 4 |a 621.382  |2 23 
100 1 |a Hinterleitner, Florian.  |e author. 
245 1 0 |a Quality of Synthetic Speech  |h [electronic resource] :  |b Perceptual Dimensions, Influencing Factors, and Instrumental Assessment /  |c by Florian Hinterleitner. 
264 1 |a Singapore :  |b Springer Singapore :  |b Imprint: Springer,  |c 2017. 
300 |a XVI, 157 p. 29 illus.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a T-Labs Series in Telecommunication Services,  |x 2192-2810 
505 0 |a Introduction -- Speech Synthesis -- Auditory and Instrumental Quality Evaluation Metrics -- Perceptual Quality Dimensions -- Influencing Factors on Perceptual Quality -- Instrumental Quality Assessment -- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- Conclusions. 
520 |a This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined. 
650 0 |a Engineering. 
650 0 |a User interfaces (Computer systems). 
650 1 4 |a Engineering. 
650 2 4 |a Signal, Image and Speech Processing. 
650 2 4 |a User Interfaces and Human Computer Interaction. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9789811037337 
830 0 |a T-Labs Series in Telecommunication Services,  |x 2192-2810 
856 4 0 |u http://dx.doi.org/10.1007/978-981-10-3734-4  |z Full Text via HEAL-Link 
912 |a ZDB-2-ENG 
950 |a Engineering (Springer-11647)