Real-time Speech and Music Classification by Large Audio Feature Space Extraction

This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework c...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριος συγγραφέας: Eyben, Florian (Συγγραφέας)
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Cham : Springer International Publishing : Imprint: Springer, 2016.
Σειρά:Springer Theses, Recognizing Outstanding Ph.D. Research,
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 03044nam a22005415i 4500
001 978-3-319-27299-3
003 DE-He213
005 20170518013254.0
007 cr nn 008mamaa
008 151224s2016 gw | s |||| 0|eng d
020 |a 9783319272993  |9 978-3-319-27299-3 
024 7 |a 10.1007/978-3-319-27299-3  |2 doi 
040 |d GrThAP 
050 4 |a TK5102.9 
050 4 |a TA1637-1638 
050 4 |a TK7882.S65 
072 7 |a TTBM  |2 bicssc 
072 7 |a UYS  |2 bicssc 
072 7 |a TEC008000  |2 bisacsh 
072 7 |a COM073000  |2 bisacsh 
082 0 4 |a 621.382  |2 23 
100 1 |a Eyben, Florian.  |e author. 
245 1 0 |a Real-time Speech and Music Classification by Large Audio Feature Space Extraction  |h [electronic resource] /  |c by Florian Eyben. 
264 1 |a Cham :  |b Springer International Publishing :  |b Imprint: Springer,  |c 2016. 
300 |a XXXVIII, 298 p. 41 illus., 39 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Springer Theses, Recognizing Outstanding Ph.D. Research,  |x 2190-5053 
505 0 |a Abstract -- Introduction -- Acoustic Features and Modelling -- Standard Baseline Feature Sets -- Real-time Incremental Processing -- Real-life Robustness -- Evaluation -- Discussion and Outlook -- Appendix -- Mel-frequency Filterbank Parameters. 
520 |a This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is not only intended as a manual for openSMILE users, but also and primarily as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions. 
650 0 |a Engineering. 
650 0 |a User interfaces (Computer systems). 
650 0 |a Computational linguistics. 
650 0 |a Acoustical engineering. 
650 1 4 |a Engineering. 
650 2 4 |a Signal, Image and Speech Processing. 
650 2 4 |a User Interfaces and Human Computer Interaction. 
650 2 4 |a Engineering Acoustics. 
650 2 4 |a Computational Linguistics. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9783319272986 
830 0 |a Springer Theses, Recognizing Outstanding Ph.D. Research,  |x 2190-5053 
856 4 0 |u http://dx.doi.org/10.1007/978-3-319-27299-3  |z Full Text via HEAL-Link 
912 |a ZDB-2-ENG 
950 |a Engineering (Springer-11647)