國立虎尾科技大學 |

Dynamic Speech Models = Theory, Algorithms, and Applications /

紀錄類型:	書目-語言資料,印刷品 : Monograph/item
正題名/作者:	Dynamic Speech Models/ by Li Deng.
其他題名:	Theory, Algorithms, and Applications /
作者:	Deng, Li.
面頁冊數:	XI, 105 p.online resource. :
Contained By:	Springer Nature eBook
標題:	Electrical engineering. -
電子資源:	https://doi.org/10.1007/978-3-031-02555-6
ISBN:	9783031025556

Dynamic Speech Models = Theory, Algorithms, and Applications /
Deng, Li.

Dynamic Speech ModelsTheory, Algorithms, and Applications /[electronic resource] :by Li Deng. - 1st ed. 2006. - XI, 105 p.online resource. - Synthesis Lectures on Speech and Audio Processing,1932-1678. - Synthesis Lectures on Speech and Audio Processing,.

Introduction -- A General Modeling and Computational Framework -- Modeling: From Acoustic Dynamics to Hidden Dynamics -- Models with Discrete-Valued Hidden Speech Dynamics -- Models with Continuous-Valued Hidden Speech Trajectories.

Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation of a linguistic message in a speaker's brain and ends with the arrival of the message in a listener's brain. Given the intricacy of the dynamic speech process and its fundamental importance in human communication, this monograph is intended to provide a comprehensive material on mathematical models of speech dynamics and to address the following issues: How do we make sense of the complex speech process in terms of its functional role of speech communication? How do we quantify the special role of speech timing? How do the dynamics relate to the variability of speech that has often been said to seriously hamper automatic speech recognition? How do we put the dynamic process of speech into a quantitative form to enable detailed analyses? And finally, how can we incorporate the knowledge of speech dynamics into computerized speech analysis and recognition algorithms? The answers to all these questions require building and applying computational models for the dynamic speech process. What are the compelling reasons for carrying out dynamic speech modeling? We provide the answer in two related aspects. First, scientific inquiry into the human speech code has been relentlessly pursued for several decades. As an essential carrier of human intelligence and knowledge, speech is the most natural form of human communication. Embedded in the speech code are linguistic (as well as para-linguistic) messages, which are conveyed through four levels of the speech chain. Underlying the robust encoding and transmission of the linguistic messages are the speech dynamics at all the four levels. Mathematical modeling of speech dynamics provides an effective tool in the scientific methods of studying the speech chain. Such scientific studies help understand why humans speak as they do and how humans exploit redundancy and variability by way of multitiered dynamic processes to enhance the efficiency and effectiveness of human speech communication. Second, advancement of human language technology, especially that in automatic recognition of natural-style human speech is also expected to benefit from comprehensive computational modeling of speech dynamics. The limitations of current speech recognition technology are serious and are well known. A commonly acknowledged and frequently discussed weakness of the statistical model underlying current speech recognition technology is the lack of adequate dynamic modeling schemes to provide correlation structure across the temporal speech observation sequence. Unfortunately, due to a variety of reasons, the majority of current research activities in this area favor only incremental modifications and improvements to the existing HMM-based state-of-the-art. For example, while the dynamic and correlation modeling is known to be an important topic, most of the systems nevertheless employ only an ultra-weak form of speech dynamics; e.g., differential or delta parameters. Strong-form dynamic speech modeling, which is the focus of this monograph, may serve as an ultimate solution to this problem. After the introduction chapter, the main body of this monograph consists of four chapters. They cover various aspects of theory, algorithms, and applications of dynamic speech models, and provide a comprehensive survey of the research work in this area spanning over past 20~years. This monograph is intended as advanced materials of speech and signal processing for graudate-level teaching, for professionals and engineering practioners, as well as for seasoned researchers and engineers specialized in speech processing.

ISBN: 9783031025556

Standard No.: 10.1007/978-3-031-02555-6doiSubjects--Topical Terms:

596380
Electrical engineering.

LC Class. No.: TK1-9971

Dewey Class. No.: 621.3

Dynamic Speech Models = Theory, Algorithms, and Applications /
LDR:05252nam a22003735i 4500 001 1097470
003 DE-He213
005 20220601140107.0
007 cr nn 008mamaa
008 230315s2006 sz | s |||| 0|eng d
020 $a 9783031025556 $9 978-3-031-02555-6
024 7 $a 10.1007/978-3-031-02555-6 $2 doi
035 $a 978-3-031-02555-6
050 4 $a TK1-9971
072 7 $a THR $2 bicssc
072 7 $a TEC007000 $2 bisacsh
072 7 $a THR $2 thema
082 0 4 $a 621.3 $2 23
100 1 $a Deng, Li. $4 aut $4 http://id.loc.gov/vocabulary/relators/aut $3 1064095
245 1 0 $a Dynamic Speech Models $h [electronic resource] : $b Theory, Algorithms, and Applications / $c by Li Deng.
250 $a 1st ed. 2006.
264 1 $a Cham : $b Springer International Publishing : $b Imprint: Springer, $c 2006.
300 $a XI, 105 p. $b online resource.
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
347 $a text file $b PDF $2 rda
490 1 $a Synthesis Lectures on Speech and Audio Processing, $x 1932-1678
505 0 $a Introduction -- A General Modeling and Computational Framework -- Modeling: From Acoustic Dynamics to Hidden Dynamics -- Models with Discrete-Valued Hidden Speech Dynamics -- Models with Continuous-Valued Hidden Speech Trajectories.
520 $a Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation of a linguistic message in a speaker's brain and ends with the arrival of the message in a listener's brain. Given the intricacy of the dynamic speech process and its fundamental importance in human communication, this monograph is intended to provide a comprehensive material on mathematical models of speech dynamics and to address the following issues: How do we make sense of the complex speech process in terms of its functional role of speech communication? How do we quantify the special role of speech timing? How do the dynamics relate to the variability of speech that has often been said to seriously hamper automatic speech recognition? How do we put the dynamic process of speech into a quantitative form to enable detailed analyses? And finally, how can we incorporate the knowledge of speech dynamics into computerized speech analysis and recognition algorithms? The answers to all these questions require building and applying computational models for the dynamic speech process. What are the compelling reasons for carrying out dynamic speech modeling? We provide the answer in two related aspects. First, scientific inquiry into the human speech code has been relentlessly pursued for several decades. As an essential carrier of human intelligence and knowledge, speech is the most natural form of human communication. Embedded in the speech code are linguistic (as well as para-linguistic) messages, which are conveyed through four levels of the speech chain. Underlying the robust encoding and transmission of the linguistic messages are the speech dynamics at all the four levels. Mathematical modeling of speech dynamics provides an effective tool in the scientific methods of studying the speech chain. Such scientific studies help understand why humans speak as they do and how humans exploit redundancy and variability by way of multitiered dynamic processes to enhance the efficiency and effectiveness of human speech communication. Second, advancement of human language technology, especially that in automatic recognition of natural-style human speech is also expected to benefit from comprehensive computational modeling of speech dynamics. The limitations of current speech recognition technology are serious and are well known. A commonly acknowledged and frequently discussed weakness of the statistical model underlying current speech recognition technology is the lack of adequate dynamic modeling schemes to provide correlation structure across the temporal speech observation sequence. Unfortunately, due to a variety of reasons, the majority of current research activities in this area favor only incremental modifications and improvements to the existing HMM-based state-of-the-art. For example, while the dynamic and correlation modeling is known to be an important topic, most of the systems nevertheless employ only an ultra-weak form of speech dynamics; e.g., differential or delta parameters. Strong-form dynamic speech modeling, which is the focus of this monograph, may serve as an ultimate solution to this problem. After the introduction chapter, the main body of this monograph consists of four chapters. They cover various aspects of theory, algorithms, and applications of dynamic speech models, and provide a comprehensive survey of the research work in this area spanning over past 20~years. This monograph is intended as advanced materials of speech and signal processing for graudate-level teaching, for professionals and engineering practioners, as well as for seasoned researchers and engineers specialized in speech processing.
650 0 $a Electrical engineering. $3 596380
650 0 $a Signal processing. $3 561459
650 0 $a Acoustical engineering. $3 563185
650 1 4 $a Electrical and Electronic Engineering. $3 1388937
650 2 4 $a Signal, Speech and Image Processing . $3 1366353
650 2 4 $a Engineering Acoustics. $3 785331
710 2 $a SpringerLink (Online service) $3 593884
773 0 $t Springer Nature eBook
776 0 8 $i Printed edition: $z 9783031014277
776 0 8 $i Printed edition: $z 9783031036835
830 0 $a Synthesis Lectures on Speech and Audio Processing, $x 1932-1678 $3 1407318
856 4 0 $u https://doi.org/10.1007/978-3-031-02555-6
912 $a ZDB-2-SXSC
950 $a Synthesis Collection of Technology (R0) (SpringerNature-85007)