語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Incorporating Auditory Models in Spe...
~
Arizona State University.
Incorporating Auditory Models in Speech/Audio Applications.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Incorporating Auditory Models in Speech/Audio Applications./
作者:
Krishnamoorthi, Harish.
面頁冊數:
160 p.
附註:
Source: Dissertation Abstracts International, Volume: 72-07, Section: B, page: 4217.
Contained By:
Dissertation Abstracts International72-07B.
標題:
Health Sciences, Audiology. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3453472
ISBN:
9781124615363
Incorporating Auditory Models in Speech/Audio Applications.
Krishnamoorthi, Harish.
Incorporating Auditory Models in Speech/Audio Applications.
- 160 p.
Source: Dissertation Abstracts International, Volume: 72-07, Section: B, page: 4217.
Thesis (Ph.D.)--Arizona State University, 2011.
Following the success in incorporating perceptual models in audio coding algorithms, their application in other speech/audio processing systems is expanding. In general, all perceptual speech/audio processing algorithms involve minimization of an objective function that directly/indirectly incorporates properties of human perception. This dissertation primarily investigates the problems associated with directly embedding an auditory model in the objective function formulation and proposes possible solutions to overcome high complexity issues for use in real-time speech/audio algorithms.
ISBN: 9781124615363Subjects--Topical Terms:
845395
Health Sciences, Audiology.
Incorporating Auditory Models in Speech/Audio Applications.
LDR
:03669nam 2200361 4500
001
712923
005
20121003100257.5
008
121101s2011 ||||||||||||||||| ||eng d
020
$a
9781124615363
035
$a
(UMI)AAI3453472
035
$a
AAI3453472
040
$a
UMI
$c
UMI
100
1
$a
Krishnamoorthi, Harish.
$3
845394
245
1 0
$a
Incorporating Auditory Models in Speech/Audio Applications.
300
$a
160 p.
500
$a
Source: Dissertation Abstracts International, Volume: 72-07, Section: B, page: 4217.
500
$a
Adviser: Andreas Spanias.
502
$a
Thesis (Ph.D.)--Arizona State University, 2011.
520
$a
Following the success in incorporating perceptual models in audio coding algorithms, their application in other speech/audio processing systems is expanding. In general, all perceptual speech/audio processing algorithms involve minimization of an objective function that directly/indirectly incorporates properties of human perception. This dissertation primarily investigates the problems associated with directly embedding an auditory model in the objective function formulation and proposes possible solutions to overcome high complexity issues for use in real-time speech/audio algorithms.
520
$a
Specific problems addressed in this dissertation include: 1) the development of approximate but computationally efficient auditory model implementations that are consistent with the principles of psychoacoustics, 2) the development of a mapping scheme that allows synthesizing a time/frequency domain representation from its equivalent auditory model output.
520
$a
The first problem is aimed at addressing the high computational complexity involved in solving perceptual objective functions that require repeated application of auditory model for evaluation of different candidate solutions. In this dissertation, a frequency pruning and a detector pruning algorithm is developed that efficiently implements the various auditory model stages. The performance of the pruned model is compared to that of the original auditory model for different types of test signals in the SQAM database. Experimental results indicate only a 4-7% relative error in loudness while attaining up to 80-90 % reduction in computational complexity. Similarly, a hybrid algorithm is developed specifically for use with sinusoidal signals and employs the proposed auditory pattern combining technique together with a look-up table to store representative auditory patterns.
520
$a
The second problem obtains an estimate of the auditory representation that minimizes a perceptual objective function and transforms the auditory pattern back to its equivalent time/frequency representation. This avoids the repeated application of auditory model stages to test different candidate time/frequency vectors in minimizing perceptual objective functions. In this dissertation, a constrained mapping scheme is developed by linearizing certain auditory model stages that ensures obtaining a time/frequency mapping corresponding to the estimated auditory representation. This paradigm was successfully incorporated in a perceptual speech enhancement algorithm and a sinusoidal component selection task.
590
$a
School code: 0010.
650
4
$a
Health Sciences, Audiology.
$3
845395
650
4
$a
Engineering, Electronics and Electrical.
$3
845382
650
4
$a
Physics, Acoustics.
$3
845396
690
$a
0300
690
$a
0544
690
$a
0986
710
2
$a
Arizona State University.
$b
Electrical Engineering.
$3
845389
773
0
$t
Dissertation Abstracts International
$g
72-07B.
790
1 0
$a
Spanias, Andreas,
$e
advisor
790
1 0
$a
Papandreou-Suppappola, Antonia
$e
committee member
790
1 0
$a
Tepedelenlioglu, Cihan
$e
committee member
790
1 0
$a
Tsakalis, Konstantinos
$e
committee member
790
$a
0010
791
$a
Ph.D.
792
$a
2011
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3453472
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入