Computational spectrotemporal auditory model with applications to acoustical information processing.
Record type:
Bibliographic - electronic resource : Monograph/item
Title proper/Author:
Computational spectrotemporal auditory model with applications to acoustical information processing.
Author:
Chi, Tai-Shih.
Publisher:
Ann Arbor : ProQuest Dissertations & Theses, 2003
Extent:
137 p.
Notes:
Source: Dissertations Abstracts International, Volume: 65-05, Section: B.
Contained By:
Dissertations Abstracts International 65-05B.
Subject:
Electrical engineering.
Electronic resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3094467
ISBN:
9780496421688
Thesis (Ph.D.)--University of Maryland, College Park, 2003.
This item must not be sold to any third party vendors.
A computational spectrotemporal auditory model based on neurophysiological findings in the early auditory and cortical stages is described. The model provides a unified multiresolution representation of the spectral and temporal features of sound that are likely critical in the perception of timbre. Several types of complex stimuli are used to demonstrate the spectrotemporal information preserved by the model. As these examples show, the two-stage model reflects the apparent progressive loss of temporal dynamics along the auditory pathway, from rapid phase-locking (several kHz in the auditory nerve), to moderate rates of synchrony (several hundred Hz in the midbrain), to much lower modulation rates in the cortex (around 30 Hz). To complete the model, several projection-based reconstruction algorithms are implemented to resynthesize sound from the representations with reduced dynamics. One particular application of the model is the assessment of speech intelligibility. The spectrotemporal modulation transfer functions (MTFs) of the model are investigated and shown to be consistent with the salient trends in human MTFs (derived from human detection thresholds), which exhibit a lowpass function along both the spectral and temporal dimensions, with 50% bandwidths of about 16 Hz and 2 cycles/octave. The model is therefore used to demonstrate the potential relevance of these MTFs to the assessment of speech intelligibility in noisy and reverberant conditions. Another useful feature is the phase singularity that emerges in the scale space generated by this multiscale auditory model. The singularity is shown to have certain robust properties and to carry crucial information about the spectral profile. This claim is supported by the perceptually tolerable sounds resynthesized from the nonconvex singularity set. In addition, the singularity set is demonstrated to encode pitch and formants at different scales. These properties make the singularity set well suited to traditional speech tasks such as vowel recognition or speaker (or musical instrument) identification. Other potential applications and future modifications of the model are also discussed at the end.
ISBN: 9780496421688
Subjects--Topical Terms:
649834
Electrical engineering.
Subjects--Index Terms:
Acoustical
LDR 03448nmm a2200373 4500
001 2348677
005 20220912135632.5
008 241004s2003 ||||||||||||||||| ||eng d
020 $a 9780496421688
035 $a (MiAaPQ)AAI3094467
035 $a AAI3094467
040 $a MiAaPQ $c MiAaPQ
100 1 $a Chi, Tai-Shih. $3 3688046
245 1 0 $a Computational spectrotemporal auditory model with applications to acoustical information processing.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2003
300 $a 137 p.
500 $a Source: Dissertations Abstracts International, Volume: 65-05, Section: B.
500 $a Publisher info.: Dissertation/Thesis.
500 $a Advisor: Shamma, Shihab A.
502 $a Thesis (Ph.D.)--University of Maryland, College Park, 2003.
506 $a This item must not be sold to any third party vendors.
506 $a This item must not be added to any third party search indexes.
520 $a A computational spectrotemporal auditory model based on neurophysiological findings in the early auditory and cortical stages is described. The model provides a unified multiresolution representation of the spectral and temporal features of sound that are likely critical in the perception of timbre. Several types of complex stimuli are used to demonstrate the spectrotemporal information preserved by the model. As these examples show, the two-stage model reflects the apparent progressive loss of temporal dynamics along the auditory pathway, from rapid phase-locking (several kHz in the auditory nerve), to moderate rates of synchrony (several hundred Hz in the midbrain), to much lower modulation rates in the cortex (around 30 Hz). To complete the model, several projection-based reconstruction algorithms are implemented to resynthesize sound from the representations with reduced dynamics. One particular application of the model is the assessment of speech intelligibility. The spectrotemporal modulation transfer functions (MTFs) of the model are investigated and shown to be consistent with the salient trends in human MTFs (derived from human detection thresholds), which exhibit a lowpass function along both the spectral and temporal dimensions, with 50% bandwidths of about 16 Hz and 2 cycles/octave. The model is therefore used to demonstrate the potential relevance of these MTFs to the assessment of speech intelligibility in noisy and reverberant conditions. Another useful feature is the phase singularity that emerges in the scale space generated by this multiscale auditory model. The singularity is shown to have certain robust properties and to carry crucial information about the spectral profile. This claim is supported by the perceptually tolerable sounds resynthesized from the nonconvex singularity set. In addition, the singularity set is demonstrated to encode pitch and formants at different scales. These properties make the singularity set well suited to traditional speech tasks such as vowel recognition or speaker (or musical instrument) identification. Other potential applications and future modifications of the model are also discussed at the end.
590 $a School code: 0117.
650 4 $a Electrical engineering. $3 649834
650 4 $a Acoustics. $3 879105
653 $a Acoustical
653 $a Auditory
653 $a Information processing
653 $a Spectrotemporal
690 $a 0544
690 $a 0986
710 2 $a University of Maryland, College Park. $3 657686
773 0 $t Dissertations Abstracts International $g 65-05B.
790 $a 0117
791 $a Ph.D.
792 $a 2003
793 $a English
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3094467
Holdings
Barcode:
W9471115
Location:
Electronic resource
Circulation category:
11. Online viewing_V
Material type:
E-book
Call number:
EB
Use type:
General use (Normal)
Loan status:
On shelf
Holds:
0