語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Speech and computer = 26th Internati...
~
International Conference Speech and Computer (2024 :)
FindBook
Google Book
Amazon
博客來
Speech and computer = 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.. Part I /
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Speech and computer/ edited by Alexey Karpov, Vlado Delić.
其他題名:
26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.
其他題名:
SPECOM 2024
其他作者:
Karpov, Alexey.
團體作者:
International Conference Speech and Computer
出版者:
Cham :Springer Nature Switzerland : : 2025.,
面頁冊數:
xvii, 395 p. :ill. (chiefly color), digital ;24 cm.
內容註:
Invited Papers -- Preserving Language Heritage Through Speech Technology: The Case of Upper Sorbian -- Retrospective and Perspectives of TTS & STT Technology Development and Implementation for South Slavic Under-Resourced Languages -- Automatic Speech Recognition -- Comparison of Well- and Lower-Resourced Self-Training in ASR -- Towards a Livvi-Karelian End-to-End ASR System -- Advances in OpenASR21 Evaluation with Increased Temporal Resolution for Speech Self-Supervised Learning Models -- Benchmarking Whisper under Diverse Audio Transformations and Real-time Constraints -- AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost -- Pre-Training and Adverse Audio Samples for Data-Efficient Wake Word Detection -- Cross-Lingual Summarization of Speech-to-Speech Translation: A Baseline -- Speech and Language Resources -- The ParlaSpeech Collection of Automatically Generated Speech and Text Corpora from Parliamentary Proceedings -- ESC Corpus of Spoken Russian: Everyday Student Conversations Captured through Continuous Speech Recording in Natural Communicative Environments -- OpenAV: Bilingual Dataset for Audio-Visual Voice Control of a Computer for Hand Disabled People -- Bulgarian Speech Resources in the CHILDES System -- Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies -- Neurophysiological Correlates of Textual Modulation in Visual Stimuli: An Experimental Study of Russian and English Memes -- Speech Synthesis and Perception -- End-to-End Speech Synthesis for the Serbian Language Based on Tacotron -- ChildTinyTalks (CTT): A Benchmark Dataset and Baseline for Expressive Child Speech Synthesis -- Multidimensional Rhythm: Comparing Rhythmic Properties of Australian and New Zealand Monologues -- Influence of Linguistic and Sociolinguistic Factors on Speech Rate Perception -- Human and Machine Keyphrase Perception in Russian Text and Speech -- Assessment of Children's Ability to Manifest Emotions in Facial Expressions, Voice and Speech by Humans, Automatic, and on a Likert Scale -- Speech Processing for Medicine -- Investigating the Utility of wav2vec 2.0 Hidden Layers for Detecting Multiple Sclerosis -- Cross-Cultural Automatic Depression Detection based on Audio Signals -- Depression Classification using Token Merging-based Speech Spectrotemporal Transformer -- Detecting Depression from Audio Data -- Binary and Multiclass Classification of Dysphonia Using Whisper Encoder and One-Dimensional Convolutional Neural Network -- Approach to Assessing the Quality of Syllable Pronunciation by Patients in the Process of Speech Rehabilitation Based on Comparison with Healthy Speakers -- A Comparative Study for Contextualized Spoken Answer Classification in German Medical Questionnaires.
Contained By:
Springer Nature eBook
標題:
Natural language processing (Computer science) - Congresses. -
電子資源:
https://doi.org/10.1007/978-3-031-77961-9
ISBN:
9783031779619
Speech and computer = 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.. Part I /
Speech and computer
26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.Part I /[electronic resource] :SPECOM 2024edited by Alexey Karpov, Vlado Delić. - Cham :Springer Nature Switzerland :2025. - xvii, 395 p. :ill. (chiefly color), digital ;24 cm. - Lecture notes in computer science,152991611-3349 ;. - Lecture notes in computer science ;15299..
Invited Papers -- Preserving Language Heritage Through Speech Technology: The Case of Upper Sorbian -- Retrospective and Perspectives of TTS & STT Technology Development and Implementation for South Slavic Under-Resourced Languages -- Automatic Speech Recognition -- Comparison of Well- and Lower-Resourced Self-Training in ASR -- Towards a Livvi-Karelian End-to-End ASR System -- Advances in OpenASR21 Evaluation with Increased Temporal Resolution for Speech Self-Supervised Learning Models -- Benchmarking Whisper under Diverse Audio Transformations and Real-time Constraints -- AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost -- Pre-Training and Adverse Audio Samples for Data-Efficient Wake Word Detection -- Cross-Lingual Summarization of Speech-to-Speech Translation: A Baseline -- Speech and Language Resources -- The ParlaSpeech Collection of Automatically Generated Speech and Text Corpora from Parliamentary Proceedings -- ESC Corpus of Spoken Russian: Everyday Student Conversations Captured through Continuous Speech Recording in Natural Communicative Environments -- OpenAV: Bilingual Dataset for Audio-Visual Voice Control of a Computer for Hand Disabled People -- Bulgarian Speech Resources in the CHILDES System -- Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies -- Neurophysiological Correlates of Textual Modulation in Visual Stimuli: An Experimental Study of Russian and English Memes -- Speech Synthesis and Perception -- End-to-End Speech Synthesis for the Serbian Language Based on Tacotron -- ChildTinyTalks (CTT): A Benchmark Dataset and Baseline for Expressive Child Speech Synthesis -- Multidimensional Rhythm: Comparing Rhythmic Properties of Australian and New Zealand Monologues -- Influence of Linguistic and Sociolinguistic Factors on Speech Rate Perception -- Human and Machine Keyphrase Perception in Russian Text and Speech -- Assessment of Children's Ability to Manifest Emotions in Facial Expressions, Voice and Speech by Humans, Automatic, and on a Likert Scale -- Speech Processing for Medicine -- Investigating the Utility of wav2vec 2.0 Hidden Layers for Detecting Multiple Sclerosis -- Cross-Cultural Automatic Depression Detection based on Audio Signals -- Depression Classification using Token Merging-based Speech Spectrotemporal Transformer -- Detecting Depression from Audio Data -- Binary and Multiclass Classification of Dysphonia Using Whisper Encoder and One-Dimensional Convolutional Neural Network -- Approach to Assessing the Quality of Syllable Pronunciation by Patients in the Process of Speech Rehabilitation Based on Comparison with Healthy Speakers -- A Comparative Study for Contextualized Spoken Answer Classification in German Medical Questionnaires.
The two-volume set LNAI 15299 and 15300 constitutes the refereed proceedings of the 26th International Conference on Speech and Computer, SPECOM 2024, held in Belgrade, Serbia, during November 25-28, 2024. The 53 full papers included in these proceedings were carefully reviewed and selected from 90 submissions. The book also contains two invited talks in full paper length. The papers are organized in the following topical sections: Volume I: Invited papers; automatic speech recognition; speech and language resources; speech synthesis and perception; and speech processing for medicine. Volume II: Computational paralinguistics; affective computing; speaker recognition; digital speech processing; natural language processing.
ISBN: 9783031779619
Standard No.: 10.1007/978-3-031-77961-9doiSubjects--Topical Terms:
752585
Natural language processing (Computer science)
--Congresses.
LC Class. No.: QA76.9.N38
Dewey Class. No.: 006.35
Speech and computer = 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.. Part I /
LDR
:04747nmm a2200361 a 4500
001
2407995
003
DE-He213
005
20241122115240.0
006
m d
007
cr nn 008maaau
008
260204s2025 sz s 0 eng d
020
$a
9783031779619
$q
(electronic bk.)
020
$a
9783031779602
$q
(paper)
024
7
$a
10.1007/978-3-031-77961-9
$2
doi
035
$a
978-3-031-77961-9
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
QA76.9.N38
072
7
$a
UYQ
$2
bicssc
072
7
$a
COM004000
$2
bisacsh
072
7
$a
UYQ
$2
thema
082
0 4
$a
006.35
$2
23
090
$a
QA76.9.N38
$b
I61 2024
111
2
$a
International Conference Speech and Computer
$n
(26th :
$d
2024 :
$c
Belgrade, Serbia)
$3
3780196
245
1 0
$a
Speech and computer
$h
[electronic resource] :
$b
26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024 : proceedings.
$n
Part I /
$c
edited by Alexey Karpov, Vlado Delić.
246
3
$a
SPECOM 2024
260
$a
Cham :
$b
Springer Nature Switzerland :
$b
Imprint: Springer,
$c
2025.
300
$a
xvii, 395 p. :
$b
ill. (chiefly color), digital ;
$c
24 cm.
490
1
$a
Lecture notes in computer science,
$x
1611-3349 ;
$v
15299
490
1
$a
Lecture notes in artificial intelligence
505
0
$a
Invited Papers -- Preserving Language Heritage Through Speech Technology: The Case of Upper Sorbian -- Retrospective and Perspectives of TTS & STT Technology Development and Implementation for South Slavic Under-Resourced Languages -- Automatic Speech Recognition -- Comparison of Well- and Lower-Resourced Self-Training in ASR -- Towards a Livvi-Karelian End-to-End ASR System -- Advances in OpenASR21 Evaluation with Increased Temporal Resolution for Speech Self-Supervised Learning Models -- Benchmarking Whisper under Diverse Audio Transformations and Real-time Constraints -- AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost -- Pre-Training and Adverse Audio Samples for Data-Efficient Wake Word Detection -- Cross-Lingual Summarization of Speech-to-Speech Translation: A Baseline -- Speech and Language Resources -- The ParlaSpeech Collection of Automatically Generated Speech and Text Corpora from Parliamentary Proceedings -- ESC Corpus of Spoken Russian: Everyday Student Conversations Captured through Continuous Speech Recording in Natural Communicative Environments -- OpenAV: Bilingual Dataset for Audio-Visual Voice Control of a Computer for Hand Disabled People -- Bulgarian Speech Resources in the CHILDES System -- Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies -- Neurophysiological Correlates of Textual Modulation in Visual Stimuli: An Experimental Study of Russian and English Memes -- Speech Synthesis and Perception -- End-to-End Speech Synthesis for the Serbian Language Based on Tacotron -- ChildTinyTalks (CTT): A Benchmark Dataset and Baseline for Expressive Child Speech Synthesis -- Multidimensional Rhythm: Comparing Rhythmic Properties of Australian and New Zealand Monologues -- Influence of Linguistic and Sociolinguistic Factors on Speech Rate Perception -- Human and Machine Keyphrase Perception in Russian Text and Speech -- Assessment of Children's Ability to Manifest Emotions in Facial Expressions, Voice and Speech by Humans, Automatic, and on a Likert Scale -- Speech Processing for Medicine -- Investigating the Utility of wav2vec 2.0 Hidden Layers for Detecting Multiple Sclerosis -- Cross-Cultural Automatic Depression Detection based on Audio Signals -- Depression Classification using Token Merging-based Speech Spectrotemporal Transformer -- Detecting Depression from Audio Data -- Binary and Multiclass Classification of Dysphonia Using Whisper Encoder and One-Dimensional Convolutional Neural Network -- Approach to Assessing the Quality of Syllable Pronunciation by Patients in the Process of Speech Rehabilitation Based on Comparison with Healthy Speakers -- A Comparative Study for Contextualized Spoken Answer Classification in German Medical Questionnaires.
520
$a
The two-volume set LNAI 15299 and 15300 constitutes the refereed proceedings of the 26th International Conference on Speech and Computer, SPECOM 2024, held in Belgrade, Serbia, during November 25-28, 2024. The 53 full papers included in these proceedings were carefully reviewed and selected from 90 submissions. The book also contains two invited talks in full paper length. The papers are organized in the following topical sections: Volume I: Invited papers; automatic speech recognition; speech and language resources; speech synthesis and perception; and speech processing for medicine. Volume II: Computational paralinguistics; affective computing; speaker recognition; digital speech processing; natural language processing.
650
0
$a
Natural language processing (Computer science)
$v
Congresses.
$3
752585
650
0
$a
Automatic speech recognition
$v
Congresses.
$3
840482
650
0
$a
Speech processing systems
$x
Congresses.
$3
678615
650
0
$a
Human-computer interaction
$x
Congresses.
$3
705966
650
0
$a
Linguistics
$v
Congresses.
$3
792572
650
1 4
$a
Artificial Intelligence.
$3
769149
650
2 4
$a
Computer Imaging, Vision, Pattern Recognition and Graphics.
$3
890871
650
2 4
$a
Computer Engineering and Networks.
$3
3538504
650
2 4
$a
Computer and Information Systems Applications.
$3
3538505
700
1
$a
Karpov, Alexey.
$3
3251780
700
1
$a
Delić, Vlado.
$3
3780197
710
2
$a
SpringerLink (Online service)
$3
836513
773
0
$t
Springer Nature eBook
830
0
$a
Lecture notes in computer science ;
$v
15299.
$3
3780198
830
0
$a
Lecture notes in artificial intelligence.
$3
3382562
856
4 0
$u
https://doi.org/10.1007/978-3-031-77961-9
950
$a
Computer Science (SpringerNature-11645)
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9513493
電子資源
11.線上閱覽_V
電子書
EB QA76.9.N38
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入