國立虎尾科技大學 (National Formosa University)

Speech and Computer: 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings

Record Type:	Bibliographic - language material, printed : Monograph/item
Title/Author:	Speech and Computer / edited by Alexey Karpov, Rodmonga Potapova.
Other Title:	22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings
Other Author:	Karpov, Alexey.
Physical Description:	XIV, 689 p., 222 illus., 155 illus. in color; online resource.
Contained By:	Springer Nature eBook
Subject:	Artificial intelligence.
Electronic Resource:	https://doi.org/10.1007/978-3-030-60276-5
ISBN:	9783030602765

Speech and Computer [electronic resource] : 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings / edited by Alexey Karpov, Rodmonga Potapova. - 1st ed. 2020. - XIV, 689 p., 222 illus., 155 illus. in color; online resource. - Lecture Notes in Artificial Intelligence ; 12335.

Lightweight CNN for Robust Voice Activity Detection -- Hate Speech Detection Using Transformer Ensembles on the HASOC Dataset -- MP3 Compression to Diminish Adversarial Noise in End-to-End Speech Recognition -- Exploration of End-to-End ASR for OpenSTT – Russian Open Speech-to-Text Dataset -- Directional Clustering with Polyharmonic Phase Estimation for Enhanced Speaker Localization -- Speech Emotion Recognition using Spectrogram Patterns as Features -- Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models -- Data Augmentation and Loss Normalization for Deep Noise Suppression -- Automatic Information Extraction from Scanned Documents -- Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model -- A Rumor Detection in Russian Tweets -- Automatic Prediction of Word form Reduction in Russian Spontaneous Speech -- Formant Frequency Analysis of MSA Vowels in Six Algerian Regions -- Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian -- Predicting a Cold from Speech using Fisher Vectors; SVM and XGBoost as Classifiers -- Toxicity in Texts and Images on the Internet -- An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents -- Lipreading with LipsID -- Automated Destructive Behavior State Detection on the 1D CNN-based Voice Analysis -- Rhythmic Structures of Russian Prose and Occasional Iambs (a Diachronic Case Study) -- Automatic Detection of Backchannels in Russian Dialogue Speech -- Experimenting with Attention Mechanisms in Joint CTC-Attention Models for Russian Speech Recognition -- Comparison of Deep Learning Methods for Spoken Language Identification -- Conceptual Operations with Semantics for a Companion Robot -- Legal Tech: Documents' Validation Method Based on the Associative-Ontological Approach -- Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition -- CTC-Segmentation of Large Corpora for German End-to-End Speech Recognition -- Stylometrics Features under Domain Shift: Do they Really “Context-independent” -- Speech Features of 13-15 Year-old Children with Autism Spectrum Disorders -- Multi-corpus Experiment on Continuous Speech Emotion Recognition: Convolution or Recurrence -- Detection of Toxic Language in Short Text Messages -- Transfer Learning in Speaker’s Age and Gender Recognition -- Interactivity-based Quality Prediction of Conversations with Transmission Delay -- Graphic Markers of Irony and Sarcasm in Written Texts -- Digital Rhetoric 2.0: How to Train Charismatic Speaking with Speech-melody Visualization Software -- Generating a Concept Relation Network for Turkish Based on ConceptNet Using Translational Methods -- Bulgarian Associative Dictionaries in the LABLASS Web-based System -- Preliminary Investigation of Potential Steganographic Container Localization -- Some Comparative Cognitive and Neurophysiological Reactions to Code-modified Internet Information -- The Influence of Multimodal Polycode Internet Content on Human Brain Activity -- Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space -- Investigating the Effect of Emoji in Opinion Classification of Uzbek Movie Review Comments -- Evaluation of Voice Mimicking using i-vector Framework -- Score Normalization of x-vector Speaker Verification System for Short-duration Speaker Verification Challenge -- Genuine Spontaneous vs Fake Spontaneous Speech: in Search of Distinction -- Mixing Synthetic and Recorded Signals for Audio-book Generation -- Temporal Concord in Speech Interaction: Overlaps and Interruptions in Spoken American English -- Cognitively Challenging: Language Shift and Speech Rate of Academic Bilinguals -- Toward Explainable Automatic Classification of Children’s Speech Disorders -- Recognition Performance of Selected Speech Recognition APIs – A Longitudinal Study -- Does A Priori Phonological Knowledge Improve Cross-Lingual Robustness of Phonemic Contrasts -- Can We Detect Irony in Speech Using Phonetic Characteristics Only? - Looking for a Methodology of Analysis -- Automated Compilation of a Corpus-based Dictionary and Computing Concreteness Ratings of Russian -- Increasing the Accuracy of the ASR System by Prolonging Voiceless Phonemes in the Speech of Patients using the Electrolarynx -- Leverage Unlabeled Data for Abstractive Speech Summarization with Self-Supervised Learning and Back-Summarization -- Uncertainty of Phone Voicing and its Impact on Speech Synthesis -- Grappling with Web Technologies: the Problems of Remote Speech Recording -- Robust Noisy Speech Parameterization Using Convolutional Neural Networks -- More than Words: Cross-Linguistic Exploration of Parkinson's Disease Identification from Speech -- Phonological Length of L2 Czech Speakers’ Vowels in Ambiguous Contexts as Perceived by L1 Listeners -- Learning an Unsupervised and Interpretable Representation of Emotion from Speech -- Synchronized Forward-Backward Transformer for End-to-End Speech Recognition -- KazNLP: a Pipeline for Automated Processing of Texts Written in Kazakh Language -- Diarization based on Identification with x-vectors -- Different Approaches in Cross-Language Similar Documents Retrieval in the Legal Domain.

This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in computer speech processing, including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, and voice assistants. Due to the COVID-19 pandemic, SPECOM 2020 was held as a virtual event.

ISBN: 9783030602765

Standard No.: 10.1007/978-3-030-60276-5 (doi)

Subjects--Topical Terms:

559380
Artificial intelligence.

LC Class. No.: Q334-342

Dewey Class. No.: 006.3

LDR 07583nam a22004095i 4500
001 1030091
003 DE-He213
005 20201004081751.0
007 cr nn 008mamaa
008 210318s2020 gw | s |||| 0|eng d
020 $a 9783030602765 $9 978-3-030-60276-5
024 7 $a 10.1007/978-3-030-60276-5 $2 doi
035 $a 978-3-030-60276-5
050 4 $a Q334-342
072 7 $a UYQ $2 bicssc
072 7 $a COM004000 $2 bisacsh
072 7 $a UYQ $2 thema
082 0 4 $a 006.3 $2 23
245 1 0 $a Speech and Computer $h [electronic resource] : $b 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings / $c edited by Alexey Karpov, Rodmonga Potapova.
250 $a 1st ed. 2020.
264 1 $a Cham : $b Springer International Publishing : $b Imprint: Springer, $c 2020.
300 $a XIV, 689 p. 222 illus., 155 illus. in color. $b online resource.
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
347 $a text file $b PDF $2 rda
490 1 $a Lecture Notes in Artificial Intelligence ; $v 12335
505 0 $a Lightweight CNN for Robust Voice Activity Detection -- Hate Speech Detection Using Transformer Ensembles on the HASOC Dataset -- MP3 Compression to Diminish Adversarial Noise in End-to-End Speech Recognition -- Exploration of End-to-End ASR for OpenSTT – Russian Open Speech-to-Text Dataset -- Directional Clustering with Polyharmonic Phase Estimation for Enhanced Speaker Localization -- Speech Emotion Recognition using Spectrogram Patterns as Features -- Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models -- Data Augmentation and Loss Normalization for Deep Noise Suppression -- Automatic Information Extraction from Scanned Documents -- Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model -- A Rumor Detection in Russian Tweets -- Automatic Prediction of Word form Reduction in Russian Spontaneous Speech -- Formant Frequency Analysis of MSA Vowels in Six Algerian Regions -- Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian -- Predicting a Cold from Speech using Fisher Vectors; SVM and XGBoost as Classifiers -- Toxicity in Texts and Images on the Internet -- An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents -- Lipreading with LipsID -- Automated Destructive Behavior State Detection on the 1D CNN-based Voice Analysis -- Rhythmic Structures of Russian Prose and Occasional Iambs (a Diachronic Case Study) -- Automatic Detection of Backchannels in Russian Dialogue Speech -- Experimenting with Attention Mechanisms in Joint CTC-Attention Models for Russian Speech Recognition -- Comparison of Deep Learning Methods for Spoken Language Identification -- Conceptual Operations with Semantics for a Companion Robot -- Legal Tech: Documents' Validation Method Based on the Associative-Ontological Approach -- Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition -- CTC-Segmentation of Large Corpora for German End-to-End Speech Recognition -- Stylometrics Features under Domain Shift: Do they Really “Context-independent” -- Speech Features of 13-15 Year-old Children with Autism Spectrum Disorders -- Multi-corpus Experiment on Continuous Speech Emotion Recognition: Convolution or Recurrence -- Detection of Toxic Language in Short Text Messages -- Transfer Learning in Speaker’s Age and Gender Recognition -- Interactivity-based Quality Prediction of Conversations with Transmission Delay -- Graphic Markers of Irony and Sarcasm in Written Texts -- Digital Rhetoric 2.0: How to Train Charismatic Speaking with Speech-melody Visualization Software -- Generating a Concept Relation Network for Turkish Based on ConceptNet Using Translational Methods -- Bulgarian Associative Dictionaries in the LABLASS Web-based System -- Preliminary Investigation of Potential Steganographic Container Localization -- Some Comparative Cognitive and Neurophysiological Reactions to Code-modified Internet Information -- The Influence of Multimodal Polycode Internet Content on Human Brain Activity -- Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space -- Investigating the Effect of Emoji in Opinion Classification of Uzbek Movie Review Comments -- Evaluation of Voice Mimicking using i-vector Framework -- Score Normalization of x-vector Speaker Verification System for Short-duration Speaker Verification Challenge -- Genuine Spontaneous vs Fake Spontaneous Speech: in Search of Distinction -- Mixing Synthetic and Recorded Signals for Audio-book Generation -- Temporal Concord in Speech Interaction: Overlaps and Interruptions in Spoken American English -- Cognitively Challenging: Language Shift and Speech Rate of Academic Bilinguals -- Toward Explainable Automatic Classification of Children’s Speech Disorders -- Recognition Performance of Selected Speech Recognition APIs – A Longitudinal Study -- Does A Priori Phonological Knowledge Improve Cross-Lingual Robustness of Phonemic Contrasts -- Can We Detect Irony in Speech Using Phonetic Characteristics Only? - Looking for a Methodology of Analysis -- Automated Compilation of a Corpus-based Dictionary and Computing Concreteness Ratings of Russian -- Increasing the Accuracy of the ASR System by Prolonging Voiceless Phonemes in the Speech of Patients using the Electrolarynx -- Leverage Unlabeled Data for Abstractive Speech Summarization with Self-Supervised Learning and Back-Summarization -- Uncertainty of Phone Voicing and its Impact on Speech Synthesis -- Grappling with Web Technologies: the Problems of Remote Speech Recording -- Robust Noisy Speech Parameterization Using Convolutional Neural Networks -- More than Words: Cross-Linguistic Exploration of Parkinson's Disease Identification from Speech -- Phonological Length of L2 Czech Speakers’ Vowels in Ambiguous Contexts as Perceived by L1 Listeners -- Learning an Unsupervised and Interpretable Representation of Emotion from Speech -- Synchronized Forward-Backward Transformer for End-to-End Speech Recognition -- KazNLP: a Pipeline for Automated Processing of Texts Written in Kazakh Language -- Diarization based on Identification with x-vectors -- Different Approaches in Cross-Language Similar Documents Retrieval in the Legal Domain.
520 $a This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in computer speech processing, including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, and voice assistants. Due to the COVID-19 pandemic, SPECOM 2020 was held as a virtual event.
650 0 $a Artificial intelligence. $3 559380
650 0 $a Application software. $3 528147
650 0 $a Education—Data processing. $3 1253610
650 0 $a Data mining. $3 528622
650 0 $a Optical data processing. $3 639187
650 1 4 $a Artificial Intelligence. $3 646849
650 2 4 $a Computer Appl. in Social and Behavioral Sciences. $3 669920
650 2 4 $a Computers and Education. $3 669806
650 2 4 $a Data Mining and Knowledge Discovery. $3 677765
650 2 4 $a Information Systems Applications (incl. Internet). $3 881699
650 2 4 $a Computer Imaging, Vision, Pattern Recognition and Graphics. $3 671334
700 1 $a Karpov, Alexey. $4 edt $4 http://id.loc.gov/vocabulary/relators/edt $3 1199889
700 1 $a Potapova, Rodmonga. $4 edt $4 http://id.loc.gov/vocabulary/relators/edt $3 1069305
710 2 $a SpringerLink (Online service) $3 593884
773 0 $t Springer Nature eBook
776 0 8 $i Printed edition: $z 9783030602758
776 0 8 $i Printed edition: $z 9783030602772
830 0 $a Lecture Notes in Artificial Intelligence ; $v 12335 $3 1253845
856 4 0 $u https://doi.org/10.1007/978-3-030-60276-5
912 $a ZDB-2-SCS
912 $a ZDB-2-SXCS
912 $a ZDB-2-LNC
950 $a Computer Science (SpringerNature-11645)
950 $a Computer Science (R0) (SpringerNature-43710)