國立虎尾科技大學 |

Speech and Computer = 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings /

Record Type:	Language materials, printed : Monograph/item
Title/Author:	Speech and Computer/ edited by Andrey Ronzhin, Rodmonga Potapova, Géza Németh.
Reminder of title:	18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings /
other author:	Ronzhin, Andrey.
Description:	XVIII, 731 p. 197 illus.online resource. :
Contained By:	Springer Nature eBook
Subject:	Artificial intelligence. -
Online resource:	https://doi.org/10.1007/978-3-319-43958-7
ISBN:	9783319439587

Speech and Computer = 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings /
Speech and Computer18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings /[electronic resource] :edited by Andrey Ronzhin, Rodmonga Potapova, Géza Németh. - 1st ed. 2016. - XVIII, 731 p. 197 illus.online resource. - Lecture Notes in Artificial Intelligence ;9811. - Lecture Notes in Artificial Intelligence ;9285.

Automatic Speech Recognition based on Neural Networks -- Machine Processing of Dialogue States; Speculations on Conversational Entropy -- Speech Recognition Challenges in the Car Navigation Industry -- A Comparison of Acoustic Features of Speech of Typically Developing Children and Children with Autism Spectrum Disorders -- A Deep Neural Networks (DNN) Based models for a Computer Aided Pronunciation Learning System -- A Linguistic Interpretation of the Atom Decomposition of Fundamental Frequency Contour for American English -- A Phonetic Segmentation Procedure Based on Hidden Markov Models -- A Preliminary Exploration of Group Social Engagement Level Recognition in Multiparty Casual Conversation -- An Agonist-Antagonist Pitch Production Model -- An Algorithm for Phase Manipulation in a Speech Signal -- An Exploratory Study on Sociolinguistic Variation of Russian Everyday Speech -- Adaptation of DNN Acoustic Models using KL-divergence Regularization and Multi-Task Training -- Advances in STC Russian Spontaneous Speech Recognition System -- Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance -- Assessment of the Relation Between Low-Frequency Features and Velum Opening by Using Real Articulatory Data -- Automatic Summarization of Highly Spontaneous Speech -- Backchanneling via Twitter Data for Conversational Dialogue Systems -- Bio-Inspired Sparse Representation of Speech and Audio Using Psychoacoustic Adaptive Matching Pursuit -- Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech -- Comparative analysis of classifiers for automatic language recognition in spontaneous speech -- Comparison of Retrieval Approaches and Blind Relevance Feedback Methods within the Czech Speech Information Retrieval -- Convolutional Neural Network in the Task of Speaker Change Detection -- Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer -- Designing High-Coverage Multi-Level Text Corpus for Non-Professional-Voice Conservation -- Designing Syllable Models for an HMM based Speech Recognition System -- Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech using SVM -- Detecting Laughter and Filler Events by Time Series Smoothing with Genetic Algorithms -- Detecting State of Aggression in Sentences using CNN -- DNN-based Acoustic Modeling for Russian Speech Recognition Using Kaldi -- DNN-Based Duration Modeling for Synthesizing Short Sentences -- Emotional Speech of 3-Years Old Children: Norm-Risk-Deprivation -- Ensemble Deep Neural Network based Waveform-Driven Stress Model for Speech Synthesis -- Evaluation of Response Times on a Touch Screen using Stereo Panned Speech Command Auditory Feedback -- Evaluation of the Speech Quality During Rehabilitation after Surgical Treatment of the Cancer of Oral Cavity and Oropharynx based on a Comparison of the Fourier Spectra -- Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation -- Exploring GMM-derived Features for Unsupervised Adaptation of Deep Neural Network Acoustic Models -- Feature Space VTS with Phase Term Modeling -- Finding Speaker Position Under Difficult Acoustic Conditions -- Fusing Various Audio Feature Sets for Detection of Parkinson's Disease from Sustained Voice and Speech Recordings -- HAVRUS Corpus: High-speed Recordings of Audio-Visual Russian Speech -- Human-Smartphone Interaction for Dangerous Situation Detection & Recommendation Generation while Driving -- Improving Automatic Speech Recognition Containing Additive Noise Using Deep Denoising Autoencoders of LSTM Networks -- Improving the Quality of Automatic Speech Recognition in Trucks -- Improving Recognition of Dysarthric Speech Using Severity Based Tempo Adaptation -- Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent & Text-Independent Operation Modalities -- Improvements to Prosodic Variation in Long Short-Term Memory based Intonation Models Using Random Forest -- In-document Adaptation for a Human Guided Automatic Transcription Service -- Interaction Quality as a Human-Human Task-Oriented Conversation Performance -- Investigation of Segmentation in i-Vector based Speaker Diarization of Telephone Speech -- Investigation of Speech Signal Parameters Reflecting the Truth of Transmitted Information -- Investigating Signal Correlation as Continuity Metric in a Syllable based Unit Selection Synthesis System -- Knowledge Transfer for Utterance Classification in Low-Resource Languages -- Language Identification using Time Delay Neural Network D-Vector on Short Utterances -- Lexical Stress in Punjabi and its Representation in PLS -- Low Inter-Annotator Agreement in Sentence Boundary Detection and Personality -- LSTM-based Language Models for Spontaneous Speech Recognition -- Measuring Prosodic Entrainment in Italian Collaborative Game-based Dialogues -- Microphone Array Directivity Improvement in Low-Frequency Domain for Speech Processing -- Modeling Imperative Utterances in Russian Spoken Dialogue: Verb-Central Quantitative Approach -- Multimodal Perception of Aggressive Behavior -- On Individual Polyinformativity of Speech and Voice Regarding Speaker's Auditive Attribution (Forensic Phonetic Aspect) -- Online Biometric Identification With Face Analysis in Web Applications -- Optimization of Zelinski post-filtering calculation -- Phonetic Aspects of High Level of Naturalness in Speech Synthesis -- Polybasic Attribution of Social Network Discourse -- Precise Estimation of Harmonic Parameter Trend and Modification of a Speech Signal -- Profiling a Set of Personality Traits of a Text's Author: a Corpus-Based Approach -- Prosody Analysis of Malay Language Storytelling Corpus -- Quality Assessment of two Fullband Audio Codecs Supporting Real-Time Communication -- Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments -- Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment -- Scores Calibration in Speaker Recognition Systems -- Selecting Keypoint Detector and Descriptor Combination for Augmented Reality Application -- Semi-automatic Speaker Verification System Based on Analysis of Formant, Durational and Pitch Characteristics -- Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition -- Speech Acts Annotation of Everyday Conversations in the ORD corpus of Spoken Russian -- Speech Enhancement with Microphone Array Using a Multi Beam Adaptive Noise Suppressor -- Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System -- Speech Recognition combining MFCCs and Image Features -- Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech -- Statistical Analysis of Acoustical Parameters in the Voice of Children with Juvenile Dysphonia -- Stress, Arousal, and Stress Detector Trained on Acted Speech Database -- Study on the Improvement of Intelligibility for Elderly Speech using Formant Frequency Shift Method -- Text Classification in the Domain of Applied Linguistics as Part of a Pre-editing Module for Machine Translation Systems -- Tonal Specification of Perceptually Prominent Non-Nuclear Pitch Accents in Russian -- Toward Sign Language Motion Capture Dataset Building -- Trade-off Between Speed and Accuracy for Noise Variance Minimization (NVM) Pitch Estimation Algorithm -- Unsupervised Trained Functional Discourse Parser for E-Learning Materials Scaffolding.

This book constitutes the proceedings of the 18th International Conference on Speech and Computer, SPECOM 2016, held in Budapest, Hungary, in August 2016. The 85 papers presented in this volume were carefully reviewed and selected from 154 submissions.

ISBN: 9783319439587

Standard No.: 10.1007/978-3-319-43958-7doiSubjects--Topical Terms:

559380
Artificial intelligence.

LC Class. No.: Q334-342

Dewey Class. No.: 006.3

Speech and Computer = 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings /
LDR:09295nam a22004095i 4500 001 981042
003 DE-He213
005 20200704054430.0
007 cr nn 008mamaa
008 201211s2016 gw | s |||| 0|eng d
020 $a 9783319439587 $9 978-3-319-43958-7
024 7 $a 10.1007/978-3-319-43958-7 $2 doi
035 $a 978-3-319-43958-7
050 4 $a Q334-342
072 7 $a UYQ $2 bicssc
072 7 $a COM004000 $2 bisacsh
072 7 $a UYQ $2 thema
082 0 4 $a 006.3 $2 23
245 1 0 $a Speech and Computer $h [electronic resource] : $b 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings / $c edited by Andrey Ronzhin, Rodmonga Potapova, Géza Németh.
250 $a 1st ed. 2016.
264 1 $a Cham : $b Springer International Publishing : $b Imprint: Springer, $c 2016.
300 $a XVIII, 731 p. 197 illus. $b online resource.
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
347 $a text file $b PDF $2 rda
490 1 $a Lecture Notes in Artificial Intelligence ; $v 9811
505 0 $a Automatic Speech Recognition based on Neural Networks -- Machine Processing of Dialogue States; Speculations on Conversational Entropy -- Speech Recognition Challenges in the Car Navigation Industry -- A Comparison of Acoustic Features of Speech of Typically Developing Children and Children with Autism Spectrum Disorders -- A Deep Neural Networks (DNN) Based models for a Computer Aided Pronunciation Learning System -- A Linguistic Interpretation of the Atom Decomposition of Fundamental Frequency Contour for American English -- A Phonetic Segmentation Procedure Based on Hidden Markov Models -- A Preliminary Exploration of Group Social Engagement Level Recognition in Multiparty Casual Conversation -- An Agonist-Antagonist Pitch Production Model -- An Algorithm for Phase Manipulation in a Speech Signal -- An Exploratory Study on Sociolinguistic Variation of Russian Everyday Speech -- Adaptation of DNN Acoustic Models using KL-divergence Regularization and Multi-Task Training -- Advances in STC Russian Spontaneous Speech Recognition System -- Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance -- Assessment of the Relation Between Low-Frequency Features and Velum Opening by Using Real Articulatory Data -- Automatic Summarization of Highly Spontaneous Speech -- Backchanneling via Twitter Data for Conversational Dialogue Systems -- Bio-Inspired Sparse Representation of Speech and Audio Using Psychoacoustic Adaptive Matching Pursuit -- Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech -- Comparative analysis of classifiers for automatic language recognition in spontaneous speech -- Comparison of Retrieval Approaches and Blind Relevance Feedback Methods within the Czech Speech Information Retrieval -- Convolutional Neural Network in the Task of Speaker Change Detection -- Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer -- Designing High-Coverage Multi-Level Text Corpus for Non-Professional-Voice Conservation -- Designing Syllable Models for an HMM based Speech Recognition System -- Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech using SVM -- Detecting Laughter and Filler Events by Time Series Smoothing with Genetic Algorithms -- Detecting State of Aggression in Sentences using CNN -- DNN-based Acoustic Modeling for Russian Speech Recognition Using Kaldi -- DNN-Based Duration Modeling for Synthesizing Short Sentences -- Emotional Speech of 3-Years Old Children: Norm-Risk-Deprivation -- Ensemble Deep Neural Network based Waveform-Driven Stress Model for Speech Synthesis -- Evaluation of Response Times on a Touch Screen using Stereo Panned Speech Command Auditory Feedback -- Evaluation of the Speech Quality During Rehabilitation after Surgical Treatment of the Cancer of Oral Cavity and Oropharynx based on a Comparison of the Fourier Spectra -- Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation -- Exploring GMM-derived Features for Unsupervised Adaptation of Deep Neural Network Acoustic Models -- Feature Space VTS with Phase Term Modeling -- Finding Speaker Position Under Difficult Acoustic Conditions -- Fusing Various Audio Feature Sets for Detection of Parkinson's Disease from Sustained Voice and Speech Recordings -- HAVRUS Corpus: High-speed Recordings of Audio-Visual Russian Speech -- Human-Smartphone Interaction for Dangerous Situation Detection & Recommendation Generation while Driving -- Improving Automatic Speech Recognition Containing Additive Noise Using Deep Denoising Autoencoders of LSTM Networks -- Improving the Quality of Automatic Speech Recognition in Trucks -- Improving Recognition of Dysarthric Speech Using Severity Based Tempo Adaptation -- Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent & Text-Independent Operation Modalities -- Improvements to Prosodic Variation in Long Short-Term Memory based Intonation Models Using Random Forest -- In-document Adaptation for a Human Guided Automatic Transcription Service -- Interaction Quality as a Human-Human Task-Oriented Conversation Performance -- Investigation of Segmentation in i-Vector based Speaker Diarization of Telephone Speech -- Investigation of Speech Signal Parameters Reflecting the Truth of Transmitted Information -- Investigating Signal Correlation as Continuity Metric in a Syllable based Unit Selection Synthesis System -- Knowledge Transfer for Utterance Classification in Low-Resource Languages -- Language Identification using Time Delay Neural Network D-Vector on Short Utterances -- Lexical Stress in Punjabi and its Representation in PLS -- Low Inter-Annotator Agreement in Sentence Boundary Detection and Personality -- LSTM-based Language Models for Spontaneous Speech Recognition -- Measuring Prosodic Entrainment in Italian Collaborative Game-based Dialogues -- Microphone Array Directivity Improvement in Low-Frequency Domain for Speech Processing -- Modeling Imperative Utterances in Russian Spoken Dialogue: Verb-Central Quantitative Approach -- Multimodal Perception of Aggressive Behavior -- On Individual Polyinformativity of Speech and Voice Regarding Speaker's Auditive Attribution (Forensic Phonetic Aspect) -- Online Biometric Identification With Face Analysis in Web Applications -- Optimization of Zelinski post-filtering calculation -- Phonetic Aspects of High Level of Naturalness in Speech Synthesis -- Polybasic Attribution of Social Network Discourse -- Precise Estimation of Harmonic Parameter Trend and Modification of a Speech Signal -- Profiling a Set of Personality Traits of a Text's Author: a Corpus-Based Approach -- Prosody Analysis of Malay Language Storytelling Corpus -- Quality Assessment of two Fullband Audio Codecs Supporting Real-Time Communication -- Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments -- Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment -- Scores Calibration in Speaker Recognition Systems -- Selecting Keypoint Detector and Descriptor Combination for Augmented Reality Application -- Semi-automatic Speaker Verification System Based on Analysis of Formant, Durational and Pitch Characteristics -- Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition -- Speech Acts Annotation of Everyday Conversations in the ORD corpus of Spoken Russian -- Speech Enhancement with Microphone Array Using a Multi Beam Adaptive Noise Suppressor -- Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System -- Speech Recognition combining MFCCs and Image Features -- Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech -- Statistical Analysis of Acoustical Parameters in the Voice of Children with Juvenile Dysphonia -- Stress, Arousal, and Stress Detector Trained on Acted Speech Database -- Study on the Improvement of Intelligibility for Elderly Speech using Formant Frequency Shift Method -- Text Classification in the Domain of Applied Linguistics as Part of a Pre-editing Module for Machine Translation Systems -- Tonal Specification of Perceptually Prominent Non-Nuclear Pitch Accents in Russian -- Toward Sign Language Motion Capture Dataset Building -- Trade-off Between Speed and Accuracy for Noise Variance Minimization (NVM) Pitch Estimation Algorithm -- Unsupervised Trained Functional Discourse Parser for E-Learning Materials Scaffolding.
520 $a This book constitutes the proceedings of the 18th International Conference on Speech and Computer, SPECOM 2016, held in Budapest, Hungary, in August 2016. The 85 papers presented in this volume were carefully reviewed and selected from 154 submissions.
650 0 $a Artificial intelligence. $3 559380
650 0 $a Application software. $3 528147
650 0 $a Pattern recognition. $3 1253525
650 0 $a Information storage and retrieval. $3 1069252
650 0 $a Optical data processing. $3 639187
650 0 $a Database management. $3 557799
650 1 4 $a Artificial Intelligence. $3 646849
650 2 4 $a Information Systems Applications (incl. Internet). $3 881699
650 2 4 $a Pattern Recognition. $3 669796
650 2 4 $a Information Storage and Retrieval. $3 593926
650 2 4 $a Image Processing and Computer Vision. $3 670819
650 2 4 $a Database Management. $3 669820
700 1 $a Ronzhin, Andrey. $4 edt $4 http://id.loc.gov/vocabulary/relators/edt $3 1069304
700 1 $a Potapova, Rodmonga. $4 edt $4 http://id.loc.gov/vocabulary/relators/edt $3 1069305
700 1 $a Németh, Géza. $e editor. $4 edt $4 http://id.loc.gov/vocabulary/relators/edt $3 1273609
710 2 $a SpringerLink (Online service) $3 593884
773 0 $t Springer Nature eBook
776 0 8 $i Printed edition: $z 9783319439570
776 0 8 $i Printed edition: $z 9783319439594
830 0 $a Lecture Notes in Artificial Intelligence ; $v 9285 $3 1253845
856 4 0 $u https://doi.org/10.1007/978-3-319-43958-7
912 $a ZDB-2-SCS
912 $a ZDB-2-SXCS
912 $a ZDB-2-LNC
950 $a Computer Science (SpringerNature-11645)
950 $a Computer Science (R0) (SpringerNature-43710)