國立虎尾科技大學 |

Graph-Based Acoustic Clustering and Classification.

紀錄類型:	書目-語言資料,手稿 : Monograph/item
正題名/作者:	Graph-Based Acoustic Clustering and Classification./
作者:	Sunu, Justin.
面頁冊數:	1 online resource (114 pages)
附註:	Source: Dissertations Abstracts International, Volume: 85-04, Section: A.
Contained By:	Dissertations Abstracts International85-04A.
標題:	Computer science. -
電子資源:	click for full text (PQDT)
ISBN:	9798380417167

Graph-Based Acoustic Clustering and Classification.
Sunu, Justin.

Graph-Based Acoustic Clustering and Classification. - 1 online resource (114 pages)

Source: Dissertations Abstracts International, Volume: 85-04, Section: A.

Thesis (Ph.D.)--San Diego State University, 2023.

Includes bibliographical references

The rapid growth of audio data collection in various domains necessitates advanced techniques for efficient analysis and classification. This dissertation proposes new approaches for categorizing acoustic data, using both unsupervised and semi-supervised learning methods. Starting with raw audio, we preprocess the signal to segment it into time windows, each of which we consider as an independent data point. We use the short-time Fourier transform to describe the signal in a given time window as a set of Fourier coefficients. We interpret the resulting frequency signature as a high-dimensional feature description of each data point.We then develop a graph-based approach for analyzing these signals, representing the data using a similarity graph. Following methods used successfully in image processing and problems on networks, we apply a spectral embedding to project the high-dimensional graph data onto a low-dimensional subspace. We show how the Nystrom extension can accelerate the calculation of the eigenvectors of the graph Laplacian, and how to adapt the method to accommodate streaming data.Using the low-dimensional representation of the audio signal, we consider several clustering methods for categorizing the data. We compare results of the conventional spectral clustering algorithm, which applies \uD835\uDC3E-means to the eigenvectors of the Laplacian, with a semi-supervised implementation of \uD835\uDC3E-nearest neighbors on these eigenvectors. We also use an incremental reseeding algorithm that diffuses cluster labels across a graph, showing how its output can construct a novel reduced-dimensionality representation of the data. Based on this, we propose a semi-supervised extension of the method.Finally, we evaluate the effectiveness of our methodology on problems of classifying vehicles based on roadside microphone recordings and of classifying songs according to musical genre. We demonstrate the effects of spectral embedding on these problems, as well as the relative performance of both our unsupervised and semi-supervised algorithms. These results suggest that, even with little or no training data, graph-based methods can provide a powerful tool for acoustic analysis and for machine learning from acoustic signals.

Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2024

Mode of access: World Wide Web

ISBN: 9798380417167Subjects--Topical Terms:

573171
Computer science.
Subjects--Index Terms:

Acoustic dataIndex Terms--Genre/Form:

554714
Electronic books.

Graph-Based Acoustic Clustering and Classification.
LDR:03590ntm a22003977 4500 001 1143873
005 20240517105015.5
006 m o d
007 cr mn ---uuuuu
008 250605s2023 xx obm 000 0 eng d
020 $a 9798380417167
035 $a (MiAaPQ)AAI30688736
035 $a AAI30688736
040 $a MiAaPQ $b eng $c MiAaPQ $d NTU
100 1 $a Sunu, Justin. $3 1468681
245 1 0 $a Graph-Based Acoustic Clustering and Classification.
264 0 $c 2023
300 $a 1 online resource (114 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertations Abstracts International, Volume: 85-04, Section: A.
500 $a Advisor: Percus, Allon G.
502 $a Thesis (Ph.D.)--San Diego State University, 2023.
504 $a Includes bibliographical references
520 $a The rapid growth of audio data collection in various domains necessitates advanced techniques for efficient analysis and classification. This dissertation proposes new approaches for categorizing acoustic data, using both unsupervised and semi-supervised learning methods. Starting with raw audio, we preprocess the signal to segment it into time windows, each of which we consider as an independent data point. We use the short-time Fourier transform to describe the signal in a given time window as a set of Fourier coefficients. We interpret the resulting frequency signature as a high-dimensional feature description of each data point.We then develop a graph-based approach for analyzing these signals, representing the data using a similarity graph. Following methods used successfully in image processing and problems on networks, we apply a spectral embedding to project the high-dimensional graph data onto a low-dimensional subspace. We show how the Nystrom extension can accelerate the calculation of the eigenvectors of the graph Laplacian, and how to adapt the method to accommodate streaming data.Using the low-dimensional representation of the audio signal, we consider several clustering methods for categorizing the data. We compare results of the conventional spectral clustering algorithm, which applies \uD835\uDC3E-means to the eigenvectors of the Laplacian, with a semi-supervised implementation of \uD835\uDC3E-nearest neighbors on these eigenvectors. We also use an incremental reseeding algorithm that diffuses cluster labels across a graph, showing how its output can construct a novel reduced-dimensionality representation of the data. Based on this, we propose a semi-supervised extension of the method.Finally, we evaluate the effectiveness of our methodology on problems of classifying vehicles based on roadside microphone recordings and of classifying songs according to musical genre. We demonstrate the effects of spectral embedding on these problems, as well as the relative performance of both our unsupervised and semi-supervised algorithms. These results suggest that, even with little or no training data, graph-based methods can provide a powerful tool for acoustic analysis and for machine learning from acoustic signals.
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2024
538 $a Mode of access: World Wide Web
650 4 $a Computer science. $3 573171
650 4 $a Information technology. $3 559429
650 4 $a Information science. $3 561178
653 $a Acoustic data
653 $a Audio
653 $a Data collection
653 $a Graph
653 $a Machine learning
655 7 $a Electronic books. $2 local $3 554714
690 $a 0984
690 $a 0489
690 $a 0723
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a San Diego State University. $b Computational Science. $3 1468682
773 0 $t Dissertations Abstracts International $g 85-04A.
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30688736 $z click for full text (PQDT)