Anderson, Joseph Timothy.
Geometric Methods for Robust Data Analysis in High Dimension.
Record type:
Bibliographic - language material, manuscript : Monograph/item
Title/Author:
Geometric Methods for Robust Data Analysis in High Dimension.
Author:
Anderson, Joseph Timothy.
Physical description:
1 online resource (178 pages)
Notes:
Source: Dissertation Abstracts International, Volume: 78-10(E), Section: B.
Contained By:
Dissertation Abstracts International 78-10B(E).
Subject:
Applied mathematics.
Electronic resource:
click for full text (PQDT)
ISBN:
9781369838626
Thesis (Ph.D.)
Includes bibliographical references
Data-driven applications are growing. Machine learning and data analysis now find both scientific and industrial applications in biology, chemistry, geology, medicine, and physics. These applications rely on large quantities of data gathered from automated sensors and user input. Furthermore, the dimensionality of many datasets is extreme: more details are being gathered about single user interactions or sensor readings. All of these applications encounter problems with a common theme: use observed data to make inferences about the world. Our work obtains the first provably efficient algorithms for Independent Component Analysis (ICA) in the presence of heavy-tailed data. The main tool in this result is the centroid body (a well-known topic in convex geometry), along with optimization and random walks for sampling from a convex body. This is the first algorithmic use of the centroid body and it is of independent theoretical interest, since it effectively replaces the estimation of covariance from samples and is more generally accessible.
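The ICA setting mentioned in the abstract can be illustrated with a small sketch. This is not the dissertation's algorithm; it only generates data from the standard ICA model x = A s, where the coordinates of s are independent. The function name `sample_ica`, the mixing matrix `A`, and the choice of a Student-t distribution with 1.5 degrees of freedom (finite mean, infinite variance) are assumptions made for illustration of what "heavy-tailed" can mean here.

```python
import numpy as np

def sample_ica(mixing, n_samples, df=1.5, seed=0):
    """Draw samples x = A s with independent heavy-tailed sources s.

    Hypothetical helper: each source coordinate is Student-t with `df`
    degrees of freedom. For df <= 2 the variance is infinite, so the
    sample covariance of x does not converge as n_samples grows.
    """
    rng = np.random.default_rng(seed)
    dim = mixing.shape[1]
    # Rows are samples, columns are independent source coordinates.
    sources = rng.standard_t(df, size=(n_samples, dim))
    # Apply the linear mixing: each observed row is A @ s.
    return sources @ mixing.T, sources

# Assumed 2x2 invertible mixing matrix, for illustration only.
A = np.array([[2.0, 1.0],
              [0.5, 1.5]])
X, S = sample_ica(A, 10_000)

# If A were known, the sources could be recovered by inverting it.
# An ICA algorithm has to estimate A from X alone.
S_recovered = np.linalg.solve(A, X.T).T
```

The sketch only shows the data model. Recovering `A` from `X` without a finite covariance is the problem the abstract describes.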
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9781369838626
Subjects--Topical Terms:
1069907
Applied mathematics.
Index Terms--Genre/Form:
554714
Electronic books.
LDR 03767ntm a2200373Ki 4500
001 909878
005 20180426091049.5
006 m o u
007 cr mn||||a|a||
008 190606s2017 xx obm 000 0 eng d
020 $a 9781369838626
035 $a (MiAaPQ)AAI10610128
035 $a (MiAaPQ)OhioLINK:osu1488372786126891
035 $a AAI10610128
040 $a MiAaPQ $b eng $c MiAaPQ
099 $a TUL $f hyy $c available through World Wide Web
100 1 $a Anderson, Joseph Timothy. $3 1180872
245 1 0 $a Geometric Methods for Robust Data Analysis in High Dimension.
264 0 $c 2017
300 $a 1 online resource (178 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertation Abstracts International, Volume: 78-10(E), Section: B.
500 $a Advisers: Luis Rademacher; Anastasios Sidiropoulos.
502 $a Thesis (Ph.D.) $c The Ohio State University $d 2017.
504 $a Includes bibliographical references
520 $a Data-driven applications are growing. Machine learning and data analysis now find both scientific and industrial applications in biology, chemistry, geology, medicine, and physics. These applications rely on large quantities of data gathered from automated sensors and user input. Furthermore, the dimensionality of many datasets is extreme: more details are being gathered about single user interactions or sensor readings. All of these applications encounter problems with a common theme: use observed data to make inferences about the world. Our work obtains the first provably efficient algorithms for Independent Component Analysis (ICA) in the presence of heavy-tailed data. The main tool in this result is the centroid body (a well-known topic in convex geometry), along with optimization and random walks for sampling from a convex body. This is the first algorithmic use of the centroid body and it is of independent theoretical interest, since it effectively replaces the estimation of covariance from samples and is more generally accessible.
520 $a We demonstrate that ICA is itself a powerful geometric primitive. That is, having access to an efficient algorithm for ICA enables us to efficiently solve other important problems in machine learning. The first such reduction is a solution to the open problem of efficiently learning the intersection of n + 1 halfspaces in R^n, posed in [43]. This reduction relies on a non-linear transformation of samples from such an intersection of halfspaces (i.e., a simplex) to samples which are approximately from a linearly transformed product distribution. Through this transformation of samples, which can be done efficiently, one can then use an ICA algorithm to recover the vertices of the intersection of halfspaces.
520 $a Finally, we again use ICA as an algorithmic primitive to construct an efficient solution to the widely studied problem of learning the parameters of a Gaussian mixture model. Our algorithm again transforms samples from a Gaussian mixture model into samples which fit into the ICA model and, when processed by an ICA algorithm, result in recovery of the mixture parameters. Our algorithm is effective even when the number of Gaussians in the mixture grows with the ambient dimension, even polynomially in the dimension. In addition to the efficient parameter estimation, we also obtain a complexity lower bound for a low-dimensional Gaussian mixture model.
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2018
538 $a Mode of access: World Wide Web
650 4 $a Applied mathematics. $3 1069907
650 4 $a Computer science. $3 573171
655 7 $a Electronic books. $2 local $3 554714
690 $a 0364
690 $a 0984
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a The Ohio State University. $b Computer Science and Engineering. $3 1180873
773 0 $t Dissertation Abstracts International $g 78-10B(E).
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10610128 $z click for full text (PQDT)