語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
A Review of 'Big Data' Variable Sele...
~
ProQuest Information and Learning Co.
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling.
紀錄類型:
書目-語言資料,手稿 : Monograph/item
正題名/作者:
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling./
作者:
Papke, Sarah.
面頁冊數:
1 online resource (69 pages)
附註:
Source: Masters Abstracts International, Volume: 56-06.
Contained By:
Masters Abstracts International56-06(E).
標題:
Statistics. -
電子資源:
click for full text (PQDT)
ISBN:
9780355220292
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling.
Papke, Sarah.
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling.
- 1 online resource (69 pages)
Source: Masters Abstracts International, Volume: 56-06.
Thesis (M.S.)
Includes bibliographical references
Several problems arise when attempting to use traditional predictive modeling techniques on 'big data.' For instance, multiple linear regression models cannot be used on datasets with hundreds of variables. However several techniques are becoming common tools for selective inference as the need for analyzing big data increases. Forward selection and penalized regression models (such as LASSO, Ridge Regression, and Elastic Net) are simple modi?cations of multiple linear regression that can provide some guidance on simplifying a model through variable selection. Dimension reducing techniques, such as Partial Least Squares and Principal Components Analysis, are more complex than regression but have the ability to handle highly correlated independent variables. Each of the aforementioned techniques are valuable in predictive modeling if used properly. This paper provides a mathematical introduction to these developments in selective inference. A sample dataset is used to demonstrate modeling and interpretation. Further, the applications to big data, as well as advantages and disadvantages of each procedure, are discussed.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355220292Subjects--Topical Terms:
556824
Statistics.
Index Terms--Genre/Form:
554714
Electronic books.
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling.
LDR
:02417ntm a2200361Ki 4500
001
908976
005
20180419104823.5
006
m o u
007
cr mn||||a|a||
008
190606s2017 xx obm 000 0 eng d
020
$a
9780355220292
035
$a
(MiAaPQ)AAI10281816
035
$a
(MiAaPQ)duquesne:10975
035
$a
AAI10281816
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
099
$a
TUL
$f
hyy
$c
available through World Wide Web
100
1
$a
Papke, Sarah.
$3
1179432
245
1 2
$a
A Review of 'Big Data' Variable Selection Procedures for Use in Predictive Modeling.
264
0
$c
2017
300
$a
1 online resource (69 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Masters Abstracts International, Volume: 56-06.
500
$a
Adviser: Frank D'Amico.
502
$a
Thesis (M.S.)
$c
Duquesne University
$d
2017.
504
$a
Includes bibliographical references
520
$a
Several problems arise when attempting to use traditional predictive modeling techniques on 'big data.' For instance, multiple linear regression models cannot be used on datasets with hundreds of variables. However several techniques are becoming common tools for selective inference as the need for analyzing big data increases. Forward selection and penalized regression models (such as LASSO, Ridge Regression, and Elastic Net) are simple modi?cations of multiple linear regression that can provide some guidance on simplifying a model through variable selection. Dimension reducing techniques, such as Partial Least Squares and Principal Components Analysis, are more complex than regression but have the ability to handle highly correlated independent variables. Each of the aforementioned techniques are valuable in predictive modeling if used properly. This paper provides a mathematical introduction to these developments in selective inference. A sample dataset is used to demonstrate modeling and interpretation. Further, the applications to big data, as well as advantages and disadvantages of each procedure, are discussed.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2018
538
$a
Mode of access: World Wide Web
650
4
$a
Statistics.
$3
556824
650
4
$a
Mathematics.
$3
527692
650
4
$a
Information science.
$3
561178
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0463
690
$a
0405
690
$a
0723
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
Duquesne University.
$b
Computational Mathematics.
$3
1179433
773
0
$t
Masters Abstracts International
$g
56-06(E).
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10281816
$z
click for full text (PQDT)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入