ProQuest Information and Learning Co.
Hadoop Based Algorithm for Computing Linear Regression.
Record type:
Bibliographic - language material, manuscript : Monograph/item
Title proper/Author:
Hadoop Based Algorithm for Computing Linear Regression./
Author:
Wang, Tian.
Physical description:
1 online resource (57 pages)
Notes:
Source: Masters Abstracts International, Volume: 56-06.
Contained By:
Masters Abstracts International 56-06(E).
Subject:
Computer science.
Electronic resource:
click for full text (PQDT)
ISBN:
9780355157314
Thesis (M.S.)--Purdue University, 2017.
Includes bibliographical references
As machine learning and big data analysis play an increasingly important role in both industry and academia, researchers spend a large amount of time searching for accurate models that can help predict the trend of a given phenomenon. Current packages and functions in R, Hadoop, and RHadoop must access the entire data set each time a new set of parameters needs to be evaluated. This is extremely time-consuming when the data is large and disk I/O is slow. This study implemented a one-read-multiple-evaluation technique that can greatly reduce the time needed to find the best model among multiple sets of parameters. In the RHadoop test environment, finding the best Box-Cox transformed linear model from 41 potential parameters with the proposed approach was about 25 times faster than using the linear models on RHadoop when the training data set was about 12.4 GB. Results also showed that the scheme scales as the data grows and as more sets of parameters need to be compared.
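The abstract does not say how the one-read-multiple-evaluation technique is implemented. The following is only a minimal sketch of one plausible reading, not the thesis's code: it scans the data once in chunks, accumulates the sufficient statistics of ordinary least squares for every candidate Box-Cox parameter, and then scores each candidate without touching the data again. The function names (`box_cox`, `one_pass_box_cox`), the chunked input, and the use of the profile log-likelihood as the selection criterion are all assumptions made for illustration.

```python
import numpy as np

def box_cox(y, lam):
    """Box-Cox transform of a strictly positive response vector."""
    return np.log(y) if lam == 0 else (y ** lam - 1.0) / lam

def one_pass_box_cox(chunks, lambdas):
    """Hypothetical helper: pick the best lambda after a single scan of the data.

    chunks  -- iterable of (X, y) blocks; X already contains an intercept column
    lambdas -- candidate Box-Cox parameters to compare
    Returns (best_lambda, coefficients, profile_log_likelihood).
    """
    xtx = None
    n = 0
    sum_log_y = 0.0
    for X, y in chunks:                      # the only pass over the data
        if xtx is None:
            p = X.shape[1]
            xtx = np.zeros((p, p))           # X'X is shared by all lambdas
            xty = np.zeros((len(lambdas), p))
            yty = np.zeros(len(lambdas))
        xtx += X.T @ X
        n += len(y)
        sum_log_y += np.log(y).sum()         # needed for the Jacobian term
        for k, lam in enumerate(lambdas):
            z = box_cox(y, lam)
            xty[k] += X.T @ z                # X'z for this lambda
            yty[k] += z @ z                  # z'z for this lambda

    results = []
    for k, lam in enumerate(lambdas):        # evaluation needs no further reads
        beta = np.linalg.solve(xtx, xty[k])
        rss = yty[k] - beta @ xty[k]         # residual sum of squares of the OLS fit
        loglik = -0.5 * n * np.log(rss / n) + (lam - 1.0) * sum_log_y
        results.append((lam, beta, loglik))
    return max(results, key=lambda r: r[2])
```

In a MapReduce setting the per-chunk sums would presumably be produced by mappers and added together by a reducer, since each accumulated quantity is a plain sum over rows. Computing the residual sum of squares as a difference of accumulated sums can lose precision when the fit is nearly exact, so a production version would need a more careful formulation.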
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355157314
Subjects--Topical Terms:
573171
Computer science.
Index Terms--Genre/Form:
554714
Electronic books.
LDR  :02204ntm a2200325Ki 4500
001  919568
005  20181129115238.5
006  m o u
007  cr mn||||a|a||
008  190606s2017 xx obm 000 0 eng d
020  $a 9780355157314
035  $a (MiAaPQ)AAI10271743
035  $a (MiAaPQ)purdue:21299
035  $a AAI10271743
040  $a MiAaPQ $b eng $c MiAaPQ $d NTU
100  1  $a Wang, Tian. $3 1194178
245  1 0  $a Hadoop Based Algorithm for Computing Linear Regression.
264  0  $c 2017
300  $a 1 online resource (57 pages)
336  $a text $b txt $2 rdacontent
337  $a computer $b c $2 rdamedia
338  $a online resource $b cr $2 rdacarrier
500  $a Source: Masters Abstracts International, Volume: 56-06.
500  $a Advisers: Baijian Yang; TongLin Zhang.
502  $a Thesis (M.S.)--Purdue University, 2017.
504  $a Includes bibliographical references
520  $a As machine learning and big data analysis play an increasingly important role in both industry and academia, researchers spend a large amount of time searching for accurate models that can help predict the trend of a given phenomenon. Current packages and functions in R, Hadoop, and RHadoop must access the entire data set each time a new set of parameters needs to be evaluated. This is extremely time-consuming when the data is large and disk I/O is slow. This study implemented a one-read-multiple-evaluation technique that can greatly reduce the time needed to find the best model among multiple sets of parameters. In the RHadoop test environment, finding the best Box-Cox transformed linear model from 41 potential parameters with the proposed approach was about 25 times faster than using the linear models on RHadoop when the training data set was about 12.4 GB. Results also showed that the scheme scales as the data grows and as more sets of parameters need to be compared.
533  $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2018
538  $a Mode of access: World Wide Web
650  4  $a Computer science. $3 573171
655  7  $a Electronic books. $2 local $3 554714
690  $a 0984
710  2  $a ProQuest Information and Learning Co. $3 1178819
710  2  $a Purdue University. $b Computer and Information Technology. $3 1180535
773  0  $t Masters Abstracts International $g 56-06(E).
856  4 0  $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10271743 $z click for full text (PQDT)