語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Algorithms for Data Science
~
Chandler, John.
Algorithms for Data Science
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Algorithms for Data Science/ by Brian Steele, John Chandler, Swarna Reddy.
作者:
Steele, Brian.
其他作者:
Chandler, John.
面頁冊數:
XXIII, 430 p. 48 illus., 30 illus. in color.online resource. :
Contained By:
Springer Nature eBook
標題:
Data mining. -
電子資源:
https://doi.org/10.1007/978-3-319-45797-0
ISBN:
9783319457970
Algorithms for Data Science
Steele, Brian.
Algorithms for Data Science
[electronic resource] /by Brian Steele, John Chandler, Swarna Reddy. - 1st ed. 2016. - XXIII, 430 p. 48 illus., 30 illus. in color.online resource.
Introduction -- Data Mapping and Data Dictionaries -- Scalable Algorithms and Associative Statistics -- Hadoop and MapReduce -- Data Visualization -- Linear Regression Methods -- Healthcare Analytics -- Cluster Analysis -- k-Nearest Neighbor Prediction Functions -- The Multinomial Naive Bayes Prediction Function -- Forecasting -- Real-time Analytics.
This textbook on practical data analytics unites fundamental principles, algorithms, and data. Algorithms are the keystone of data analytics and the focal point of this textbook. Clear and intuitive explanations of the mathematical and statistical foundations make the algorithms transparent. But practical data analytics requires more than just the foundations. Problems and data are enormously variable and only the most elementary of algorithms can be used without modification. Programming fluency and experience with real and challenging data is indispensable and so the reader is immersed in Python and R and real data analysis. By the end of the book, the reader will have gained the ability to adapt algorithms to new problems and carry out innovative analyses. This book has three parts: (a) Data Reduction: Begins with the concepts of data reduction, data maps, and information extraction. The second chapter introduces associative statistics, the mathematical foundation of scalable algorithms and distributed computing. Practical aspects of distributed computing is the subject of the Hadoop and MapReduce chapter. (b) Extracting Information from Data: Linear regression and data visualization are the principal topics of Part II. The authors dedicate a chapter to the critical domain of Healthcare Analytics for an extended example of practical data analytics. The algorithms and analytics will be of much interest to practitioners interested in utilizing the large and unwieldly data sets of the Centers for Disease Control and Prevention's Behavioral Risk Factor Surveillance System. (c) Predictive Analytics Two foundational and widely used algorithms, k-nearest neighbors and naive Bayes, are developed in detail. A chapter is dedicated to forecasting. The last chapter focuses on streaming data and uses publicly accessible data streams originating from the Twitter API and the NASDAQ stock market in the tutorials. This book is intended for a one- or two-semester course in data analytics for upper-division undergraduate and graduate students in mathematics, statistics, and computer science. The prerequisites are kept low, and students with one or two courses in probability or statistics, an exposure to vectors and matrices, and a programming course will have no difficulty. The core material of every chapter is accessible to all with these prerequisites. The chapters often expand at the close with innovations of interest to practitioners of data science. Each chapter includes exercises of varying levels of difficulty. The text is eminently suitable for self-study and an exceptional resource for practitioners.
ISBN: 9783319457970
Standard No.: 10.1007/978-3-319-45797-0doiSubjects--Topical Terms:
528622
Data mining.
LC Class. No.: QA76.9.D343
Dewey Class. No.: 006.312
Algorithms for Data Science
LDR
:04368nam a22004095i 4500
001
977360
003
DE-He213
005
20200630120247.0
007
cr nn 008mamaa
008
201211s2016 gw | s |||| 0|eng d
020
$a
9783319457970
$9
978-3-319-45797-0
024
7
$a
10.1007/978-3-319-45797-0
$2
doi
035
$a
978-3-319-45797-0
050
4
$a
QA76.9.D343
072
7
$a
UNF
$2
bicssc
072
7
$a
COM021030
$2
bisacsh
072
7
$a
UNF
$2
thema
072
7
$a
UYQE
$2
thema
082
0 4
$a
006.312
$2
23
100
1
$a
Steele, Brian.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1116263
245
1 0
$a
Algorithms for Data Science
$h
[electronic resource] /
$c
by Brian Steele, John Chandler, Swarna Reddy.
250
$a
1st ed. 2016.
264
1
$a
Cham :
$b
Springer International Publishing :
$b
Imprint: Springer,
$c
2016.
300
$a
XXIII, 430 p. 48 illus., 30 illus. in color.
$b
online resource.
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
347
$a
text file
$b
PDF
$2
rda
505
0
$a
Introduction -- Data Mapping and Data Dictionaries -- Scalable Algorithms and Associative Statistics -- Hadoop and MapReduce -- Data Visualization -- Linear Regression Methods -- Healthcare Analytics -- Cluster Analysis -- k-Nearest Neighbor Prediction Functions -- The Multinomial Naive Bayes Prediction Function -- Forecasting -- Real-time Analytics.
520
$a
This textbook on practical data analytics unites fundamental principles, algorithms, and data. Algorithms are the keystone of data analytics and the focal point of this textbook. Clear and intuitive explanations of the mathematical and statistical foundations make the algorithms transparent. But practical data analytics requires more than just the foundations. Problems and data are enormously variable and only the most elementary of algorithms can be used without modification. Programming fluency and experience with real and challenging data is indispensable and so the reader is immersed in Python and R and real data analysis. By the end of the book, the reader will have gained the ability to adapt algorithms to new problems and carry out innovative analyses. This book has three parts: (a) Data Reduction: Begins with the concepts of data reduction, data maps, and information extraction. The second chapter introduces associative statistics, the mathematical foundation of scalable algorithms and distributed computing. Practical aspects of distributed computing is the subject of the Hadoop and MapReduce chapter. (b) Extracting Information from Data: Linear regression and data visualization are the principal topics of Part II. The authors dedicate a chapter to the critical domain of Healthcare Analytics for an extended example of practical data analytics. The algorithms and analytics will be of much interest to practitioners interested in utilizing the large and unwieldly data sets of the Centers for Disease Control and Prevention's Behavioral Risk Factor Surveillance System. (c) Predictive Analytics Two foundational and widely used algorithms, k-nearest neighbors and naive Bayes, are developed in detail. A chapter is dedicated to forecasting. The last chapter focuses on streaming data and uses publicly accessible data streams originating from the Twitter API and the NASDAQ stock market in the tutorials. This book is intended for a one- or two-semester course in data analytics for upper-division undergraduate and graduate students in mathematics, statistics, and computer science. The prerequisites are kept low, and students with one or two courses in probability or statistics, an exposure to vectors and matrices, and a programming course will have no difficulty. The core material of every chapter is accessible to all with these prerequisites. The chapters often expand at the close with innovations of interest to practitioners of data science. Each chapter includes exercises of varying levels of difficulty. The text is eminently suitable for self-study and an exceptional resource for practitioners.
650
0
$a
Data mining.
$3
528622
650
0
$a
Statistics .
$3
1253516
650
0
$a
Computer science—Mathematics.
$3
1253519
650
0
$a
Health informatics.
$3
1064466
650
1 4
$a
Data Mining and Knowledge Discovery.
$3
677765
650
2 4
$a
Statistics and Computing/Statistics Programs.
$3
669775
650
2 4
$a
Mathematics of Computing.
$3
669457
650
2 4
$a
Health Informatics.
$3
593963
700
1
$a
Chandler, John.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1116264
700
1
$a
Reddy, Swarna.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1116265
710
2
$a
SpringerLink (Online service)
$3
593884
773
0
$t
Springer Nature eBook
776
0 8
$i
Printed edition:
$z
9783319457956
776
0 8
$i
Printed edition:
$z
9783319457963
776
0 8
$i
Printed edition:
$z
9783319833736
856
4 0
$u
https://doi.org/10.1007/978-3-319-45797-0
912
$a
ZDB-2-SCS
912
$a
ZDB-2-SXCS
950
$a
Computer Science (SpringerNature-11645)
950
$a
Computer Science (R0) (SpringerNature-43710)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入