國立虎尾科技大學 |

Pro Hadoop data analytics = designing and building big data systems using the Hadoop ecosystem /

紀錄類型:	書目-語言資料,印刷品 : Monograph/item
正題名/作者:	Pro Hadoop data analytics/ by Kerry Koitzsch.
其他題名:	designing and building big data systems using the Hadoop ecosystem /
作者:	Koitzsch, Kerry.
出版者:	Berkeley, CA :Apress : : 2017.,
面頁冊數:	xxi, 298 p. :ill., digital ; : 24 cm.;
Contained By:	Springer eBooks
標題:	Database management. -
電子資源:	http://dx.doi.org/10.1007/978-1-4842-1910-2
ISBN:	9781484219102

Pro Hadoop data analytics = designing and building big data systems using the Hadoop ecosystem /
Koitzsch, Kerry.

Pro Hadoop data analyticsdesigning and building big data systems using the Hadoop ecosystem /[electronic resource] :by Kerry Koitzsch. - Berkeley, CA :Apress :2017. - xxi, 298 p. :ill., digital ;24 cm.

Chapter 1: Overview: Building Data Analytic Systems with Hadoop -- Chapter 2: A Scala and Python Refresher -- Chapter 3: Standard Toolkits for Hadoop and Analytics -- Chapter 4: Relational, noSQL, and Graph Databases -- Chapter 5: Data Pipelines and How to Construct Them -- Chapter 6: Advanced Search Techniques with Hadoop, Lucene, and Solr -- Chapter 7: An Overview of Analytical Techniques and Algorithms -- Chapter 8: Rule Engines, System Control, and System Orchestration -- Chapter 9: Putting it All Together: Designing a Complete Analytical System -- Chapter 10: Data Visualizers: Seeing and Interacting with the Analysis -- Chapter 11: A Case Study in Bioinformatics: Analyzing Microscope Slide Data -- Chapter 12: A Bayesian Analysis Software Component: Identifying Credit Card Fraud -- Chapter 13: Searching for Oil: Geological Data Analysis with Mahout -- Chapter 14: 'Image as Big Data' Systems: Some Case Studies -- Chapter 15: A Generic Data Pipeline Analytical System -- Chapter 16: Conclusions and The Future of Big Data Analysis.

Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics of classification, clustering, and recommendation. In Pro Hadoop Data Analytics best practices are emphasized to ensure coherent, efficient development. A complete example system will be developed using standard third-party components which will consist of the toolkits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system. The book emphasizes four important topics: The importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. Deep-dive topics will include Spark, H20, Vopal Wabbit (NLP), Stanford NLP, and other appropriate toolkits and plugins. Best practices and structured design principles. This will include strategic topics as well as the how to example portions. The importance of mix-and-match or hybrid systems, using different analytical components in one application to accomplish application goals. The hybrid approach will be prominent in the examples. Use of existing third-party libraries is key to effective development. Deep dive examples of the functionality of some of these toolkits will be showcased as you develop the example system.

ISBN: 9781484219102

Standard No.: 10.1007/978-1-4842-1910-2doiSubjects--Uniform Titles:

Apache Hadoop.
Subjects--Topical Terms:

557799
Database management.

LC Class. No.: QA76.9.D3

Dewey Class. No.: 005.74

Pro Hadoop data analytics = designing and building big data systems using the Hadoop ecosystem /
LDR:03506nam a2200289 a 4500 001 958086
003 DE-He213
005 20170721090149.0
006 m d
007 cr nn 008maaau
008 201118s2017 cau s 0 eng d
020 $a 9781484219102 $q (electronic bk.)
020 $a 9781484219096 $q (paper)
024 7 $a 10.1007/978-1-4842-1910-2 $2 doi
035 $a 978-1-4842-1910-2
040 $a GP $c GP
041 0 $a eng
050 4 $a QA76.9.D3
082 0 4 $a 005.74 $2 23
090 $a QA76.9.D3 $b K79 2017
100 1 $a Koitzsch, Kerry. $3 1250064
245 1 0 $a Pro Hadoop data analytics $h [electronic resource] : $b designing and building big data systems using the Hadoop ecosystem / $c by Kerry Koitzsch.
260 $a Berkeley, CA : $b Apress : $b Imprint: Apress, $c 2017.
300 $a xxi, 298 p. : $b ill., digital ; $c 24 cm.
505 0 $a Chapter 1: Overview: Building Data Analytic Systems with Hadoop -- Chapter 2: A Scala and Python Refresher -- Chapter 3: Standard Toolkits for Hadoop and Analytics -- Chapter 4: Relational, noSQL, and Graph Databases -- Chapter 5: Data Pipelines and How to Construct Them -- Chapter 6: Advanced Search Techniques with Hadoop, Lucene, and Solr -- Chapter 7: An Overview of Analytical Techniques and Algorithms -- Chapter 8: Rule Engines, System Control, and System Orchestration -- Chapter 9: Putting it All Together: Designing a Complete Analytical System -- Chapter 10: Data Visualizers: Seeing and Interacting with the Analysis -- Chapter 11: A Case Study in Bioinformatics: Analyzing Microscope Slide Data -- Chapter 12: A Bayesian Analysis Software Component: Identifying Credit Card Fraud -- Chapter 13: Searching for Oil: Geological Data Analysis with Mahout -- Chapter 14: 'Image as Big Data' Systems: Some Case Studies -- Chapter 15: A Generic Data Pipeline Analytical System -- Chapter 16: Conclusions and The Future of Big Data Analysis.
520 $a Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics of classification, clustering, and recommendation. In Pro Hadoop Data Analytics best practices are emphasized to ensure coherent, efficient development. A complete example system will be developed using standard third-party components which will consist of the toolkits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system. The book emphasizes four important topics: The importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. Deep-dive topics will include Spark, H20, Vopal Wabbit (NLP), Stanford NLP, and other appropriate toolkits and plugins. Best practices and structured design principles. This will include strategic topics as well as the how to example portions. The importance of mix-and-match or hybrid systems, using different analytical components in one application to accomplish application goals. The hybrid approach will be prominent in the examples. Use of existing third-party libraries is key to effective development. Deep dive examples of the functionality of some of these toolkits will be showcased as you develop the example system.
630 0 0 $a Apache Hadoop. $3 1025311
650 0 $a Database management. $3 557799
650 1 4 $a Computer Science. $3 593922
650 2 4 $a Big Data. $3 1017136
650 2 4 $a Programming Techniques. $3 669781
650 2 4 $a Programming Languages, Compilers, Interpreters. $3 669782
650 2 4 $a Data Mining and Knowledge Discovery. $3 677765
710 2 $a SpringerLink (Online service) $3 593884
773 0 $t Springer eBooks
856 4 0 $u http://dx.doi.org/10.1007/978-1-4842-1910-2
950 $a Professional and Applied Computing (Springer-12059)