Language:
English
繁體中文
Help
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Beginning Apache Spark Using Azure D...
~
SpringerLink (Online service)
Beginning Apache Spark Using Azure Databricks = Unleashing Large Cluster Analytics in the Cloud /
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Beginning Apache Spark Using Azure Databricks/ by Robert Ilijason.
Reminder of title:
Unleashing Large Cluster Analytics in the Cloud /
Author:
Ilijason, Robert.
Description:
XVII, 274 p. 14 illus.online resource. :
Contained By:
Springer Nature eBook
Subject:
Big data. -
Online resource:
https://doi.org/10.1007/978-1-4842-5781-4
ISBN:
9781484257814
Beginning Apache Spark Using Azure Databricks = Unleashing Large Cluster Analytics in the Cloud /
Ilijason, Robert.
Beginning Apache Spark Using Azure Databricks
Unleashing Large Cluster Analytics in the Cloud /[electronic resource] :by Robert Ilijason. - 1st ed. 2020. - XVII, 274 p. 14 illus.online resource.
Chapter 1: Introduction to Large-Scale Data Analytics -- Chapter 2: Spark and Databricks -- Chapter 3: Getting Started with Databricks -- Chapter 4: Workspaces, Clusters, and Notebooks -- Chapter 5: Getting Data into Databricks -- Chapter 6: Querying Data Using SQL -- Chapter 7: The Power of Python -- Chapter 8: ETL and Advanced Data Wrangling -- Chapter 9: Connecting to and from Afar -- Chapter 10: Running in Production -- Chapter 11: Bits and Pieces.
Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incrementally faster. This book explains how the confluence of these pivotal technologies gives you enormous power, and cheaply, when it comes to huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large amounts of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can enable all those CPUs for data analytics use. Finally, you will see how services such as Databricks provide the power of Apache Spark, without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data. This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned. What You Will Learn Discover the value of big data analytics that leverage the power of the cloud Get started with Databricks using SQL and Python in either Microsoft Azure or AWS Understand the underlying technology, and how the cloud and Spark fit into the bigger picture See how these tools are used in the real world Run basic analytics, including machine learning, on billions of rows at a fraction of a cost or free This book is for data engineers, data scientists, and cloud architects who want or need to run advanced analytics in the cloud. It is assumed that the reader has data experience, but perhaps minimal exposure to Apache Spark and Azure Databricks. The book is also recommended for people who want to get started in the analytics field, as it provides a strong foundation. Robert Ilijason is a 20-year veteran in the business intelligence (BI) segment. He has worked as a contractor for some of Europe’s biggest companies and has conducted large-scale analytics projects within the areas of retail, telecom, banking, government, and more. Robert has seen his share of analytic trends come and go over the years, but unlike most of them, he strongly believes that Apache Spark in the cloud, especially with Azure Databricks, is a game changer.
ISBN: 9781484257814
Standard No.: 10.1007/978-1-4842-5781-4doiSubjects--Topical Terms:
981821
Big data.
LC Class. No.: HF5548.125-5548.6
Dewey Class. No.: 658.4038
Beginning Apache Spark Using Azure Databricks = Unleashing Large Cluster Analytics in the Cloud /
LDR
:04488nam a22003855i 4500
001
1028593
003
DE-He213
005
20200630153536.0
007
cr nn 008mamaa
008
210318s2020 xxu| s |||| 0|eng d
020
$a
9781484257814
$9
978-1-4842-5781-4
024
7
$a
10.1007/978-1-4842-5781-4
$2
doi
035
$a
978-1-4842-5781-4
050
4
$a
HF5548.125-5548.6
072
7
$a
KJQ
$2
bicssc
072
7
$a
BUS070030
$2
bisacsh
072
7
$a
KJQ
$2
thema
082
0 4
$a
658.4038
$2
23
100
1
$a
Ilijason, Robert.
$e
author.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1325168
245
1 0
$a
Beginning Apache Spark Using Azure Databricks
$h
[electronic resource] :
$b
Unleashing Large Cluster Analytics in the Cloud /
$c
by Robert Ilijason.
250
$a
1st ed. 2020.
264
1
$a
Berkeley, CA :
$b
Apress :
$b
Imprint: Apress,
$c
2020.
300
$a
XVII, 274 p. 14 illus.
$b
online resource.
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
347
$a
text file
$b
PDF
$2
rda
505
0
$a
Chapter 1: Introduction to Large-Scale Data Analytics -- Chapter 2: Spark and Databricks -- Chapter 3: Getting Started with Databricks -- Chapter 4: Workspaces, Clusters, and Notebooks -- Chapter 5: Getting Data into Databricks -- Chapter 6: Querying Data Using SQL -- Chapter 7: The Power of Python -- Chapter 8: ETL and Advanced Data Wrangling -- Chapter 9: Connecting to and from Afar -- Chapter 10: Running in Production -- Chapter 11: Bits and Pieces.
520
$a
Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incrementally faster. This book explains how the confluence of these pivotal technologies gives you enormous power, and cheaply, when it comes to huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large amounts of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can enable all those CPUs for data analytics use. Finally, you will see how services such as Databricks provide the power of Apache Spark, without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data. This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned. What You Will Learn Discover the value of big data analytics that leverage the power of the cloud Get started with Databricks using SQL and Python in either Microsoft Azure or AWS Understand the underlying technology, and how the cloud and Spark fit into the bigger picture See how these tools are used in the real world Run basic analytics, including machine learning, on billions of rows at a fraction of a cost or free This book is for data engineers, data scientists, and cloud architects who want or need to run advanced analytics in the cloud. It is assumed that the reader has data experience, but perhaps minimal exposure to Apache Spark and Azure Databricks. The book is also recommended for people who want to get started in the analytics field, as it provides a strong foundation. Robert Ilijason is a 20-year veteran in the business intelligence (BI) segment. He has worked as a contractor for some of Europe’s biggest companies and has conducted large-scale analytics projects within the areas of retail, telecom, banking, government, and more. Robert has seen his share of analytic trends come and go over the years, but unlike most of them, he strongly believes that Apache Spark in the cloud, especially with Azure Databricks, is a game changer.
650
0
$a
Big data.
$3
981821
650
0
$a
Microsoft software.
$3
1253736
650
0
$a
Microsoft .NET Framework.
$3
565417
650
0
$a
Open source software.
$3
561177
650
0
$a
Computer programming.
$3
527822
650
1 4
$a
Big Data/Analytics.
$3
1106909
650
2 4
$a
Microsoft and .NET.
$3
1114109
650
2 4
$a
Open Source.
$3
1113081
710
2
$a
SpringerLink (Online service)
$3
593884
773
0
$t
Springer Nature eBook
776
0 8
$i
Printed edition:
$z
9781484257807
776
0 8
$i
Printed edition:
$z
9781484257821
856
4 0
$u
https://doi.org/10.1007/978-1-4842-5781-4
912
$a
ZDB-2-BUM
912
$a
ZDB-2-SXBM
950
$a
Business and Management (SpringerNature-41169)
950
$a
Business and Management (R0) (SpringerNature-43719)
based on 0 review(s)
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login