語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Data-intensive workflow management :...
~
Pacitti, Esther,
Data-intensive workflow management : = for clouds and data-intensive and scalable computing environments /
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Data-intensive workflow management :/ Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti.
其他題名:
for clouds and data-intensive and scalable computing environments /
作者:
De Oliveira, Daniel C. M.,
其他作者:
Liu, Ji,
面頁冊數:
1 PDF (xvii, 161 pages) :illustrations. :
附註:
Part of: Synthesis digital library of engineering and computer science.
標題:
Cloud computing. -
電子資源:
https://doi.org/10.2200/S00915ED1V01Y201904DTM060
電子資源:
https://ieeexplore.ieee.org/servlet/opac?bknumber=8715841
ISBN:
9781681735580
Data-intensive workflow management : = for clouds and data-intensive and scalable computing environments /
De Oliveira, Daniel C. M.,
Data-intensive workflow management :
for clouds and data-intensive and scalable computing environments /Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti. - 1 PDF (xvii, 161 pages) :illustrations. - Synthesis lectures on data management,#602153-5426 ;. - Synthesis digital library of engineering and computer science..
Part of: Synthesis digital library of engineering and computer science.
Includes bibliographical references (pages 133-160).
1. Overview -- 1.1. Motivating examples -- 1.2. The life cycle of cloud and disc workflows -- 1.3. Structure of the book
Abstract freely available; full-text restricted to subscribers or individual document purchasers.
Compendex
Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions.
Mode of access: World Wide Web.
ISBN: 9781681735580
Standard No.: 10.2200/S00915ED1V01Y201904DTM060doiSubjects--Topical Terms:
716205
Cloud computing.
Subjects--Index Terms:
scientific workflows
LC Class. No.: QA76.585 / .D426 2019eb
Dewey Class. No.: 004.67/82
Data-intensive workflow management : = for clouds and data-intensive and scalable computing environments /
LDR
:05735nam a2200589 i 4500
001
959762
003
IEEE
005
20190606190106.0
006
m eo d
007
cr bn |||m|||a
008
201209s2019 caua fob 000 0 eng d
020
$a
9781681735580
$q
electronic
020
$z
9781681735597
$q
hardcover
020
$z
9781681735573
$q
paperback
024
7
$a
10.2200/S00915ED1V01Y201904DTM060
$2
doi
035
$a
(CaBNVSL)thg00979011
035
$a
(OCoLC)1102007651
035
$a
8715841
040
$a
CaBNVSL
$b
eng
$e
rda
$c
CaBNVSL
$d
CaBNVSL
050
4
$a
QA76.585
$b
.D426 2019eb
082
0 4
$a
004.67/82
$2
23
100
1
$a
De Oliveira, Daniel C. M.,
$e
author.
$3
1253101
245
1 0
$a
Data-intensive workflow management :
$b
for clouds and data-intensive and scalable computing environments /
$c
Daniel C.M. de Oliveira, Ji Liu, Esther Pacitti.
264
1
$a
[San Rafael, California] :
$b
Morgan & Claypool,
$c
[2019]
300
$a
1 PDF (xvii, 161 pages) :
$b
illustrations.
336
$a
text
$2
rdacontent
337
$a
electronic
$2
isbdmedia
338
$a
online resource
$2
rdacarrier
490
1
$a
Synthesis lectures on data management,
$x
2153-5426 ;
$v
#60
500
$a
Part of: Synthesis digital library of engineering and computer science.
504
$a
Includes bibliographical references (pages 133-160).
505
0
$a
1. Overview -- 1.1. Motivating examples -- 1.2. The life cycle of cloud and disc workflows -- 1.3. Structure of the book
505
8
$a
2. Background knowledge -- 2.1. Key concepts -- 2.2. Distributed environments used for executing workflows -- 2.3. Conclusion
505
8
$a
3. Workflow execution in a single-site cloud -- 3.1. Bibliographic and historical notes -- 3.2. Multi-objective cost model -- 3.3. Single-site virtual machine provisioning (SSVP) -- 3.4. Sgreedy scheduling algorithm -- 3.5. Evaluating SSVP and SGreedy -- 3.6. Conclusion
505
8
$a
4. Workflow execution in a multi-site cloud -- 4.1. Overview of workflow execution in a multi-site cloud -- 4.2. Fine-grained workflow execution -- 4.3. Coarse-grained workflow execution with multiple objectives -- 4.4. Conclusion
505
8
$a
5. Workflow execution in disc environments -- 5.1. Bibliographic and historical notes -- 5.2. Fine tuning of spark parameters -- 5.3. Provenance management in Apache Spark -- 5.4. Scheduling Spark workflows in DISC environments -- 5.5. Conclusion -- 6. Conclusion.
506
$a
Abstract freely available; full-text restricted to subscribers or individual document purchasers.
510
0
$a
Compendex
510
0
$a
INSPEC
510
0
$a
Google scholar
510
0
$a
Google book search
520
3
$a
Workflows may be defined as abstractions used to model the coherent flow of activities in the context of an in silico scientific experiment. They are employed in many domains of science such as bioinformatics, astronomy, and engineering. Such workflows usually present a considerable number of activities and activations (i.e., tasks associated with activities) and may need a long time for execution. Due to the continuous need to store and process data efficiently (making them data-intensive workflows), high-performance computing environments allied to parallelization techniques are used to run these workflows. At the beginning of the 2010s, cloud technologies emerged as a promising environment to run scientific workflows. By using clouds, scientists have expanded beyond single parallel computers to hundreds or even thousands of virtual machines. More recently, Data-Intensive Scalable Computing (DISC) frameworks (e.g., Apache Spark and Hadoop) and environments emerged and are being used to execute data-intensive workflows. DISC environments are composed of processors and disks in large-commodity computing clusters connected using high-speed communications switches and networks. The main advantage of DISC frameworks is that they support and grant efficient in-memory data management for large-scale applications, such as data-intensive workflows. However, the execution of workflows in cloud and DISC environments raise many challenges such as scheduling workflow activities and activations, managing produced data, collecting provenance data, etc. Several existing approaches deal with the challenges mentioned earlier. This way, there is a real need for understanding how to manage these workflows and various big data platforms that have been developed and introduced. As such, this book can help researchers understand how linking workflow management with Data-Intensive Scalable Computing can help in understanding and analyzing scientific big data. In this book, we aim to identify and distill the body of work on workflow management in clouds and DISC environments. We start by discussing the basic principles of data-intensive scientific workflows. Next, we present two workflows that are executed in a single site and multi-site clouds taking advantage of provenance. Afterward, we go towards workflow management in DISC environments, and we present, in detail, solutions that enable the optimized execution of the workflow using frameworks such as Apache Spark and its extensions.
530
$a
Also available in print.
538
$a
Mode of access: World Wide Web.
538
$a
System requirements: Adobe Acrobat Reader.
588
$a
Title from PDF title page (viewed on May 29, 2019).
650
0
$a
Cloud computing.
$3
716205
650
0
$a
Database management.
$3
557799
653
$a
scientific workflows
653
$a
cloud computing
653
$a
Data-Intensive Scalable Computing
653
$a
data provenance
653
$a
Apache Spark
700
1
$a
Liu, Ji,
$e
author.
$3
1253102
700
1
$a
Pacitti, Esther,
$e
author.
$3
1253103
776
0 8
$i
Print version:
$z
9781681735597
$z
9781681735573
830
0
$a
Synthesis digital library of engineering and computer science.
$3
598254
830
0
$a
Synthesis lectures on data management ;
$v
#36.
$3
931356
856
4 0
$3
Abstract with links to full text
$u
https://doi.org/10.2200/S00915ED1V01Y201904DTM060
856
4 2
$3
Abstract with links to resource
$u
https://ieeexplore.ieee.org/servlet/opac?bknumber=8715841
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入