語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
A Data Science Perspective on Searches for New Physics at the LHC.
紀錄類型:
書目-語言資料,手稿 : Monograph/item
正題名/作者:
A Data Science Perspective on Searches for New Physics at the LHC./
作者:
Espejo Morales, Irina.
面頁冊數:
1 online resource (128 pages)
附註:
Source: Dissertations Abstracts International, Volume: 85-04, Section: B.
Contained By:
Dissertations Abstracts International85-04B.
標題:
Computer science. -
電子資源:
click for full text (PQDT)
ISBN:
9798380623179
A Data Science Perspective on Searches for New Physics at the LHC.
Espejo Morales, Irina.
A Data Science Perspective on Searches for New Physics at the LHC.
- 1 online resource (128 pages)
Source: Dissertations Abstracts International, Volume: 85-04, Section: B.
Thesis (Ph.D.)--New York University, 2023.
Includes bibliographical references
The use of Machine Learning (ML) techniques in the field of Physical Sciences has gained growing attention in recent years due to its potential to accelerate scientific discovery. In particular, Computational High Energy Physics (HEP) presents distinctive challenges to Data Science (DS), from methodology design to deployment of solutions, when searching for new physics at the Large Hadron Collider (LHC). In this thesis, we address key bottlenecks in HEP by leveraging the knowledge and practices accumulated from decades of Data Science (DS) end-to-end research. We provide practical solutions to contemporary HEP issues, emphasizing re-usability, with potential for mainstream deployments at scale in the ATLAS Experiment.In the first project, we develop scalable cyberinfrastructure that integrates pre-existing ML techniques into a standard analysis pipeline for new physics searches. The outcome is a distributed workflow that is containerized, parametrized, with a shallow learning curve, and reproducible. Experiments to test the scalability limits of this approach were performed at the National Energy Research Scientific Computing Center (NERSC). The processing time for 11 million collision samples was reduced from days to 5 hours.In the second project, we address the curse of dimensionality in producing confidence limit contours for hypothesis testing of new physics theories. We use the Active Learning framework and low-fidelity data to train a Multitask Gaussian Process (GP) to intelligently evaluate the high-fidelity hypothesis testing pipeline only in regions of interest. This approach leads to the production of 4D contours, an improvement compared to traditional 1-2D contours. The study is performed in real-world settings without approximations, as reviewed by the ATLAS Experiment.Finally, we conclude with a discussion on the interplay between Data Science and High Energy Physics, focusing on leveraging domain knowledge in a general way, producing long-lasting results for the community to build upon, and systematically exploiting past results beyond benchmarking studies.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2024
Mode of access: World Wide Web
ISBN: 9798380623179Subjects--Topical Terms:
573171
Computer science.
Subjects--Index Terms:
Active learningIndex Terms--Genre/Form:
554714
Electronic books.
A Data Science Perspective on Searches for New Physics at the LHC.
LDR
:03490ntm a22004097 4500
001
1143738
005
20240517104940.5
006
m o d
007
cr mn ---uuuuu
008
250605s2023 xx obm 000 0 eng d
020
$a
9798380623179
035
$a
(MiAaPQ)AAI30314708
035
$a
AAI30314708
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
$d
NTU
100
1
$a
Espejo Morales, Irina.
$3
1468520
245
1 2
$a
A Data Science Perspective on Searches for New Physics at the LHC.
264
0
$c
2023
300
$a
1 online resource (128 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Dissertations Abstracts International, Volume: 85-04, Section: B.
500
$a
Advisor: Cranmer, Kyle S.
502
$a
Thesis (Ph.D.)--New York University, 2023.
504
$a
Includes bibliographical references
520
$a
The use of Machine Learning (ML) techniques in the field of Physical Sciences has gained growing attention in recent years due to its potential to accelerate scientific discovery. In particular, Computational High Energy Physics (HEP) presents distinctive challenges to Data Science (DS), from methodology design to deployment of solutions, when searching for new physics at the Large Hadron Collider (LHC). In this thesis, we address key bottlenecks in HEP by leveraging the knowledge and practices accumulated from decades of Data Science (DS) end-to-end research. We provide practical solutions to contemporary HEP issues, emphasizing re-usability, with potential for mainstream deployments at scale in the ATLAS Experiment.In the first project, we develop scalable cyberinfrastructure that integrates pre-existing ML techniques into a standard analysis pipeline for new physics searches. The outcome is a distributed workflow that is containerized, parametrized, with a shallow learning curve, and reproducible. Experiments to test the scalability limits of this approach were performed at the National Energy Research Scientific Computing Center (NERSC). The processing time for 11 million collision samples was reduced from days to 5 hours.In the second project, we address the curse of dimensionality in producing confidence limit contours for hypothesis testing of new physics theories. We use the Active Learning framework and low-fidelity data to train a Multitask Gaussian Process (GP) to intelligently evaluate the high-fidelity hypothesis testing pipeline only in regions of interest. This approach leads to the production of 4D contours, an improvement compared to traditional 1-2D contours. The study is performed in real-world settings without approximations, as reviewed by the ATLAS Experiment.Finally, we conclude with a discussion on the interplay between Data Science and High Energy Physics, focusing on leveraging domain knowledge in a general way, producing long-lasting results for the community to build upon, and systematically exploiting past results beyond benchmarking studies.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2024
538
$a
Mode of access: World Wide Web
650
4
$a
Computer science.
$3
573171
650
4
$a
Computational physics.
$3
1181955
650
4
$a
Information technology.
$3
559429
653
$a
Active learning
653
$a
Data science
653
$a
High Energy Physics
653
$a
Hypothesis testing
653
$a
Machine learning
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0984
690
$a
0216
690
$a
0489
690
$a
0800
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
New York University.
$b
Center for Data Science.
$3
1468521
773
0
$t
Dissertations Abstracts International
$g
85-04B.
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30314708
$z
click for full text (PQDT)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入