語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Machine Learning Methods to Identify...
~
ProQuest Information and Learning Co.
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
紀錄類型:
書目-語言資料,手稿 : Monograph/item
正題名/作者:
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record./
作者:
Beaulieu-Jones, Brett Kreigh.
面頁冊數:
1 online resource (176 pages)
附註:
Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.
標題:
Bioinformatics. -
電子資源:
click for full text (PQDT)
ISBN:
9780355618037
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
Beaulieu-Jones, Brett Kreigh.
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
- 1 online resource (176 pages)
Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.
Thesis (Ph.D.)--University of Pennsylvania, 2017.
Includes bibliographical references
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient treatment and outcome data is available to researchers. Research is a tertiary priority in the EHR, where the priorities are patient care and billing. Because of this, the data is not standardized or formatted in a manner easily adapted to machine learning approaches. Data may be missing for a large variety of reasons ranging from individual input styles to differences in clinical decision making, for example, which lab tests to issue. Few patients are annotated at a research quality, limiting sample size and presenting a moving gold standard. Patient progression over time is key to understanding many diseases but many machine learning algorithms require a snapshot, at a single time point, to create a usable vector form. In this dissertation, we develop new machine learning methods and computational workflows to extract hidden phenotypes from the Electronic Health Record (EHR). In Part 1, we use a semi-supervised deep learning approach to compensate for the low number of research quality labels present in the EHR. In Part 2, we examine and provide recommendations for characterizing and managing the large amount of missing data inherent to EHR data. In Part 3, we present an adversarial approach to generate synthetic data that closely resembles the original data while protecting subject privacy. We also introduce a workflow to enable reproducible research even when data cannot be shared. In Part 4, we introduce a novel strategy to first extract sequential data from the EHR and then demonstrate the ability to model these sequences with deep learning.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355618037Subjects--Topical Terms:
583857
Bioinformatics.
Index Terms--Genre/Form:
554714
Electronic books.
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
LDR
:02890ntm a2200337K 4500
001
912174
005
20180608102941.5
006
m o u
007
cr mn||||a|a||
008
190606s2017 xx obm 000 0 eng d
020
$a
9780355618037
035
$a
(MiAaPQ)AAI10624530
035
$a
(MiAaPQ)upenngdas:13002
035
$a
AAI10624530
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
100
1
$a
Beaulieu-Jones, Brett Kreigh.
$3
1184412
245
1 0
$a
Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
264
0
$c
2017
300
$a
1 online resource (176 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.
500
$a
Advisers: Jason H. Moore; Casey S. Greene.
502
$a
Thesis (Ph.D.)--University of Pennsylvania, 2017.
504
$a
Includes bibliographical references
520
$a
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient treatment and outcome data is available to researchers. Research is a tertiary priority in the EHR, where the priorities are patient care and billing. Because of this, the data is not standardized or formatted in a manner easily adapted to machine learning approaches. Data may be missing for a large variety of reasons ranging from individual input styles to differences in clinical decision making, for example, which lab tests to issue. Few patients are annotated at a research quality, limiting sample size and presenting a moving gold standard. Patient progression over time is key to understanding many diseases but many machine learning algorithms require a snapshot, at a single time point, to create a usable vector form. In this dissertation, we develop new machine learning methods and computational workflows to extract hidden phenotypes from the Electronic Health Record (EHR). In Part 1, we use a semi-supervised deep learning approach to compensate for the low number of research quality labels present in the EHR. In Part 2, we examine and provide recommendations for characterizing and managing the large amount of missing data inherent to EHR data. In Part 3, we present an adversarial approach to generate synthetic data that closely resembles the original data while protecting subject privacy. We also introduce a workflow to enable reproducible research even when data cannot be shared. In Part 4, we introduce a novel strategy to first extract sequential data from the EHR and then demonstrate the ability to model these sequences with deep learning.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2018
538
$a
Mode of access: World Wide Web
650
4
$a
Bioinformatics.
$3
583857
650
4
$a
Genetics.
$3
578972
650
4
$a
Artificial intelligence.
$3
559380
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0715
690
$a
0369
690
$a
0800
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
University of Pennsylvania.
$b
Genomics and Computational Biology.
$3
1184413
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10624530
$z
click for full text (PQDT)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入