國立虎尾科技大學 |

Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.

紀錄類型:	書目-語言資料,手稿 : Monograph/item
正題名/作者:	Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record./
作者:	Beaulieu-Jones, Brett Kreigh.
面頁冊數:	1 online resource (176 pages)
附註:	Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.
標題:	Bioinformatics. -
電子資源:	click for full text (PQDT)
ISBN:	9780355618037

Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
Beaulieu-Jones, Brett Kreigh.

Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record. - 1 online resource (176 pages)

Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.

Thesis (Ph.D.)--University of Pennsylvania, 2017.

Includes bibliographical references

The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient treatment and outcome data is available to researchers. Research is a tertiary priority in the EHR, where the priorities are patient care and billing. Because of this, the data is not standardized or formatted in a manner easily adapted to machine learning approaches. Data may be missing for a large variety of reasons ranging from individual input styles to differences in clinical decision making, for example, which lab tests to issue. Few patients are annotated at a research quality, limiting sample size and presenting a moving gold standard. Patient progression over time is key to understanding many diseases but many machine learning algorithms require a snapshot, at a single time point, to create a usable vector form. In this dissertation, we develop new machine learning methods and computational workflows to extract hidden phenotypes from the Electronic Health Record (EHR). In Part 1, we use a semi-supervised deep learning approach to compensate for the low number of research quality labels present in the EHR. In Part 2, we examine and provide recommendations for characterizing and managing the large amount of missing data inherent to EHR data. In Part 3, we present an adversarial approach to generate synthetic data that closely resembles the original data while protecting subject privacy. We also introduce a workflow to enable reproducible research even when data cannot be shared. In Part 4, we introduce a novel strategy to first extract sequential data from the EHR and then demonstrate the ability to model these sequences with deep learning.

Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018

Mode of access: World Wide Web

ISBN: 9780355618037Subjects--Topical Terms:

583857
Bioinformatics.
Index Terms--Genre/Form:

554714
Electronic books.

Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
LDR:02890ntm a2200337K 4500 001 912174
005 20180608102941.5
006 m o u
007 cr mn||||a|a||
008 190606s2017 xx obm 000 0 eng d
020 $a 9780355618037
035 $a (MiAaPQ)AAI10624530
035 $a (MiAaPQ)upenngdas:13002
035 $a AAI10624530
040 $a MiAaPQ $b eng $c MiAaPQ
100 1 $a Beaulieu-Jones, Brett Kreigh. $3 1184412
245 1 0 $a Machine Learning Methods to Identify Hidden Phenotypes in the Electronic Health Record.
264 0 $c 2017
300 $a 1 online resource (176 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertation Abstracts International, Volume: 79-07(E), Section: B.
500 $a Advisers: Jason H. Moore; Casey S. Greene.
502 $a Thesis (Ph.D.)--University of Pennsylvania, 2017.
504 $a Includes bibliographical references
520 $a The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient treatment and outcome data is available to researchers. Research is a tertiary priority in the EHR, where the priorities are patient care and billing. Because of this, the data is not standardized or formatted in a manner easily adapted to machine learning approaches. Data may be missing for a large variety of reasons ranging from individual input styles to differences in clinical decision making, for example, which lab tests to issue. Few patients are annotated at a research quality, limiting sample size and presenting a moving gold standard. Patient progression over time is key to understanding many diseases but many machine learning algorithms require a snapshot, at a single time point, to create a usable vector form. In this dissertation, we develop new machine learning methods and computational workflows to extract hidden phenotypes from the Electronic Health Record (EHR). In Part 1, we use a semi-supervised deep learning approach to compensate for the low number of research quality labels present in the EHR. In Part 2, we examine and provide recommendations for characterizing and managing the large amount of missing data inherent to EHR data. In Part 3, we present an adversarial approach to generate synthetic data that closely resembles the original data while protecting subject privacy. We also introduce a workflow to enable reproducible research even when data cannot be shared. In Part 4, we introduce a novel strategy to first extract sequential data from the EHR and then demonstrate the ability to model these sequences with deep learning.
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2018
538 $a Mode of access: World Wide Web
650 4 $a Bioinformatics. $3 583857
650 4 $a Genetics. $3 578972
650 4 $a Artificial intelligence. $3 559380
655 7 $a Electronic books. $2 local $3 554714
690 $a 0715
690 $a 0369
690 $a 0800
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a University of Pennsylvania. $b Genomics and Computational Biology. $3 1184413
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10624530 $z click for full text (PQDT)