語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Data Masking, Encryption, and their ...
~
Asenjo, Juan C.
Data Masking, Encryption, and their Effect on Classification Performance : = Trade-offs Between Data Security and Utility.
紀錄類型:
書目-語言資料,手稿 : Monograph/item
正題名/作者:
Data Masking, Encryption, and their Effect on Classification Performance :/
其他題名:
Trade-offs Between Data Security and Utility.
作者:
Asenjo, Juan C.
面頁冊數:
1 online resource (187 pages)
附註:
Source: Dissertation Abstracts International, Volume: 78-12(E), Section: A.
標題:
Information science. -
電子資源:
click for full text (PQDT)
ISBN:
9780355104165
Data Masking, Encryption, and their Effect on Classification Performance : = Trade-offs Between Data Security and Utility.
Asenjo, Juan C.
Data Masking, Encryption, and their Effect on Classification Performance :
Trade-offs Between Data Security and Utility. - 1 online resource (187 pages)
Source: Dissertation Abstracts International, Volume: 78-12(E), Section: A.
Thesis (Ph.D.)--Nova Southeastern University, 2017.
Includes bibliographical references
As data mining increasingly shapes organizational decision-making, the quality of its results must be questioned to ensure trust in the technology. Inaccuracies can mislead decision-makers and cause costly mistakes. With more data collected for analytical purposes, privacy is also a major concern. Data security policies and regulations are increasingly put in place to manage risks, but these policies and regulations often employ technologies that substitute and/or suppress sensitive details contained in the data sets being mined. Data masking and substitution and/or data encryption and suppression of sensitive attributes from data sets can limit access to important details. It is believed that the use of data masking and encryption can impact the quality of data mining results. This dissertation investigated and compared the causal effects of data masking and encryption on classification performance as a measure of the quality of knowledge discovery. A review of the literature found a gap in the body of knowledge, indicating that this problem had not been studied before in an experimental setting. The objective of this dissertation was to gain an understanding of the trade-offs between data security and utility in the field of analytics and data mining. The research used a nationally recognized cancer incidence database, to show how masking and encryption of potentially sensitive demographic attributes such as patients' marital status, race/ethnicity, origin, and year of birth, could have a statistically significant impact on the patients' predicted survival. Performance parameters measured by four different classifiers delivered sizable variations in the range of 9% to 10% between a control group, where the select attributes were untouched, and two experimental groups where the attributes were substituted or suppressed to simulate the effects of the data protection techniques. In practice, this represented a corroboration of the potential risk involved when basing medical treatment decisions using data mining applications where attributes in the data sets are masked or encrypted for patient privacy and security concerns.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355104165Subjects--Topical Terms:
561178
Information science.
Index Terms--Genre/Form:
554714
Electronic books.
Data Masking, Encryption, and their Effect on Classification Performance : = Trade-offs Between Data Security and Utility.
LDR
:03389ntm a2200337K 4500
001
912557
005
20180608112133.5
006
m o u
007
cr mn||||a|a||
008
190606s2017 xx obm 000 0 eng d
020
$a
9780355104165
035
$a
(MiAaPQ)AAI10603724
035
$a
(MiAaPQ)scisnova:10477
035
$a
AAI10603724
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
100
1
$a
Asenjo, Juan C.
$3
1184974
245
1 0
$a
Data Masking, Encryption, and their Effect on Classification Performance :
$b
Trade-offs Between Data Security and Utility.
264
0
$c
2017
300
$a
1 online resource (187 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Dissertation Abstracts International, Volume: 78-12(E), Section: A.
500
$a
Adviser: Junping Sun.
502
$a
Thesis (Ph.D.)--Nova Southeastern University, 2017.
504
$a
Includes bibliographical references
520
$a
As data mining increasingly shapes organizational decision-making, the quality of its results must be questioned to ensure trust in the technology. Inaccuracies can mislead decision-makers and cause costly mistakes. With more data collected for analytical purposes, privacy is also a major concern. Data security policies and regulations are increasingly put in place to manage risks, but these policies and regulations often employ technologies that substitute and/or suppress sensitive details contained in the data sets being mined. Data masking and substitution and/or data encryption and suppression of sensitive attributes from data sets can limit access to important details. It is believed that the use of data masking and encryption can impact the quality of data mining results. This dissertation investigated and compared the causal effects of data masking and encryption on classification performance as a measure of the quality of knowledge discovery. A review of the literature found a gap in the body of knowledge, indicating that this problem had not been studied before in an experimental setting. The objective of this dissertation was to gain an understanding of the trade-offs between data security and utility in the field of analytics and data mining. The research used a nationally recognized cancer incidence database, to show how masking and encryption of potentially sensitive demographic attributes such as patients' marital status, race/ethnicity, origin, and year of birth, could have a statistically significant impact on the patients' predicted survival. Performance parameters measured by four different classifiers delivered sizable variations in the range of 9% to 10% between a control group, where the select attributes were untouched, and two experimental groups where the attributes were substituted or suppressed to simulate the effects of the data protection techniques. In practice, this represented a corroboration of the potential risk involved when basing medical treatment decisions using data mining applications where attributes in the data sets are masked or encrypted for patient privacy and security concerns.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2018
538
$a
Mode of access: World Wide Web
650
4
$a
Information science.
$3
561178
650
4
$a
Computer science.
$3
573171
650
4
$a
Information technology.
$3
559429
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0723
690
$a
0984
690
$a
0489
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
Nova Southeastern University.
$b
Information Systems.
$3
1148698
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10603724
$z
click for full text (PQDT)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入