Language:
English
繁體中文
Help
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Knowledge Discovery from Multi-Sourced Data
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Knowledge Discovery from Multi-Sourced Data/ by Chen Ye, Hongzhi Wang, Guojun Dai.
Author:
Ye, Chen.
other author:
Wang, Hongzhi.
Description:
XII, 83 p. 14 illus., 9 illus. in color.online resource. :
Contained By:
Springer Nature eBook
Subject:
Data mining. -
Online resource:
https://doi.org/10.1007/978-981-19-1879-7
ISBN:
9789811918797
Knowledge Discovery from Multi-Sourced Data
Ye, Chen.
Knowledge Discovery from Multi-Sourced Data
[electronic resource] /by Chen Ye, Hongzhi Wang, Guojun Dai. - 1st ed. 2022. - XII, 83 p. 14 illus., 9 illus. in color.online resource. - SpringerBriefs in Computer Science,2191-5776. - SpringerBriefs in Computer Science,.
1. Introduction -- 2. Functional-dependency-based truth discovery for isomorphic data -- 3. Denial-constraint-based truth discovery for isomorphic data -- 4. Pattern discovery for heterogeneous data -- 5. Deep fact discovery for text data.
This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.
ISBN: 9789811918797
Standard No.: 10.1007/978-981-19-1879-7doiSubjects--Topical Terms:
528622
Data mining.
LC Class. No.: QA76.9.D343
Dewey Class. No.: 006.312
Knowledge Discovery from Multi-Sourced Data
LDR
:03438nam a22004215i 4500
001
1087346
003
DE-He213
005
20220613221017.0
007
cr nn 008mamaa
008
221228s2022 si | s |||| 0|eng d
020
$a
9789811918797
$9
978-981-19-1879-7
024
7
$a
10.1007/978-981-19-1879-7
$2
doi
035
$a
978-981-19-1879-7
050
4
$a
QA76.9.D343
072
7
$a
UNF
$2
bicssc
072
7
$a
UYQE
$2
bicssc
072
7
$a
COM021030
$2
bisacsh
072
7
$a
UNF
$2
thema
072
7
$a
UYQE
$2
thema
082
0 4
$a
006.312
$2
23
100
1
$a
Ye, Chen.
$e
author.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1394374
245
1 0
$a
Knowledge Discovery from Multi-Sourced Data
$h
[electronic resource] /
$c
by Chen Ye, Hongzhi Wang, Guojun Dai.
250
$a
1st ed. 2022.
264
1
$a
Singapore :
$b
Springer Nature Singapore :
$b
Imprint: Springer,
$c
2022.
300
$a
XII, 83 p. 14 illus., 9 illus. in color.
$b
online resource.
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
347
$a
text file
$b
PDF
$2
rda
490
1
$a
SpringerBriefs in Computer Science,
$x
2191-5776
505
0
$a
1. Introduction -- 2. Functional-dependency-based truth discovery for isomorphic data -- 3. Denial-constraint-based truth discovery for isomorphic data -- 4. Pattern discovery for heterogeneous data -- 5. Deep fact discovery for text data.
520
$a
This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.
650
0
$a
Data mining.
$3
528622
650
0
$a
Database management.
$3
557799
650
0
$a
Artificial intelligence—Data processing.
$3
1366684
650
1 4
$a
Data Mining and Knowledge Discovery.
$3
677765
650
2 4
$a
Database Management.
$3
669820
650
2 4
$a
Data Science.
$3
1174436
700
1
$a
Wang, Hongzhi.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1065009
700
1
$a
Dai, Guojun.
$e
author.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
1394375
710
2
$a
SpringerLink (Online service)
$3
593884
773
0
$t
Springer Nature eBook
776
0 8
$i
Printed edition:
$z
9789811918780
776
0 8
$i
Printed edition:
$z
9789811918803
830
0
$a
SpringerBriefs in Computer Science,
$x
2191-5768
$3
1255334
856
4 0
$u
https://doi.org/10.1007/978-981-19-1879-7
912
$a
ZDB-2-SCS
912
$a
ZDB-2-SXCS
950
$a
Computer Science (SpringerNature-11645)
950
$a
Computer Science (R0) (SpringerNature-43710)
based on 0 review(s)
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login