State University of New York at Buffalo.
A Bootstrapped Approach to Multilingual Text Stream Parsing.
Record type:
Bibliographic - language material, manuscript : Monograph/item
Title/Author:
A Bootstrapped Approach to Multilingual Text Stream Parsing./
Author:
Londhe, Nikhil.
Extent:
1 online resource (146 pages)
Notes:
Source: Dissertation Abstracts International, Volume: 79-03(E), Section: B.
Subject:
Computer science.
Electronic resource:
click for full text (PQDT)
ISBN:
9780355310160
Thesis (Ph.D.)--State University of New York at Buffalo, 2017.
Includes bibliographical references
The ubiquitous hashtag has disruptively transformed how news stories are reported and shared across social media networks. Such text streams are often massively multilingual, spanning 50 different languages on average, and contain a mix of subjective user opinion, objective evolving information about the story, and unrelated spam. This is in addition to the usual challenges of processing social media content, such as lack of grammar, stylized spellings, and use of slang, emojis, and emoticons. Further, language-dense regions frequently exhibit code switching and code mixing, where users switch between languages within a single post, with or without retaining a single writing system. So far, most research on parsing such streams has resorted to piecemeal, language-specific approaches. In this work, we propose a processing pipeline with two salient features. First, we show how the topical and temporal relationships between posts can be used for language-agnostic discourse interpretation. Second, we show how bootstrapping for incremental parsing can improve system performance, and we propose an end-to-end pipeline to that effect. We explore how this pipeline can be used for two sample use cases: question answering and summarization.
Electronic reproduction. Ann Arbor, Mich. : ProQuest, 2018.
Mode of access: World Wide Web
ISBN: 9780355310160
Subjects--Topical Terms:
573171
Computer science.
Index Terms--Genre/Form:
554714
Electronic books.
LDR 02479ntm a2200337K 4500
001 913847
005 20180628103545.5
006 m o u
007 cr mn||||a|a||
008 190606s2017 xx obm 000 0 eng d
020 $a 9780355310160
035 $a (MiAaPQ)AAI10620498
035 $a (MiAaPQ)buffalo:15389
035 $a AAI10620498
040 $a MiAaPQ $b eng $c MiAaPQ
100 1 $a Londhe, Nikhil. $3 1186853
245 1 2 $a A Bootstrapped Approach to Multilingual Text Stream Parsing.
264 0 $c 2017
300 $a 1 online resource (146 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertation Abstracts International, Volume: 79-03(E), Section: B.
500 $a Adviser: Rohini K. Srihari.
502 $a Thesis (Ph.D.)--State University of New York at Buffalo, 2017.
504 $a Includes bibliographical references
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2018
538 $a Mode of access: World Wide Web
650 4 $a Computer science. $3 573171
650 4 $a Artificial intelligence. $3 559380
650 4 $a Linguistics. $3 557829
655 7 $a Electronic books. $2 local $3 554714
690 $a 0984
690 $a 0800
690 $a 0290
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a State University of New York at Buffalo. $b Computer Science and Engineering. $3 1180201
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10620498 $z click for full text (PQDT)