國立虎尾科技大學 |

Deep Learning for Image Understanding.

紀錄類型:	書目-語言資料,手稿 : Monograph/item
正題名/作者:	Deep Learning for Image Understanding./
作者:	Wang, Yufei.
面頁冊數:	1 online resource (151 pages)
附註:	Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.
標題:	Electrical engineering. -
電子資源:	click for full text (PQDT)
ISBN:	9780355545173

Deep Learning for Image Understanding.
Wang, Yufei.

Deep Learning for Image Understanding. - 1 online resource (151 pages)

Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.

Thesis (Ph.D.)--University of California, San Diego, 2017.

Includes bibliographical references

Computer vision and image understanding is the problem of interpreting images by locating, recognizing objects, attributes and other higher level features in an image. In this thesis, I seek to tackle this broad problem using deep learning techniques. More specifically, I build deep neural network based models to solve two specific problems to understand images in a high level: album wise image understanding with event-specific image importance score, and description generation for an image.

Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018

Mode of access: World Wide Web

ISBN: 9780355545173Subjects--Topical Terms:

596380
Electrical engineering.
Index Terms--Genre/Form:

554714
Electronic books.

Deep Learning for Image Understanding.
LDR:03209ntm a2200361K 4500 001 912187
005 20180608102941.5
006 m o u
007 cr mn||||a|a||
008 190606s2017 xx obm 000 0 eng d
020 $a 9780355545173
035 $a (MiAaPQ)AAI10682977
035 $a (MiAaPQ)ucsd:17062
035 $a AAI10682977
040 $a MiAaPQ $b eng $c MiAaPQ
100 1 $a Wang, Yufei. $3 1184433
245 1 0 $a Deep Learning for Image Understanding.
264 0 $c 2017
300 $a 1 online resource (151 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.
500 $a Advisers: Garrison W. Cottrell; Nuno Vasconcelos.
502 $a Thesis (Ph.D.)--University of California, San Diego, 2017.
504 $a Includes bibliographical references
520 $a Computer vision and image understanding is the problem of interpreting images by locating, recognizing objects, attributes and other higher level features in an image. In this thesis, I seek to tackle this broad problem using deep learning techniques. More specifically, I build deep neural network based models to solve two specific problems to understand images in a high level: album wise image understanding with event-specific image importance score, and description generation for an image.
520 $a I first focus on the understanding of a collection of images in an event album. In an event album, some images are more important or interesting to save or present than others, and I show that with an event-specific image importance property, we can learn the interestingness of an image given an album, and the performance of the model generated importance score is very close to human preference. I build a siamese network that can predict image importance score given the event type of that image, using novel objective function and learning scheme. Next, to make the process fully automated, I propose an iterative updating procedure for event type and image importance score prediction, that can simultaneously decide the event type of the album and the importance score of every image. It consists of a Convolutional Neural Network that recognizes the event type, a Long-Short Term Memory (LSTM) that uses sequential information for event type recognition, and a siamese network that predicts image importance score.
520 $a Furthermore, not just limited to describing an image with a score or by a classified type, I seek the possibility to describe it with a phrase or sentence. I propose a coarse-to-fine LSTM based method that decomposes the original image description into a skeleton sentence and its notable attributes, and demonstrate that in this way the language model can generate better descriptions, with the capability to generate image descriptions that better accommodates user preference.
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2018
538 $a Mode of access: World Wide Web
650 4 $a Electrical engineering. $3 596380
650 4 $a Computer science. $3 573171
650 4 $a Artificial intelligence. $3 559380
655 7 $a Electronic books. $2 local $3 554714
690 $a 0544
690 $a 0984
690 $a 0800
710 2 $a ProQuest Information and Learning Co. $3 1178819
710 2 $a University of California, San Diego. $b Electrical Engineering. $3 1184434
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10682977 $z click for full text (PQDT)