Language:
English
繁體中文
Help
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Deep Learning for Image Understanding.
~
Wang, Yufei.
Deep Learning for Image Understanding.
Record Type:
Language materials, manuscript : Monograph/item
Title/Author:
Deep Learning for Image Understanding./
Author:
Wang, Yufei.
Description:
1 online resource (151 pages)
Notes:
Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.
Subject:
Electrical engineering. -
Online resource:
click for full text (PQDT)
ISBN:
9780355545173
Deep Learning for Image Understanding.
Wang, Yufei.
Deep Learning for Image Understanding.
- 1 online resource (151 pages)
Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.
Thesis (Ph.D.)--University of California, San Diego, 2017.
Includes bibliographical references
Computer vision and image understanding is the problem of interpreting images by locating, recognizing objects, attributes and other higher level features in an image. In this thesis, I seek to tackle this broad problem using deep learning techniques. More specifically, I build deep neural network based models to solve two specific problems to understand images in a high level: album wise image understanding with event-specific image importance score, and description generation for an image.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355545173Subjects--Topical Terms:
596380
Electrical engineering.
Index Terms--Genre/Form:
554714
Electronic books.
Deep Learning for Image Understanding.
LDR
:03209ntm a2200361K 4500
001
912187
005
20180608102941.5
006
m o u
007
cr mn||||a|a||
008
190606s2017 xx obm 000 0 eng d
020
$a
9780355545173
035
$a
(MiAaPQ)AAI10682977
035
$a
(MiAaPQ)ucsd:17062
035
$a
AAI10682977
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
100
1
$a
Wang, Yufei.
$3
1184433
245
1 0
$a
Deep Learning for Image Understanding.
264
0
$c
2017
300
$a
1 online resource (151 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Dissertation Abstracts International, Volume: 79-04(E), Section: B.
500
$a
Advisers: Garrison W. Cottrell; Nuno Vasconcelos.
502
$a
Thesis (Ph.D.)--University of California, San Diego, 2017.
504
$a
Includes bibliographical references
520
$a
Computer vision and image understanding is the problem of interpreting images by locating, recognizing objects, attributes and other higher level features in an image. In this thesis, I seek to tackle this broad problem using deep learning techniques. More specifically, I build deep neural network based models to solve two specific problems to understand images in a high level: album wise image understanding with event-specific image importance score, and description generation for an image.
520
$a
I first focus on the understanding of a collection of images in an event album. In an event album, some images are more important or interesting to save or present than others, and I show that with an event-specific image importance property, we can learn the interestingness of an image given an album, and the performance of the model generated importance score is very close to human preference. I build a siamese network that can predict image importance score given the event type of that image, using novel objective function and learning scheme. Next, to make the process fully automated, I propose an iterative updating procedure for event type and image importance score prediction, that can simultaneously decide the event type of the album and the importance score of every image. It consists of a Convolutional Neural Network that recognizes the event type, a Long-Short Term Memory (LSTM) that uses sequential information for event type recognition, and a siamese network that predicts image importance score.
520
$a
Furthermore, not just limited to describing an image with a score or by a classified type, I seek the possibility to describe it with a phrase or sentence. I propose a coarse-to-fine LSTM based method that decomposes the original image description into a skeleton sentence and its notable attributes, and demonstrate that in this way the language model can generate better descriptions, with the capability to generate image descriptions that better accommodates user preference.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2018
538
$a
Mode of access: World Wide Web
650
4
$a
Electrical engineering.
$3
596380
650
4
$a
Computer science.
$3
573171
650
4
$a
Artificial intelligence.
$3
559380
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0544
690
$a
0984
690
$a
0800
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
University of California, San Diego.
$b
Electrical Engineering.
$3
1184434
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10682977
$z
click for full text (PQDT)
based on 0 review(s)
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login