語系:
繁體中文
English
說明(常見問題)
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Head Pose Estimation on Deep CNN Models.
~
ProQuest Information and Learning Co.
Head Pose Estimation on Deep CNN Models.
紀錄類型:
書目-語言資料,手稿 : Monograph/item
正題名/作者:
Head Pose Estimation on Deep CNN Models./
作者:
Shao, Wenxin.
面頁冊數:
1 online resource (105 pages)
附註:
Source: Masters Abstracts International, Volume: 57-01.
標題:
Electrical engineering. -
電子資源:
click for full text (PQDT)
ISBN:
9780355453942
Head Pose Estimation on Deep CNN Models.
Shao, Wenxin.
Head Pose Estimation on Deep CNN Models.
- 1 online resource (105 pages)
Source: Masters Abstracts International, Volume: 57-01.
Thesis (M.S.)--Rensselaer Polytechnic Institute, 2017.
Includes bibliographical references
Head pose estimation is an important computer vision task. Its applications can benefit people's daily life. In this thesis, our goal is to solve the head pose estimation problem using deep convolutional neural networks. The first task is to perform head pose estimation as a classification task. We propose a multimodal convolutional neural network(CNN) for head pose classification. The architecture of the model consists of three pathways whose inputs are face image, head image, and facial landmarks, which respectively capture the face appearance, facial context, and facial shape. We first perform the experiments on benchmark datasets. Then we perform head pose classification on low-quality driving videos. In order to deal with the noises in the videos, we propose the Max-Feature Map(MFM) with the help of Network-In-Network(NIN) for the CNN model, which has a better capability of handling the noises and the feature selection. We also use training techniques such as _ne-tuning and joint training to improve the performance on driving videos. The second task is to predict head pose angles using regression. We propose a deep CNN model which uses larger face image as the input and outputs all three head pose angles. The model has more layers and more parameters. Because head pose estimation needs more face images with various large head poses, face detection method is an essential part of data processing. Thus besides the face detection method we use in head pose classification, we also use a deep-learning face detection method based on region-based convolution neural network(R-CNN). We compare the performance of these two face detection methods on some datasets with continuous head poses, including benchmark datasets, driving videos, and some videos we record with a head tracker device. From head pose classification on benchmark datasets to head pose estimation on arbitrary data, we move on to more challenging tasks step by step. The experimental results on all these tasks and the comparison with other state-of-art methods show that our methods achieve a good robustness as well as a better accuracy, compared to the baseline and existing methods.
Electronic reproduction.
Ann Arbor, Mich. :
ProQuest,
2018
Mode of access: World Wide Web
ISBN: 9780355453942Subjects--Topical Terms:
596380
Electrical engineering.
Index Terms--Genre/Form:
554714
Electronic books.
Head Pose Estimation on Deep CNN Models.
LDR
:03281ntm a2200325K 4500
001
912166
005
20180608102940.5
006
m o u
007
cr mn||||a|a||
008
190606s2017 xx obm 000 0 eng d
020
$a
9780355453942
035
$a
(MiAaPQ)AAI10603318
035
$a
(MiAaPQ)rpi:11110
035
$a
AAI10603318
040
$a
MiAaPQ
$b
eng
$c
MiAaPQ
100
1
$a
Shao, Wenxin.
$3
1184400
245
1 0
$a
Head Pose Estimation on Deep CNN Models.
264
0
$c
2017
300
$a
1 online resource (105 pages)
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
500
$a
Source: Masters Abstracts International, Volume: 57-01.
500
$a
Adviser: Qiang Ji.
502
$a
Thesis (M.S.)--Rensselaer Polytechnic Institute, 2017.
504
$a
Includes bibliographical references
520
$a
Head pose estimation is an important computer vision task. Its applications can benefit people's daily life. In this thesis, our goal is to solve the head pose estimation problem using deep convolutional neural networks. The first task is to perform head pose estimation as a classification task. We propose a multimodal convolutional neural network(CNN) for head pose classification. The architecture of the model consists of three pathways whose inputs are face image, head image, and facial landmarks, which respectively capture the face appearance, facial context, and facial shape. We first perform the experiments on benchmark datasets. Then we perform head pose classification on low-quality driving videos. In order to deal with the noises in the videos, we propose the Max-Feature Map(MFM) with the help of Network-In-Network(NIN) for the CNN model, which has a better capability of handling the noises and the feature selection. We also use training techniques such as _ne-tuning and joint training to improve the performance on driving videos. The second task is to predict head pose angles using regression. We propose a deep CNN model which uses larger face image as the input and outputs all three head pose angles. The model has more layers and more parameters. Because head pose estimation needs more face images with various large head poses, face detection method is an essential part of data processing. Thus besides the face detection method we use in head pose classification, we also use a deep-learning face detection method based on region-based convolution neural network(R-CNN). We compare the performance of these two face detection methods on some datasets with continuous head poses, including benchmark datasets, driving videos, and some videos we record with a head tracker device. From head pose classification on benchmark datasets to head pose estimation on arbitrary data, we move on to more challenging tasks step by step. The experimental results on all these tasks and the comparison with other state-of-art methods show that our methods achieve a good robustness as well as a better accuracy, compared to the baseline and existing methods.
533
$a
Electronic reproduction.
$b
Ann Arbor, Mich. :
$c
ProQuest,
$d
2018
538
$a
Mode of access: World Wide Web
650
4
$a
Electrical engineering.
$3
596380
650
4
$a
Artificial intelligence.
$3
559380
655
7
$a
Electronic books.
$2
local
$3
554714
690
$a
0544
690
$a
0800
710
2
$a
ProQuest Information and Learning Co.
$3
1178819
710
2
$a
Rensselaer Polytechnic Institute.
$b
Electrical Engineering.
$3
1184401
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10603318
$z
click for full text (PQDT)
筆 0 讀者評論
多媒體
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼[密碼必須為2種組合(英文和數字)及長度為10碼以上]
登入