Please use this identifier to cite or link to this item: http://thuvienso.dut.udn.vn/handle/DUT/5553
DC FieldValueLanguage
dc.contributor.advisorTS. Nguyễn, Quang Như Quỳnhen_US
dc.contributor.advisorProf. Wen-Nung Lieen_US
dc.contributor.authorNguyễn, Hữu Thắngen_US
dc.date.accessioned2025-02-18T03:25:26Z-
dc.date.available2025-02-18T03:25:26Z-
dc.date.issued2019-
dc.identifier.urihttp://thuvienso.dut.udn.vn/handle/DUT/5553-
dc.description37 Tr.en_US
dc.description.abstractThis thesis is “3D Human Skeleton Extraction for Action Analysis”. We want to generate an accurate 3D pose estimation from a single view RGB image to understand more about people in images and videos. The 3D human pose estimation algorithm contained two steps. First, we put all of the 3D Human samples into K-means algorithm, and take the mean pose in each cluster as anchor pose ground truth in training part. Second, we trained first model called “3D anchor pose estimator”, which use 2D human pose as input ground truth, and the result of clustering as output ground truth. Anchor poses are some common human posture in our daily life. Final we train second model called “3D human pose estimator” and combine 2D human pose and 3D anchor pose to estimate final 3D human pose. In this work, we show a systematic design for how Fully Convolutional Network (FCNs) can be incorporated for the task of pose estimation. Our deep learning network contains two-stage: The first network (FCN1) estimate a 3D Anchor Pose from the 2D skeleton, then pass the results to the second stage (FCN2) to further regress/refine the 3D Anchor Pose to yield the final 3D human pose estimation. According to the experiments, our two-stage FCN network can generate a 3D human pose with an average MPJPE (Mean per Joint Position Error) of 62.99 mm when a 2D skeleton prediction is used as the input. The 2D skeleton predictions are produced by a pre-trained model called Stacked Hourglass. Stateof-the-art results are achieved on the H36M benchmarken_US
dc.language.isoenen_US
dc.publisherTrường Đại học Bách khoa - Đại học Đà Nẵngen_US
dc.subject3D Human Pose Estimationen_US
dc.subjectFully Convolutional Networken_US
dc.subjectDeep Learningen_US
dc.title3D human skeleton extraction for action analysisen_US
dc.typeĐồ ánen_US
dc.identifier.id2.DA.FA.19.010-
item.cerifentitytypePublications-
item.languageiso639-1en-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.fulltextCó toàn văn-
item.openairetypeĐồ án-
item.grantfulltextrestricted-
Appears in Collections:DA.Điện tử - Viễn thông
Files in This Item:
File Description SizeFormat Existing users please Login
2.DA.FA.19.010.Nguyen Huu Thang.pdfThuyết minh8.39 MBAdobe PDFThumbnail
Show simple item record

CORE Recommender

Page view(s) 50

9
checked on May 9, 2025

Download(s) 50

5
checked on May 9, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.