Top-down human pose estimation with depth images and domain adaptation

In this paper, a method for estimation of human pose is proposed, making use of ToF (Time of Flight) cameras. For this, a YOLO based object detection method was used, to develop a top-down method. In the first stage, a network was developed to detect people in the image. In the second stage, a netwo...

Full description

Bibliographic Details
Main Author: Rodrigues, Nelson (author)
Other Authors: Torres, Helena Daniela Ribeiro (author), Oliveira, Bruno (author), Borges, João (author), Queirós, Sandro Filipe Monteiro (author), Mendes, José A. (author), Fonseca, Jaime C. (author), Coelho, Victor (author), Brito, José Henrique (author)
Format: conferencePaper
Language:eng
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/1822/71176
Country:Portugal
Oai:oai:repositorium.sdum.uminho.pt:1822/71176
Description
Summary:In this paper, a method for estimation of human pose is proposed, making use of ToF (Time of Flight) cameras. For this, a YOLO based object detection method was used, to develop a top-down method. In the first stage, a network was developed to detect people in the image. In the second stage, a network was developed to estimate the joints of each person, using the image result from the first stage. We show that a deep learning network trained from scratch with ToF images yields better results than taking a deep neural network pretrained on RGB data and retraining it with ToF data. We also show that a top-down detector, with a person detector and a joint detector works better than detecting the body joints over the entire image.