Top-down human pose estimation with depth images and domain adaptation

In this paper, a method for estimation of human pose is proposed, making use of ToF (Time of Flight) cameras. For this, a YOLO based object detection method was used, to develop a top-down method. In the first stage, a network was developed to detect people in the image. In the second stage, a netwo...

ver descrição completa

Detalhes bibliográficos
Autor principal: Rodrigues, Nelson (author)
Outros Autores: Torres, Helena Daniela Ribeiro (author), Oliveira, Bruno (author), Borges, João (author), Queirós, Sandro Filipe Monteiro (author), Mendes, José A. (author), Fonseca, Jaime C. (author), Coelho, Victor (author), Brito, José Henrique (author)
Formato: conferencePaper
Idioma:eng
Publicado em: 2019
Assuntos:
Texto completo:http://hdl.handle.net/1822/71176
País:Portugal
Oai:oai:repositorium.sdum.uminho.pt:1822/71176
Descrição
Resumo:In this paper, a method for estimation of human pose is proposed, making use of ToF (Time of Flight) cameras. For this, a YOLO based object detection method was used, to develop a top-down method. In the first stage, a network was developed to detect people in the image. In the second stage, a network was developed to estimate the joints of each person, using the image result from the first stage. We show that a deep learning network trained from scratch with ToF images yields better results than taking a deep neural network pretrained on RGB data and retraining it with ToF data. We also show that a top-down detector, with a person detector and a joint detector works better than detecting the body joints over the entire image.