Predicting human activities in sequences of actions in RGB-D videos

Bibliographic Details
Main Author: Jardim, D. (author)
Other Authors: Nunes, L. (author), Dias, M. (author)
Format: conferenceObject
Language: eng
Published: 2021
Subjects:
Online Access: http://hdl.handle.net/10071/22877
Country: Portugal
OAI: oai:repositorio.iscte-iul.pt:10071/22877
Description
Summary: In our daily activities we predict or anticipate events when interacting with other humans or with objects. Automated prediction of human activity has several potential applications: surveillance systems, human-computer interfaces, sports video analysis, human-robot collaboration, games, and healthcare. We propose a system capable of recognizing and predicting human actions using supervised classifiers trained with automatically labeled data and evaluated on our human activity RGB-D dataset (recorded with a Kinect sensor), using only the positions of the main skeleton joints to extract features. Conditional random fields (CRFs) have been used before to model the sequential nature of actions, but where other approaches try to predict an outcome or anticipate seconds ahead in time, we try to predict the subject's next action. Our results show an activity prediction accuracy of 89.9% using an automatically labeled dataset.
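
The two ideas in the summary, features derived only from skeleton joint positions and prediction of a subject's next action from the actions seen so far, can be illustrated with a minimal sketch. This is not the authors' system: a first-order transition counter stands in for the supervised classifiers and CRFs the record mentions, and the joint names, action labels, and function names are all hypothetical.

```python
from collections import Counter, defaultdict
from math import dist  # Python 3.8+


def joint_distance_features(joints):
    """Return pairwise Euclidean distances between 3D joint positions.

    `joints` maps a (hypothetical) joint name to an (x, y, z) tuple.
    Joints are sorted by name so the feature order is stable.
    """
    names = sorted(joints)
    return [
        dist(joints[a], joints[b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    ]


def fit_transitions(sequences):
    """Count how often each action is followed by each other action."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, current):
    """Return the most frequent successor of `current`, or None if unseen."""
    if current not in counts:
        return None
    return counts[current].most_common(1)[0][0]


# Hypothetical labeled action sequences (illustrative only, not from the dataset).
training_sequences = [
    ["reach", "grab", "drink"],
    ["reach", "grab", "place"],
    ["reach", "grab", "drink"],
]
transition_counts = fit_transitions(training_sequences)
next_action = predict_next(transition_counts, "grab")
features = joint_distance_features({"head": (0, 0, 0), "hand": (3, 4, 0)})
```

A sequence model such as a CRF would condition on the per-frame joint features as well as on the preceding labels; the counter above uses only the preceding label, which keeps the sketch self-contained.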