A data mining approach for predicting academic success – a case study

The present study puts forward a regression analytic model based on the random forest algorithm, developed to predict, at an early stage, the global academic performance of the undergraduates of a polytechnic higher education institution. The study targets the universe of an institution composed of...

Full description

Bibliographic Details
Main Author: Martins, Maria Prudência (author)
Other Authors: Miguéis, Vera (author), Fonseca, Davide (author), Alves, Albano (author)
Format: conferenceObject
Language:eng
Published: 2020
Subjects:
Online Access:http://hdl.handle.net/10198/22709
Country:Portugal
Oai:oai:bibliotecadigital.ipb.pt:10198/22709
Description
Summary:The present study puts forward a regression analytic model based on the random forest algorithm, developed to predict, at an early stage, the global academic performance of the undergraduates of a polytechnic higher education institution. The study targets the universe of an institution composed of 5 schools rather than following the usual procedure of delimiting the prediction to one single specific degree course. Hence, we intend to provide the institution with one single tool capable of including the heterogeneity of the universe of students as well as educational dynamics. A different approach to feature selection is proposed, which enables to completely exclude categories of predictive variables, making the model useful for scenarios in which not all categories of data considered are collected. The introduced model can be used at a central level by the decision-makers who are entitled to design actions to mitigate academic failure.