Resumo: | In data mining problems, the high dimensionality of the input features can affect the performance of the process. In this way, the features selection methods appear as a solution to the problems encountered when analyzing databases with large dimensions. This article presents the implementation of the Pearson's linear correlation, ReliefF, Welch's t-test and multilinear regression based algorithms with forwards selection and backward elimination direction for the selection of acoustic features for the task of voice pathologies identification. The best set of selected features improved the accuracy and F1-score from 83% to 92% (9 points of percentage), using the ReliefF algorithm.
|