An automated machine learning approach for predicting chemical laboratory material consumption

This paper address a relevant business analytics need of a chemical company, which is adopting an Industry 4.0 transformation. In this company, quality tests are executed at the Analytical Laboratories (AL), which receive production samples and execute several instrumen- tal analyses. In order to im...

Full description

Bibliographic Details
Main Author: Silva, António João (author)
Other Authors: Cortez, Paulo (author)
Format: conferencePaper
Language:eng
Published: 2021
Subjects:
Online Access:http://hdl.handle.net/1822/73561
Country:Portugal
Oai:oai:repositorium.sdum.uminho.pt:1822/73561
Description
Summary:This paper address a relevant business analytics need of a chemical company, which is adopting an Industry 4.0 transformation. In this company, quality tests are executed at the Analytical Laboratories (AL), which receive production samples and execute several instrumen- tal analyses. In order to improve the AL stock warehouse management, a Machine Learning (ML) project was developed, aiming to estimate the AL materials consumption based on week plans of sample analy- ses. Following the CRoss-Industry Standard Process for Data Mining (CRISP-DM) methodology, several iterations were executed, in which three input variable selection strategies and two sets of AL materials (top 10 and all consumed materials) were tested. To reduce the mod- eling effort, an Automated Machine Learning (AutoML) was adopted, allowing to automatically set the best ML model among six distinct re- gression algorithms. Using real data from the chemical company and a realistic rolling window evaluation, several ML train and test iterations were executed. The AutoML results were compared with two time series forecasting methods, the ARIMA methodology and a deep learning Long Short-Term Memory (LSTM) model. Overall, competitive results were achieved by the best AutoML models, particularly for the top 10 set of materials.