Study of evaluation metrics while predicting the yield of lettuce plants in indoor farms using machine learning models

University essay from Högskolan i Skövde/Institutionen för informationsteknologi

Abstract: A key challenge for maximizing the world’s food supply is crop yield prediction. In this study, three machine models are used to predict the fresh weight (yield) of lettuce plants that are grown inside indoor farms hydroponically using the vertical farming infrastructure, namely, support vector regressor (SVR), random forest regressor (RFR), and deep neural network (DNN).The climate data, nutrient data, and plant growth data are passed as input to train the models to understand the growth pattern based on the available features. The study of evaluation metrics majorly covers Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), R-squared, and Adjusted R-squared values.The results of the project have shown that the Random Forest with all the features is the best model having the best results with the least cross-validated MAE score and good cross-validated Adjusted R-squared value considering that the error of the prediction is minimal. This is followed by the DNN model with minor differences in the resulting values. The Support Vector Regressor (SVR) model gave a very poor performance with a huge error value that cannot be afforded in this scenario. In this study, we have also compared various evaluating metrics mentioned above and considered the cross-validated MAE and cross-validated Adjusted R-squared metrics. According to our study, MAE had the lowest error value, which is less sensitive to the outliers and adjusted R-squared value helps to understand the variance of the target variable with the predictor variable and adjust the metric to prevent the issues of overfitting.

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)