Developing highly accurate machine learning models for optimizing water quality management decisions in tilapia aquaculture

Sci Rep. 2025 Oct 13;15(1):35600. doi: 10.1038/s41598-025-16939-w.

Abstract

The optimization of water quality management is crucial for the success and sustainability of tilapia aquaculture. This study presents a novel approach for developing a decision-support system by comparing various machine learning models to predict optimal water quality management actions based on key environmental parameters. The novelty of this work lies in its focus on automating management decisions, moving beyond simple parameter prediction. A synthetic dataset, representing 20 critical water quality scenarios, was generated and used for model development. This dataset was preprocessed using class balancing with SMOTETomek and feature scaling. Several machine learning algorithms, namely Random Forest, Gradient Boosting, XGBoost, Support Vector Machines, Logistic Regression, and Neural Networks, were trained and evaluated. Additionally, a Voting Classifier ensemble model was employed to leverage the strengths of these individual models. Performance was assessed using accuracy, precision, recall, and F1-score, with cross-validation conducted to ensure robustness. The results demonstrated that multiple models including the ensemble Voting Classifier, Random Forest, Gradient Boosting, XGBoost, and Neural Network models, achieved perfect accuracy on the held-out test set. Cross-validation confirmed high performance across all top models, with the Neural Network achieving the highest mean accuracy of 98.99% ± 1.64%. Rather than identifying a single optimal model, this study demonstrates that model selection should be guided by specific deployment requirements, with each approach offering distinct advantages for different operational priorities. The proposed machine learning approach offers a promising tool for optimizing water quality management in Tilapia aquaculture, providing a foundation for data-driven systems that can improve efficiency, productivity, and sustainability in the industry.

Keywords: Machine learning; Predictive modeling; Tilapia aquaculture; Water quality management.

MeSH terms

  • Algorithms
  • Animals
  • Aquaculture* / methods
  • Machine Learning*
  • Neural Networks, Computer
  • Support Vector Machine
  • Tilapia* / growth & development
  • Water Quality* / standards