← All projects
Featured Solo · modeling and feature engineering · 2024 · Model

Milk Quality Prediction

Classifying milk grade from physicochemical measurements

Milk Quality Prediction
Multi-class Low / med / high grade
Sensor-only No downstream lab tests needed
Imbalanced Class-weighted training

The problem

Dairy QC labs catch contamination at the end of the line, after bad batches are already produced. Earlier prediction from upstream sensor data would catch issues before a batch is ruined.

My contribution

Built a multi-class classifier predicting milk quality grade (low / medium / high) from chemical sensor features (pH, fat, protein, lactose, somatic cell count). Compared tree ensembles against linear baselines and tuned class weights to handle imbalanced grade distributions.

Outcome

Predictive model identifying milk grade from upstream sensor data alone, with feature importance pinpointing which chemical signals matter most. Useful pattern for any process-quality prediction problem.

What I learned

Domain context matters: a feature that looks weak on its own (e.g. color alone) can become highly informative once combined with chemistry features the way an expert would combine them mentally.

Type
Model
Role
Solo · modeling and feature engineering
Timeframe
2024
Stack
PythonScikit-learnPandasMatplotlibSeaborn
Tags
ClassificationMachine LearningDairy IndustryPredictive Analytics