- Acquire labelled or annotated data
- Consider Ethical Considerations in ML
- Perform Data Cleaning
- Validation in small datasets
- Analyze Data and features using visualizations, dimension reduction
- Perform Feature Engineering
- Shortlist models
- pick complexity of the model for the perfect bias-variance tradeoff
- Fine tune the model
- set Hyperparameters
- data transformations & feature engineering are hyperparameters too
- perform random search, grid search
- Automatic hyper parameterization using Auto-ML
- set Hyperparameters
- Fit on training + validation data, test on test data
- Pick best performing model or aggregate models via Ensemble methods
- Evaluation of ML systems