This repository contains notebooks for preparing data, comparing machine learning packages, and training models. The goal of this project is to preprocess data, evaluate different machine learning algorithms, and train a predictive model.
Comparison_Of_Packages.ipynb
: Notebook comparing different machine learning packages and initial test on data.Preparing_Data_For_Training.ipynb
: Notebook for data preprocessing and preparation of dataframe for training the models.Training.ipynb
: Notebook for training and evaluating machine learning models from sklearn (lr, rf, xgboost, mlp) and PyTorch (mlp).
This notebook (Comparison_Of_Packages.ipynb
) compares different machine learning packages and evaluates their performance on a given dataset.
- Load and preprocess the dataset.
- Compare the performance of various machine learning models.
- Visualize the results and select the best package.
The notebook (Preparing_Data_For_Training.ipynb
) focuses on data preprocessing steps necessary for model training.
- Load the raw dataset.
- Handle missing values and outliers.
- Do feature selection.
- Save the processed data for model training.
The notebook (Training.ipynb
) covers the training, evaluation, and selection of the machine learning model.
- Load the preprocessed data.
- Define and initialize the model.
- Train the model using training data and validation.
- Perform Student T-Test to choose the best models from all trained models.
- Evaluate the performance of the best model on testing data.
- Save the trained model for future predictions.