simpleML

No-code Machine learning (Pre-alpha)

:soon:

simpleML is a command-line machine learning utility written in Python 3.7 that wraps existing ML libraries.

The aim is no-code ML training that still lets you use many techniques, models, and parameters with just a few clicks, and that outputs neat HTML reports with plots and data analysis, giving transparency on the data, the models, and the analysis. These reports can help shorten the hypothesis-to-insight cycle of an ML experiment.

Although many cloud providers offer a similar option, it may not always be worth using and can even be costlier, while a CLI is simpler.

Currently, linear regression and binary classification can be run from the command line. An interactive terminal lets you configure your machine learning pipelines and preprocessing steps.

The analysis runs in a Jupyter notebook, which can easily be changed as required while the existing framework still provides the reports and CLI configuration.

Linear Regression

The Regression module is a supervised machine learning module used for estimating the relationships between a dependent variable (often called the ‘outcome variable’ or ‘target’) and one or more independent variables (often called ‘features’, ‘predictors’, or ‘covariates’). The objective of regression is to predict continuous values, such as a sales amount, a quantity, or a temperature. The module supports several preprocessing features that prepare the data for modeling, all configured through the CLI. It has over 25 ready-to-use algorithms and several plots for analyzing the performance of trained models.

Option to run on default configuration (docs :soon:).

Provides preprocessing configuration for EDA & making data ready.

Compares the results of 25 regression algorithms on the metrics below and reports the best model, while still letting users override that choice and run their model of interest:

R2

MAE

MAPE

RMSE

Option for automatic hyperparameter tuning based on random grid search.

Creates a detailed HTML report with:

Residual plots

Feature Importance plot

Prediction Error plot

Learning Curve plot

Cook's Distance plot

Validation Curve plot

SHAP plots for SHapley Additive exPlanations

Pickles the trained model for reuse.
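To make the model comparison above concrete, here is a minimal sketch of how R2, MAE, MAPE, and RMSE can be computed and used to pick a best model. It is not simpleML's own code: the function names `regression_metrics` and `best_model`, the model names, and the numbers are all hypothetical, and only the Python standard library is used.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return R2, MAE, MAPE and RMSE for two equal-length lists of numbers."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                 # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "MAE": sum(abs(e) for e in errors) / n,
        # MAPE is undefined when a true value is 0; this sketch assumes none are.
        "MAPE": sum(abs(e / t) for e, t in zip(errors, y_true)) / n,
        "RMSE": math.sqrt(ss_res / n),
    }


def best_model(y_true, predictions, metric="R2"):
    """Return (name of the best model on `metric`, all scores by model name)."""
    scores = {name: regression_metrics(y_true, pred)
              for name, pred in predictions.items()}
    # Higher is better for R2; lower is better for the three error metrics.
    pick = max if metric == "R2" else min
    return pick(scores, key=lambda name: scores[name][metric]), scores


# Hypothetical hold-out targets and predictions from two made-up models.
y_true = [10.0, 20.0, 30.0, 40.0]
predictions = {
    "model_a": [12.0, 18.0, 33.0, 37.0],
    "model_b": [10.0, 25.0, 25.0, 40.0],
}
winner, scores = best_model(y_true, predictions, metric="RMSE")
print(winner, scores[winner])
```

In this toy data the two models tie on MAE, `model_a` wins on RMSE and R2, and `model_b` wins on MAPE, which is one reason an option to override the automatically selected model is useful.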

Binary classification

The Classification module is a supervised machine learning module used for classifying elements into groups. The goal is to predict categorical class labels, which are discrete and unordered. Common use cases include predicting customer default (yes or no), customer churn (the customer will leave or stay), and disease detection (positive or negative). This module currently supports binary classification and provides several preprocessing features that prepare the data for modeling through the CLI. It has over 18 ready-to-use algorithms and several plots for analyzing the performance of trained models.

Option to run on customized preprocessing configurations (docs :soon:).

Provides preprocessing configuration for EDA & making data ready.

Runs 18 classification algorithms, compares them on the metrics below, and reports the best model, while still letting users override that choice and run their model of interest:

Accuracy

AUC

Recall

Precision

F1

Kappa

Option for automatic hyperparameter tuning based on random grid search.

Creates a detailed HTML report with the following plots:

Area Under the Curve

Discrimination Threshold

Precision Recall Curve

Confusion Matrix

Class Prediction Error

Classification Report

Decision Boundary

Recursive Feature Selection

Learning Curve

Manifold Learning

Calibration Curve

Validation Curve

Dimension Learning

Feature Importance

Model Hyperparameter

SHAP plots for SHapley Additive exPlanations

Pickles the trained model for reuse.
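As an illustration of the comparison metrics listed above, the sketch below computes Accuracy, Recall, Precision, F1, and Cohen's Kappa from 0/1 labels using only the standard library. It is a hedged example, not simpleML's implementation: the function name `binary_metrics` and the sample labels are hypothetical, and AUC is left out because it needs predicted scores rather than hard labels.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, Recall, Precision, F1 and Cohen's kappa for 0/1 labels.

    Assumes both classes occur in the true labels and in the predictions,
    so none of the denominators below is zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    n = len(pairs)

    accuracy = (tp + tn) / n
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    f1 = 2 * precision * recall / (precision + recall)

    # Kappa corrects the observed agreement (accuracy) for the agreement
    # expected by chance given the label and prediction frequencies.
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - expected) / (1 - expected)

    return {
        "Accuracy": accuracy,
        "Recall": recall,
        "Precision": precision,
        "F1": f1,
        "Kappa": kappa,
    }


# Hypothetical labels: 4 positives and 6 negatives, with 3 mistakes.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Here the accuracy is 0.7, but Kappa is only 0.4, because half of the agreement would be expected by chance with these class frequencies.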

Sample demos (as of 30 May 2020) - View here

Creating Linear regression with default configuration on Boston dataset

Creating Linear regression model with customized preprocessing configurations

Creating Binary Classification model with default preprocessing configurations on credit card dataset

Creating Binary Classification with default preprocessing configurations on credit card dataset

Install & run

git clone https://github.com/iamlmn/simpleML.git
cd simpleML
pip install -r requirements.txt
python3 auto_regression/main.py

TODOs and completed work :

:octocat: Contributions and ideas are welcome.