‘Steppy’ is a light python-3 library that can be used for fast and reproducible data science/machine learning experimentation. It reduces the burden of data scientists from software development issues. The minimal interface does not impose constraints for ‘Steppy’; instead, it enables clean machine learning pipeline design.
‘Steppy’ solves some of the data science project problems with the help of minimal interface for building machine learning pipelines. It uses two simple abstractions:
Step: It is the execution wrapper over the transformer. Example: Checking intermediate results.
Tranformer: It represents the computation step and performs operation on data. Mostly, Transformers are neural networks, machine learning algorithms..
python3.5 or above.
pip3 install steppy
Getting started with steps (Code Source: https://github.com/neptune-ml/steppy-examples/blob/master/tutorials/1-getting-started.ipynb)
This notebook shows how to create steps, fit them to data, transform new data and take advantage of persistence
%load_ext autoreload %autoreload 2 import numpy as np import matplotlib.pyplot as plt %matplotlib inline from steppy.base import Step, BaseTransformer EXPERIMENT_DIR = './ex1'
import shutil # By default pipelines will try to load previously trained models so we delete the cache to ba sure we're starting from scratch shutil.rmtree(EXPERIMENT_DIR, ignore_errors=True)
Tutorial notebooks (their repository):
Credit: Google News