The new glum release introduces a formula interface that simplifies preprocessing and offers various enhancements for categorical and mixed sparse and dense data.
We present an open source library to shrink pickled scikit-learn and lightgbm models. We will provide insights of how pickling ML models work and how to improve the disk representation. With this approach, we can reduce the deployment size of machine learning applications up to 6x.