- Published on
Python
All Posts
python (12)
- onnx (3)
- array-api (3)
- ndonnx (3)
- github (2)
- conda (2)
- data (2)
- performance (2)
- mlops (2)
- spox (2)
- polars (2)
- sqlalchemy (2)
- automation (1)
- pre-commit (1)
- tests (1)
- sklearn (1)
- compression (1)
- pickling (1)
- ui (1)
- glm (1)
- causal-inference (1)
- regex (1)
- numpy (1)
- pixi (1)
- packaging (1)
- ast (1)
- compilers (1)
- pytest (1)
- github-actions (1)
- snowflake (1)
- integration-tests (1)
- pandas (1)
- Published on
TabularDelta reports table comparisons in different formats for various types of tables and use cases.- Published on
- Published on
Estimating Conditional Average Treatment Effects with the metalearners library- Published on
We show how we ship conda environments into production using the pixi package manager. To do that, we introduce a tool called pixi-pack.- Published on
The new glum release introduces a formula interface that simplifies preprocessing and offers various enhancements for categorical and mixed sparse and dense data.- Published on
pytest-action is a GitHub Action that allows you to run pytest and output GitHub Job Summaries. It simplifies your workflow by integrating pytest-md and pytest-emoji.- Published on
Tired of writing complicated Polars expressions? polarIFy automatically translates your easy-to-read Python methods into efficient Polars expressions!- Published on
In this post we discuss how we improved the runtime performance of a text mining step in a machine learning pipeline by a factor of 12.- Published on
We present an open source library to shrink pickled scikit-learn and lightgbm models. We will provide insights of how pickling ML models work and how to improve the disk representation. With this approach, we can reduce the deployment size of machine learning applications up to 6x.- Published on
When running the integration test suite of a data validation tool against a Snowflake instance, we saw a massive slow-down compared to Postgres or MS SQL.- Published on
Developing code involves several tasks that are simple yet repetitive. This includes styling your code (we use `black`) and checking for common issues. These tasks can be easily automated.