tech.quantco.com

A QuantCo Engineering Blog

All Posts

Published on

July 31, 2022

Introducing the multiregex library

python performance regex

In this post we discuss how we improved the runtime performance of a text mining step in a machine learning pipeline by a factor of 12.
Published on

June 23, 2022

Optimize pickling disk space for deploying scikit-learn trees to production

sklearn python compression pickling

We present an open source library to shrink pickled scikit-learn and lightgbm models. We will provide insights of how pickling ML models work and how to improve the disk representation. With this approach, we can reduce the deployment size of machine learning applications up to 6x.
Published on

June 20, 2022

Datajudge: A library for data tests across data sources

data tests

Datajudge is a Python library for expressing and testing expectations against data from database.
Published on

May 19, 2022

UI component sharing for enterprises

ui

How to properly set up a React UI component library and share it across your organization.
Published on

May 12, 2022

Fixing a Snowflake performance issue around introspection

snowflake python sqlalchemy integration-tests

When running the integration test suite of a data validation tool against a Snowflake instance, we saw a massive slow-down compared to Postgres or MS SQL.