We build a lot. Some of it is useful enough to share.
Bulk load CSV/Parquet files into ClickHouse with automatic schema inference and retry logic.
View on GitHub →

Match company names and phones to the USFD business database. Returns firmographics and executive contacts.
View on GitHub →

GPU-accelerated USDA Cropland Data Layer extraction for farm parcel analysis using CuPy.
View on GitHub →

A library of ClickHouse and Snowflake patterns for business intelligence workloads.
View on GitHub →

Score any business list against a known-good customer set using embedding similarity.
View on GitHub →

Match farm operator records to parcel boundaries using fuzzy name matching and geospatial joins.
View on GitHub →

These tools are shared as-is. Pull requests welcome.