Projects
Open source projects and tools.
Callm
High-throughput LLM API orchestration for running many requests reliably and efficiently.
Focus: Performance, reliability, and developer ergonomics.
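As an illustration of what "many requests, reliably" usually involves, here is a minimal sketch of bounded concurrency with retry and exponential backoff. It is not Callm's API: `run_many`, `call_with_retry`, and the `send` callable are hypothetical names chosen for this example, and only the Python standard library is assumed.

```python
import asyncio


async def call_with_retry(send, payload, *, retries=3, base_delay=0.01):
    """Await send(payload), retrying with exponential backoff on any exception."""
    for attempt in range(retries + 1):
        try:
            return await send(payload)
        except Exception:
            if attempt == retries:
                raise  # out of attempts: surface the last error to the caller
            await asyncio.sleep(base_delay * 2 ** attempt)


async def run_many(send, payloads, *, max_concurrency=4, retries=3):
    """Run every payload with at most max_concurrency requests in flight.

    Results are returned in the same order as the input payloads.
    """
    sem = asyncio.Semaphore(max_concurrency)

    async def worker(payload):
        async with sem:  # limits how many requests run at once
            return await call_with_retry(send, payload, retries=retries)

    return await asyncio.gather(*(worker(p) for p in payloads))
```

In this sketch `send` stands in for whatever coroutine actually talks to the model provider; the semaphore caps load on the upstream API while the backoff absorbs transient failures.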
Outformer
Structured outputs from language models — predictable, schema-driven results instead of fragile text parsing.
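To make the contrast with fragile text parsing concrete, the sketch below validates a model's raw JSON reply against a small schema and rejects anything that does not match. It is an assumption-laden illustration, not Outformer's interface: `parse_structured` and the dict-of-types schema format are hypothetical, and the project itself may enforce schemas quite differently (for example during decoding).

```python
import json


def parse_structured(raw, schema):
    """Parse raw JSON text and check it against {field_name: expected_type}.

    Returns only the fields named in the schema; raises ValueError otherwise.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")

    missing = sorted(schema.keys() - data.keys())
    if missing:
        raise ValueError(f"missing fields: {missing}")

    for key, expected in schema.items():
        if not isinstance(data[key], expected):
            raise ValueError(f"field {key!r} is not of type {expected.__name__}")

    # Drop any extra keys so callers see exactly the schema's shape.
    return {key: data[key] for key in schema}
```

A caller would pass the model's reply and a schema such as `{"name": str, "age": int}`, then either receive a dict with exactly those fields or get a `ValueError` to handle (for instance by re-prompting).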
BertDistiller
Knowledge distillation for BERT-style models, inspired by MiniLM: trains smaller, faster student models that aim to retain most of the teacher's accuracy.
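MiniLM-style distillation trains the student to mimic the teacher's self-attention distributions, typically by minimizing a KL divergence between them. The following is a minimal pure-Python sketch of that loss term on toy attention scores. It is not BertDistiller's code: `attention_kl` and `softmax` are hypothetical helpers for illustration, and a real implementation would operate on batched tensors in a deep learning framework.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_kl(teacher_scores, student_scores, eps=1e-12):
    """Mean KL(teacher || student) over rows of raw attention scores.

    Each row holds one query position's unnormalized scores over the keys.
    """
    total = 0.0
    for t_row, s_row in zip(teacher_scores, student_scores):
        p = softmax(t_row)  # teacher attention distribution
        q = softmax(s_row)  # student attention distribution
        # Terms with p_i == 0 contribute nothing; eps guards against q_i == 0.
        total += sum(
            pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0
        )
    return total / len(teacher_scores)
```

The loss is zero when the student's attention matches the teacher's exactly and grows as the two distributions diverge, which is what gives the optimizer a signal to follow.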