LLM Engineering Pipeline (Focus: Trust & Safety)
Coffee Meets Bagel · 2026
- Built end-to-end experimentation platform, reducing setup time by ~70%
- Designed evaluation framework (rule-based + LLM-as-judge), improving pass rate from 41.6% to 88.6%
- Implemented multi-layer safety architecture, reducing violation rate from 8.4% to 1.9%
Stack: Python, OpenAI API, Pandas