Found — Your radio. Live streaming sessions for music creators.
Coffee Meets Bagel
Built LLM eval + safety systems in prod — doubled pass rates, cut violations.
Found — Your radio. Live streaming sessions for music creators.
SGInnovate
Found hidden risks and defence signals across Singapore's national deep-tech venture capital portfolio — built ML systems + slides.
Found — Your radio. Live streaming sessions for music creators.
Origin
Rebuilding approach to mental health for a disillusioned generation. [DM for access]
Found — Your radio. Live streaming sessions for music creators.
myResponders
Improving cardiac arrest alert response rates among Singapore's first responders. [Conceptual work]
Experience
Applied LLM Lead — Coffee Meets Bagel
January 2026 — Present

ML and Venture Strategy Intern — SGInnovate
Jul 2025 — Jan 2026

ML Lead — FDM Consulting
Jan 2025 — Mar 2025

Growth Research Intern — Duke-NUS
Jan 2019 — Mar 2019
Intro
Applied ML Engineer currently under an attachment at Coffee Meets Bagel working on LLM evaluation, safety and reliability - where I build systems to assess model behaviour before real-world deployment. Alongside this, I've led work across healthcare tools, working on neuro-AI research, and decision-support systems, with an emphasis on failure modes and user experience. Currently pursuing BSc (Hons) in Data Science and Business Analytics at University of London-SIM.
Research
I study how systems influence human agency under uncertainty with a focus on human-AI interaction and how system design influences behaviour and perception.

Papers:
𐄁 Agency Compliance in Sequential Systems (preprint in progress)
𐄁 AI Autophagy in Decision Systems: Neural Data as an Exogenous Signal (academic paper, UOL)
𐄁 When Systems Shape What We Think: Cognitive Warfare in Democracy (1st Place, IAS Conference 2024)

Open Questions
𐄁 How can agency be operationalised in sequential decision systems without collapsing into proxy metrics?
𐄁 Where do interpretability tools fail to capture causal structure in real-world data?
𐄁 What evaluation regimes detect harm before deployment, not after?
All Wrongs Reserved.