Three active research projects, all asking the same question: can we actually trust how we measure AI?
I build the infrastructure and evidence base for trustworthy AI — determined to close the gap between how we think AI systems behave and how they actually do.
I'm Isabella (Thao My) Luong, a research engineer based in Ho Chi Minh City, Vietnam, focused on AI safety evaluations and adversarial robustness. Within five months of entering the field, I'm running three interconnected empirical research projects examining how current evaluation infrastructure fails at scale.
My work spans benchmark integrity (how single-pass evals miss trajectory-level failure), LLM-as-judge bias (how stylistic features distort scoring independent of quality), and cross-session threat detection (catching adversarial actors who fragment attacks across API sessions). These aren't separate interests — they're a systematic examination of where evaluation breaks down.
I was admitted to CAMBRIA (10% acceptance) and accepted to two SPAR Spring 2026 projects, and am embedded in the EA/AI safety institutional ecosystem. I hold a B.Sc. in Information Technology from RMIT University Vietnam as a Vice-Chancellor Merit Scholar.
Dynamic multi-turn benchmark evaluating frontier models on animal welfare reasoning under escalating adversarial pressure. Targeting submission to Inspect Evals (UK AISI); outputs feed into model specifications and training interventions at frontier labs.
End-to-end detection system for cross-session malicious model misuse — targeting adversarial actors who decompose attack queries across multiple API sessions to evade per-session safety classifiers. Targeting publication at USENIX Security, NeurIPS D&B, and ICLR.
Three interconnected empirical projects forming a systematic examination of benchmark and evaluation failure in frontier AI systems — with implications for scalable oversight and reward robustness.
Accepted to the April 2026 cohort of Iliad's month-long intensive on technical AI alignment. Curriculum covers RL, learning theory, mechanistic interpretability, agent foundations, and scalable oversight including Debate. Strong performance serves as a pathway into the Iliad Fellowship (June–August 2026).
1 of 20 participants admitted worldwide. Completed a 3-week technical curriculum covering CNNs, ResNets, and transformers built from scratch; RL (DQN, PPO); RLHF; and mechanistic interpretability. Completed a capstone on automated capability elicitation with LLM-as-judge.
International AI policy and advocacy organisation mobilising youth around AI governance and safety. Establishing funding pipelines with tech corporations and philanthropic funders; organizing panel discussions on AI risks and securing partnerships for technical curriculum delivery across Vietnam.
HPAIR's flagship annual conference uniting global leaders, researchers, and students across policy, technology, and business.
Building Scala AI — an intelligent surgical tutor that performs real-time phase recognition, safety assessment, and structured performance feedback for laparoscopic procedures, improving surgical training through procedure-aware analysis.
Hired as 1 of 7 from 500+ candidates through a rigorous annual campaign for a fast-tracked management position. Embedded as the sole technical hire within a 500-person manufacturing operation — serving as the de facto in-house software engineer and automation consultant for the Finance department and the entire Vietnam site.
Selected among 90 European peers for a 5-day intensive workshop on systems thinking, Balanced Scorecard, and AI-integrated strategy. Presented to an international faculty panel.
Selective fellowship pairing high-potential Vietnamese leaders with McKinsey consultants and NGO partners for real-world consulting casework under evaluation conditions.
Industry-partnered innovation challenge requiring technically grounded, financially validated decarbonization strategies for global logistics operations.
National competition requiring full-stack product development and business validation, judged by blockchain industry leaders.