Lab
Projects, writings, and experiments
A mix of things I'm working on, thinking about, or just playing with.
Projects
The Three Selves Eval Framework (Draft)
Behavioral framework for measuring LLM identity stability under adversarial pressure.
3D Product Placement in Ad Creatives
Tools for generating hyper-realistic backgrounds for 3D renders with SGK, reducing production costs and time.
ProjectArena — Evaluation Suite for Project Management Agents
Evaluation framework for autonomous agents in project management using Dockerized environments (Plane, Rocket.Chat).
RAG for Academic Question-Answering
Retrieval-augmented generation system for CMU domain QA. 293-entry test dataset, multiple model experiments including Mistral and DeBERTa.
Collaborative Agents for QA Generation
Three-agent framework for cancer-domain Question Answer synthesis. 800+ entry dataset, 40%+ Turing Test achievement.
Corporate Startup Lab — Shell
Case study on Shell's approach to AI adoption and enterprise innovation through CXO interviews.
Driving Organizational Transformation — Enterprise AI
Research on Fortune 500 AI adoption through interviews, state-of-the-art reports, and organizational readiness surveys.
Clay/Play