Projects
Selected work
The Three Selves Eval Framework (Draft)
Behavioral framework for measuring LLM identity stability under adversarial pressure.
3D Product Placement in Ad Creatives
Tools for generating hyper-realistic backgrounds for 3D renders with SGK, reducing production costs and time.
ProjectArena — Evaluation Suite for Project Management Agents
Evaluation framework for autonomous agents in project management using Dockerized environments (Plane, Rocket.Chat).
RAG for Academic Question-Answering
Retrieval-augmented generation system for CMU domain QA. 293-entry test dataset, multiple model experiments including Mistral and DeBERTa.
Collaborative Agents for QA Generation
Three-agent framework for cancer-domain Question Answer synthesis. 800+ entry dataset, 40%+ Turing Test achievement.
Corporate Startup Lab — Shell
Case study on Shell's approach to AI adoption and enterprise innovation through CXO interviews.
Driving Organizational Transformation — Enterprise AI
Research on Fortune 500 AI adoption through interviews, state-of-the-art reports, and organizational readiness surveys.