Projects

Selected work

The Three Selves Eval Framework (Draft)

Behavioral framework for measuring LLM identity stability under adversarial pressure.

Tools for generating hyper-realistic backgrounds for 3D renders with SGK, reducing production costs and time.

Evaluation framework for autonomous agents in project management using Dockerized environments (Plane, Rocket.Chat).

Retrieval-augmented generation system for CMU domain QA. 293-entry test dataset, multiple model experiments including Mistral and DeBERTa.

Three-agent framework for cancer-domain Question Answer synthesis. 800+ entry dataset, 40%+ Turing Test achievement.

Case study on Shell's approach to AI adoption and enterprise innovation through CXO interviews.

Research on Fortune 500 AI adoption through interviews, state-of-the-art reports, and organizational readiness surveys.