Software Engineer · ML Researcher
Dhairya Mishra
Building and deploying AI/ML production systems at scale — multimodal pipelines, LLM tooling, and distributed data infrastructure. 6+ years shipping computer vision, NLP, and cloud-native services.
Featured Projects
Research and production work spanning ML, full-stack, and cloud systems.
Solaris — Multiplayer Video World Model in Minecraft
The first multiplayer video world model to generate consistent first-person observations for two players simultaneously, trained on 12.6M frames of coordinated Minecraft gameplay. Published on arXiv (NYU).
Teserax.io — Graph-Based AI Thinking Tool
A dual-lane, chat-first exploration tool that transforms linear LLM chat into a visual, non-linear graph canvas with AI orchestration, crosslink reasoning, multi-model BYOK support, and cloud persistence — shipped v2.0 with 64 issues closed across 5 development phases.
Cloud NLP Classification Service
Production-ready multi-model text classification service with zero-downtime model switching, deployed on GCP. DistilBERT achieves 96.57% accuracy.
MRI Brain Tumor Detection & Segmentation
Multimodal MRI classification and segmentation model with a shared encoder, trained on the BraTS dataset, achieving 91.3% accuracy and 97.1% sensitivity.
Skills & Technologies
Full-stack proficiency across ML/AI, cloud infrastructure, and modern web frameworks.
Languages
ML / AI
Cloud & DevOps
Frameworks & App Dev
Data & Storage
Testing & Observability
Experience Timeline
AI/ML Analyst
Jan 2026 – Present · Evidenza · Brooklyn, NY
Automated ingestion of legacy Human-vs-AI survey spreadsheets into a schema-validated database via a 6-stage Spark ETL pipeline, producing 7,500+ JSONL records across 26 domains and powering a new enterprise customer segmentation feature
Productionized a Google AlphaEvolve persona generation system to synthesize diverse respondents for survey simulation at scale, achieving >80% Monte Carlo trait-space coverage and ≥65% agreement between human and synthetic responses in blind audits
Built a competitive ad intelligence dataset by cataloging 1,200+ customer ads across 5 B2B verticals, with feature scoring pipelines (semantic/sentiment + multimodal signals) used in Evidenza recommendations, contributing to a 34% lift in engagement on suggested ads/creatives
Sr. Software Development Engineer
Jan 2023 – Jan 2025 · CVS Health · New York, NY
Piloted image-to-alt-text automation using proprietary transformers for enterprise rollout, reducing downstream defects by 15%
Delivered an internal RAG support assistant integrating Slack with OpenAI and ChromaDB; accelerated self-serve troubleshooting for teams and reduced manual ticket resolution time by 20%
Reviewed and shipped 125+ PRs and owned on-call for customer-facing core platform systems (Digital-Blocks 2.0, Experience Builder) serving millions of customers daily; reduced downtime by 12%
Built OpenTelemetry-to-Grafana observability and synthetic tests for scale events across deployed applications and microservices; reduced median incident MTTR by 25%
Implemented automated UI quality gates by integrating axe-core with Playwright into GitHub Actions pipelines; shifted validation left across supported repos and cut production issues by 35%
Advanced Software Developer
Feb 2022 – Jan 2023 · Aetna Health · New York, NY
Developed the test automation suites CAT, RallyScore, and ThemeScore; reduced QA time by 75%
Designed a Rally Kanban migration pipeline for 600+ nested structures; reduced migration time by 80%
Provisioned encrypted API microservices for MongoDB access; boosted data transaction speeds by 55%
Education
New York University
Courant Institute
M.S. Computer Science (AI)
Trine University
B.S. Software Engineering & Mathematics
Let's Build Something
Interested in collaborating on ML research, full-stack systems, or production AI? I'd love to hear from you.