Software Engineer · ML Researcher

Dhairya Mishra

Building and deploying AI/ML production systems at scale — multimodal pipelines, LLM tooling, and distributed data infrastructure. 6+ years shipping computer vision, NLP, and cloud-native services.


Years Experience

Projects Shipped

NYU Courant M.S. CS

ICML 2026 Submission

Affiliated With

NYU Courant

Trine University

CVS Health

Aetna

Evidenza

ACM

ICML

arXiv

Featured Projects

Research and production work spanning ML, full-stack, and cloud systems.

Research

12.6M Multiplayer Frames

Solaris — Multiplayer Video World Model in Minecraft

The first multiplayer video world model to generate consistent first-person observations for two players simultaneously, trained on 12.6M frames of coordinated Minecraft gameplay. Published on arXiv (NYU).

JAX · Diffusion Transformer · Multiplayer · Minecraft

Full-Stack

v2.0 Shipped

Teserax.io — Graph-Based AI Thinking Tool

A dual-lane, chat-first exploration tool that turns linear LLM chat into a visual, non-linear graph canvas with AI orchestration, crosslink reasoning, multi-model BYOK support, and cloud persistence. Shipped v2.0 with 64 issues closed across 5 development phases.

React · React Flow · Hono · Zustand

Full-Stack

96.57% Accuracy

Cloud NLP Classification Service

Production-ready multi-model text classification service with zero-downtime model switching, deployed on GCP. DistilBERT achieves 96.57% accuracy.

NLP · DistilBERT · FastAPI · Docker

ML / AI

91.3% Accuracy

MRI Brain Tumor Detection & Segmentation

Multimodal MRI classification and segmentation model with a shared encoder, trained on the BraTS dataset, achieving 91.3% accuracy and 97.1% sensitivity.

PyTorch · Computer Vision · Medical Imaging · FastAPI


Skills & Technologies

Full-stack proficiency across ML/AI, cloud infrastructure, and modern web frameworks.

Languages

Python TypeScript JavaScript Java C++ SQL HTML/CSS

ML / AI

PyTorch TensorFlow Hugging Face scikit-learn OpenCV RAG ChromaDB FAISS wandb

Cloud & DevOps

EC2 S3 Lambda GKE Docker Kubernetes Terraform GitHub Actions Jenkins ArgoCD PM2

Frameworks & App Dev

FastAPI React Astro Vite TailwindCSS Streamlit Uvicorn Zustand React Flow

Data & Storage

PostgreSQL MongoDB MySQL Pandas NumPy Spark HDFS REST APIs

Testing & Observability

OpenTelemetry Grafana Prometheus Elastic Stack Playwright pytest Pydantic

Experience Timeline

AI/ML Analyst

Jan 2026 — Present

Evidenza · Brooklyn, NY

Automated ingestion of legacy Human-vs-AI survey spreadsheets into a schema-validated database via a 6-stage Spark ETL, producing 7,500+ JSONL records across 26 domains and powering a new enterprise customer segmentation feature

Productionized a Google AlphaEvolve persona generation system to synthesize diverse respondents for survey simulation at scale, achieving >80% Monte Carlo trait-space coverage and ≥65% agreement in blind audits of human vs. synthetic responses

Built competitive ad intelligence dataset by cataloging 1,200+ customer ads across 5 B2B verticals, with feature scoring pipelines (semantic/sentiment + multimodal signals) used in Evidenza recommendations, contributing to a 34% lift in engagement on suggested ads/creatives

Sr. Software Development Engineer

Jan 2023 — Jan 2025

CVS Health · New York, NY

Piloted image-to-alt-text automation using proprietary transformers for enterprise rollout, reducing downstream defects by 15%

Delivered an internal RAG support assistant integrated with Slack, using OpenAI and ChromaDB; accelerated self-serve troubleshooting for teams and reduced manual ticket resolution time by 20%

Reviewed and shipped 125+ PRs and owned on-call for customer-facing core platform systems (Digital-Blocks 2.0, Experience Builder) serving millions of customers daily; reduced downtime by 12%

Built OpenTelemetry-to-Grafana observability and synthetic tests for scale events across deployed applications and microservices; improved debugging efficiency by 25% (measured by median incident MTTR)

Implemented automated UI quality gates by integrating axe-core with Playwright into GitHub Actions pipelines; shifted validation left across supported repos and cut production issues by 35%

Advanced Software Developer

Feb 2022 — Jan 2023

Aetna Health · New York, NY

Developed the test automation tools CAT, RallyScore, and ThemeScore; reduced QA time by 75%

Designed Rally kanban migration pipeline for 600+ nested structures; reduced migration time by 80%

Provisioned encrypted API microservices for MongoDB access; boosted data transaction speeds by 55%

Education

New York University

Courant Institute

M.S. Computer Science (AI)

May 2026 · GPA: 3.7

Trine University

B.S. Software Engineering & Mathematics

Dec 2021 · Angola, IN

Let's Build Something

Interested in collaborating on ML research, full-stack systems, or production AI? I'd love to hear from you.

