Full-Stack 2025-11 Featured

Cloud NLP Classification on GCP

Production-ready multi-model text classification service with zero-downtime model switching, deployed on GCP Compute Engine. DistilBERT trained to 96.57% accuracy on a 24,783-sample dataset; 326+ test suite at 100% pass rate.

96.57% Accuracy

GitHub

NLPDistilBERTFastAPIDockerGCPPyTorchscikit-learn

Overview

A production-grade multi-model text classification service built with FastAPI and Docker, featuring zero-downtime model switching between DistilBERT, TF-IDF + LogReg, and TF-IDF + SVM. Deployed live on GCP Compute Engine using an e2-standard-2 instance.

Model Benchmarks

Model	Accuracy	Latency	Cost Factor
DistilBERT	96.57%	60–100ms	Baseline
LogReg (TF-IDF)	85–88%	5ms (21× faster)	—
SVM (TF-IDF)	85–88%	2ms (44× faster)	—

Trained and benchmarked on a 24,783-sample dataset with comprehensive E2E validation.

Key Features

Zero-Downtime Switching: Hot-swap between models without service interruption
326+ Test Suite: Automated E2E validation with 100% pass rate
Cloud Deployment: Live on GCP at ~~$0.07/hr (~~$50/mo)
Multi-Model Architecture: Pluggable model backends behind a unified API

Tech Stack

ML: Hugging Face Transformers, PyTorch, scikit-learn, TF-IDF
API: FastAPI, Uvicorn, Pydantic
Infrastructure: Docker, GCP Compute Engine (e2-standard-2)
Testing: pytest, 326+ automated tests

Related Projects

Full-Stack

Teserax.io — Graph-Based AI Thinking Tool

A dual-lane, chat-first exploration tool that transforms linear LLM chat into a visual, non-linear graph canvas with AI orchestration, crosslink reasoning, multi-model BYOK support, and cloud persistence — shipped v2.0 with 64 issues closed across 5 development phases.