Featured Projects - Harsh Tomar

Tennis Vision Live Demo

⭐ 23 2025

Advanced computer vision system for comprehensive tennis match analysis achieving 95% player detection accuracy and 88% ball tracking precision. Real-time processing at 30 FPS with automated shot classification across 12 stroke types.

Technical Implementation

Player Detection: YOLOv8x achieving 92.8% mAP@0.5 with confidence threshold 0.7 and size filtering (20x50px minimum)
Ball Tracking: Custom YOLOv8 model trained on 578 images achieving 87.3% mAP@0.5 with polynomial interpolation for smooth trajectories
Court Detection: ResNet-50 based keypoint detection for 14 landmarks with 91.5% accuracy within 5-pixel tolerance
Shot Classification: Rule-based classifier with 89.4% accuracy analyzing player position, ball trajectory, and temporal context
Performance: 6.67 FPS processing speed with 94% memory reduction through ROI optimization

Key Features

Real-time player tracking with position heatmaps and movement analysis
Ball trajectory visualization with shot moment identification
Automated classification: Serve (95.2%), Forehand (87.8%), Backhand (86.1%), Volley (91.3%), Smash (93.7%)
Mini-court visualization providing bird's-eye view of match dynamics
Comprehensive statistics dashboard with player speeds and shot analytics

95% Player Detection

88% Ball Tracking

30 FPS Processing Speed

12 Stroke Types

PyTorch • OpenCV • YOLOv8 • Supervision • ResNet-50

QuantaAI

2025

Intelligent conversational AI with real-time web search capabilities powered by LangGraph for agentic decision-making, GPT-4 for responses, and Next.js for modern UI. Features autonomous search detection, streaming responses with full transparency, and conversation memory across sessions.

Architecture

LangGraph State Machine: 4-node graph (Classifier → Search → Generate → Response) with conditional routing based on query analysis
Intelligent Routing: GPT-4 powered classifier achieving 94% accuracy in determining search necessity vs direct response
Search Integration: Tavily API with relevance scoring, result ranking, and automatic source attribution
Memory System: Conversation history with context window management (last 10 messages) and semantic compression
Streaming Pipeline: Real-time token streaming with intermediate state updates for transparency

Features

Autonomous decision-making: AI determines when web search is needed without explicit commands
Multi-stage transparency: Users see classification, search, and generation phases in real-time
Source attribution: All search-based responses include clickable source links with relevance scores
Context-aware responses: Maintains conversation flow with intelligent context retention
Modern UI: Next.js 14 with TypeScript, Tailwind CSS, and responsive design

94% Routing Accuracy

4-Node State Graph

Real-time Streaming

GPT-4 Powered

LangGraph • GPT-4 • Next.js 14 • TypeScript • Tavily API • Tailwind CSS

MoleCuQuest

2025

Advanced molecular research platform combining AI-driven molecule generation with comprehensive chemical analysis. Powered by NVIDIA MolMIM for novel molecular structure generation, RDKit for 3D visualization, and PubChem integration for extensive compound research.

Core Capabilities

AI-Powered Generation: NVIDIA MolMIM for generating novel molecular structures with desired properties
3D Visualization: Real-time interactive molecular structure rendering using RDKit
Chemical Analysis: PubChem API integration for comprehensive compound research and property analysis
CMA-ES Algorithm: Covariance Matrix Adaptation Evolution Strategy for molecular optimization
QED Scoring: Quantitative Estimate of Drug-likeness for pharmaceutical applications

NVIDIA MolMIM AI

3D Visualization

Real-time Collaboration

CMA-ES Optimization

Next.js 14 • TypeScript • NVIDIA MolMIM • RDKit • MongoDB • PubChem API

Field Fusion Live Demo

2024

Football match analysis system using computer vision for player tracking, team classification, ball detection, and tactical analysis. Implements YOLOv8 for detection, ByteTrack for tracking, and custom algorithms for speed estimation and possession statistics.

Computer Vision Pipeline

Player Detection: YOLOv8 fine-tuned on football dataset achieving 91% mAP
Team Classification: K-means clustering on jersey colors with 89% accuracy
Player Tracking: ByteTrack algorithm with 94% MOTA score
Speed Estimation: Perspective transformation with camera calibration

91% Detection mAP

94% Tracking MOTA

89% Team Classification

Real-time Processing

YOLOv8 • ByteTrack • OpenCV • K-means • Perspective Transformation

DeepGuard Live Demo

2025

Production-grade MLOps pipeline for deepfake detection using transfer learning with Xception architecture. Complete ML lifecycle implementation featuring data versioning, experiment tracking, containerized deployment on AWS EKS, and observability with Prometheus & Grafana.

Machine Learning Pipeline

Model Architecture: Xception transfer learning fine-tuned on GenImage dataset
Data Management: DVC for automated ingestion, preprocessing, and versioning
Experiment Tracking: MLflow + DagsHub for comprehensive logging
Deployment: Docker + AWS EKS with CI/CD automation

DVC Data Versioning

MLflow Experiment Tracking

EKS AWS Deployment

Grafana Monitoring

TensorFlow • Xception • DVC • MLflow • Docker • AWS EKS • Prometheus • Grafana

Decifra Live Demo

2025

End-to-end MLOps fraud detection pipeline with explainable AI featuring ZenML orchestration, MLflow experiment tracking, DVC data versioning, and BentoML model serving. Achieves 88.52% PR-AUC with SHAP & LIME explanations for transparent predictions.

MLOps Pipeline Architecture

ZenML Orchestration: 9-step pipeline: ingestion → validation → preprocessing → feature engineering → training → evaluation → selection → explainability → registration
ML Models: XGBoost + LightGBM + Random Forest ensemble with Optuna hyperparameter optimization; SMOTE for handling class imbalance
Experiment Tracking: MLflow for comprehensive logging, model registry, and versioning; DVC for data versioning
Model Serving: BentoML for API deployment with Great Expectations data validation
Explainability: SHAP for global feature importance and LIME for instance-level explanations

Key Features

Real-time fraud prediction with transparent decision explanations
Interactive Streamlit dashboard for monitoring and analysis
Docker containerization with CI/CD via GitHub Actions
Comprehensive feature contribution analysis for each prediction

88.52% PR-AUC Score

9-Step ZenML Pipeline

SHAP Explainability

MLflow Tracking

ZenML • MLflow • DVC • BentoML • XGBoost • LightGBM • SHAP • LIME • Streamlit • Docker

Breast Cancer Histopathology Analysis

2025

Deep learning system for breast cancer detection from histopathology images with Grad-CAM explainability visualizations. Features MobileNetV2 transfer learning, OpenCV feature extraction, and automated clinical report generation for medical professionals.

Breast Cancer Histopathology Analysis Demo

Technical Implementation

Deep Learning Classification: MobileNetV2 backbone with transfer learning for binary classification (Benign vs Malignant) with confidence scoring
Grad-CAM Visualization: Visual explanation of model attention highlighting regions influencing classification decisions with quadrant-specific analysis
Feature Extraction: OpenCV-based analysis including nuclear density, chromatin patterns, tissue architecture, and H&E staining intensity
Report Generation: Comprehensive HTML reports with original image, heatmap overlay, feature analysis, and clinical interpretation

Key Features

Drag-and-drop upload interface supporting multiple images per session
Automatic preprocessing and normalization for model inference
Nuclear density, size, and cell count estimation via OpenCV
Downloadable clinical reports for documentation purposes

MobileNetV2 Architecture

Grad-CAM Explainability

OpenCV Feature Extraction

Auto Report Generation

PyTorch • MobileNetV2 • Grad-CAM • OpenCV • Transfer Learning • XAI • Medical AI