Harsh Tomar

I build AI systems that actually work — from tracking tennis balls at 30 FPS to generating novel molecules

AI/ML Developer building CV systems and LLM pipelines since 2024. Implemented Vision Transformers, VLMs, and LoRA from scratch in PyTorch. Tennis Vision earned 31 GitHub stars for detecting players at 95% accuracy and 30 FPS with a custom-annotated dataset. Currently fine-tuning a small medical LLM on PubMedQA and extending Tennis Vision with multi-camera support. I ship code, break things, fix them, and document it all.

Technical Skills

Programming Languages

Python JavaScript TypeScript Bash

ML / AI Frameworks

PyTorch TensorFlow Keras Scikit-learn HuggingFace XGBoost LightGBM Optuna

Computer Vision

YOLOv5–v8 OpenCV ONNX ByteTrack Supervision PIL Optical Flow Object Detection Image Segmentation

Generative AI & LLM

LangChain LangGraph LlamaIndex CrewAI AG2 (AutoGen) RAG LoRA / QLoRA / PEFT Prompt Engineering OpenAI API Anthropic API Gemini API

Data Science

NumPy Pandas Matplotlib Seaborn Statistical Analysis Feature Engineering

MLOps & Cloud

Docker Kubernetes AWS (EC2, S3, EKS) GCP MLflow DVC ZenML BentoML FastAPI GitHub Actions Prometheus Grafana

Databases

MongoDB PostgreSQL FAISS Pinecone Chroma Vector DBs

Development Tools

Git Jupyter VS Code Streamlit Flask Gradio HuggingFace Spaces Firebase Netlify CI/CD

XAI / Explainability

SHAP LIME Grad-CAM Feature Importance ViT Attention Maps

Professional Experience

AI Intern

i3 Digital Health May 2025 – Jan 2026

Architected an intelligent research profiling system that automatically generates researcher profiles by aggregating data from PubMed, ResearchGate, and Google Scholar APIs. Built RAG-powered search agents providing contextual recommendations and identifying potential research collaborators. Collaborated with healthcare professionals to refine search algorithms and improve researcher profiling accuracy. Engaged full-time for 3 months, then continued part-time for 4 months. Stack: LangChain, FastAPI, Docker.

Multi-source API integration RAG-powered search Healthcare AI

Volunteer Contributor

CNCF & Google Developer Groups Jan 2023 – Present

Volunteer member of Cloud Native Computing Foundation and Google Developer Groups. Participated in 15+ cloud-native technology discussions at community meetups. Delivered 2 tech talks on AI/ML best practices at college tech events. Provided peer guidance on ML concepts and project feedback to fellow students at workshops.

15+ community discussions 2 tech talks Volunteer

By the Numbers

31
Stars on Tennis Vision
3
Upstream OSS PRs merged
4+
Live deployed demos

Open Source Contributions

#12 merged

BBoxMaskPose

Added Docker support for reproducible environment setup — ICCV 2025 paper implementation for detection, pose estimation & segmentation

#23 merged

multimodal-agents-course

Fixed missing Kubrick UI source file that caused Docker build failure in the multimodal agents course

#6849 merged

hive

Refactored and removed deprecated FileStorage backend class (267 lines) from the Hive platform

Education

Bachelor of Technology in AI & Data Science

Lakshmi Narain College of Technology, Bhopal

Nov 2022 - May 2026

Relevant Coursework: Machine Learning, Computer Vision, Deep Learning, NLP, Data Structures & Algorithms, Reinforcement Learning, Statistical Analysis, Neural Networks

Recommendation

"I was impressed by Harsh's commitment and technical prowess — he attacks each challenge with enthusiasm, learning desire, and will to accomplish. His interest in Machine Learning, Computer Vision, and AI has surpassed what one might initially expect from someone at his level."
Yashvardhan Singh Software Engineer at BARCO, B.Tech. IIT Delhi

Open to AI/ML internships and full-time opportunities — feel free to reach out via the links above.