
Hi, I'm Soham
AI Developer
About Me
Meet Soham
I'm Soham, an AI & Machine Learning Engineer passionate about building intelligent, high-impact systems. I specialize in developing high-accuracy ML models, scalable AI applications, and multimodal solutions that combine computer vision, NLP, and generative AI.


Technical Skills
Programming Languages
AI/ML
Achievements
2nd Position IIT Ropar
@Medino'sXAdvitiya'25
Secured the runner-up (2nd) position at Medino'sXAdvitiya'25 (2025), hosted at IIT Ropar. The event was an online hackathon that challenged my AI/ML skills, and I built a symptom analyzer chatbot and an OCR-based prescription reader.
Featured Projects

VISION AI
An AI-powered solution for efficient video and image analysis that uses machine learning models to improve visual data processing and superscaling.
MOSAIC
MOSAIC (Multimodal Orchestration for Synthesis, Analysis & Intelligent Comprehension) is an AI-powered video analysis platform with a FastAPI backend, a React frontend, and an MCP server. It processes videos, extracts clips, answers questions about the content, and performs multimodal search using vision models and transcription.
Autogen Agents
A multi-agent system demonstrating the AutoGen framework with Gemini 2.5 Pro, Docker for code execution, and Streamlit interfaces for practical AI applications.

RAG Doc-Chat
An AI-powered tool that allows users to interact with PDF files through chat-based queries, enabling easy and efficient information retrieval.

