Profile

Hi, I'm Soham
AI Developer

GitHub | LinkedIn | PyPI

About Me

Meet Soham

I'm Soham, an AI & Machine Learning Engineer passionate about building intelligent, high-impact systems. I specialize in developing high-accuracy ML models, scalable AI applications, and multimodal solutions that combine computer vision, NLP, and generative AI.

Artificial Intelligence, Machine Learning, Deep Learning, Data Science, Agentic AI, Generative AI, Web Development
AI & SDE Intern, SR Counselling, Dec 2024 - Oct 2025
GenAI Developer, BCGX (Job Simulation), May 2025
AI Developer Intern, UnLawc, Sept 2025 - present

Technical Skills

Programming Languages

Python
JavaScript
Java
SQL

AI/ML

PyTorch
TensorFlow
LangChain
LangGraph
Transformers
Neural Networks
Computer Vision

Achievements

2nd Position IIT Ropar

@Medino'sXAdvitiya'25

Secured the runner-up (2nd) position at Medino'sXAdvitiya'25, hosted at IIT Ropar in 2025. The event was an online hackathon that challenged my AI/ML skills, and I built a symptom-analyzer chatbot and an OCR-based prescription reader.

Featured Projects

RegressNCompare

Python-based machine learning tool for regression analysis and model comparison, helping evaluate different regression algorithms.

VISION AI

An AI-powered solution for efficient video and image analysis that uses machine learning models for visual data processing and image upscaling.

Cold email generator

A generative AI project that uses LangChain and Llama 3.1 to generate context-aware cold emails and automate related tasks efficiently.

MOSAIC

MOSAIC (Multimodal Orchestration for Synthesis, Analysis & Intelligent Comprehension) is an AI-powered video analysis platform with a FastAPI backend, a React frontend, and an MCP server. It can process videos, extract and create clips, answer questions about video content, and perform multimodal search using vision models and transcription.

Autogen Agents

A multi-agent system demonstrating the AutoGen framework with Gemini 2.5 Pro, Docker for code execution, and Streamlit interfaces for practical AI applications.

RAG Doc-Chat

An AI-powered tool that allows users to interact with PDF files through chat-based queries, enabling easy and efficient information retrieval.