Driven AI Engineer passionate about building and deploying advanced Large Language Model (LLM) and Generative AI (GenAI) solutions that solve real business problems. Specialized in designing scalable, end-to-end machine learning systems, transforming ideas from prototype to production with measurable results.
LLM Ops: Prompt engineering, Retrieval-Augmented Generation (RAG), pipeline orchestration, custom LLM fine-tuning, deploying models at scale (GPT-4, Claude, Llama)
MLOps & CI/CD: MLflow, Docker, AWS SageMaker, Lambda, Kubernetes
GenAI apps: Chatbots, Document search(Docubot), E-commerce Gen AI, medical cost prediction
Tech stack: Python, PyTorch, TensorFlow, LangChain, RAG, Gen AI, ChromaDB, FAISS, OpenAI APIs, FastAPI, AWS (Bedrock, SageMaker)
I’m always eager to connect with professionals and teams on the leading edge of AI. Feel free to reach out for a casual coffee chat about any topic that piques your interest!
GPA: 3.57 / 4.0
GPA: 3.62 / 4.00
A Python and PyTorch-based application that fine-tunes LLaMA 3.2-3B with LoRA for predicting medical insurance costs. Achieved 0.21 loss on 1,338 records while reducing training time by 75%, and delivered a full production pipeline with custom tokenization and inference.
Python PyTorch Transformers Unsloth Fine-tuning (LoRA) Pandas
A Python and Streamlit-based AI tool for real-time web data extraction using RAG and Chroma DB. Achieved 95%+ accurate, source-backed answers, improved document retrieval efficiency by 80%, and delivered a responsive, cloud-deployed UI.
Python Streamlit LangChain RAG Chroma DB Sentence Transformers
Built a LLaMA 3.3–powered chatbot with RAG and Chroma DB/SQLite for real-time product responses and 24/7 support, increasing user engagement by 40%. Reduced hallucinations by 30% through multi-prompt testing and deployed semantic routing with intent classification for accurate FAQ and product query handling via a Streamlit Cloud–hosted interface.
Python, Semantic Router, Chroma DB, SQLite, RAGPython RAG Gen AI Semantic Router Chroma DB SQLite
Developed a Python-based AI HR system automating employee lifecycle tasks, cutting manual workload by 60%. Leveraged Pydantic validation, FastMCP, and Gmail SMTP for reliable, enterprise-grade operations.
Python MCP SMTP Integration Claude Desktop
Created an end-to-end MLOps pipeline for gemstone classification using Python, MLflow, DVC, and AWS S3. Automated data preprocessing and model training, reduced data retrieval time by 30%, and ensured reproducible, scalable deployments via Docker and GitHub Actions.
Python MlFlow Airflow DVC GitHub Actions Aws S3 Docker