Hi, I'm Vivek 👋
AI Engineer and Researcher. I love building intelligent systems and solving complex problems with cutting-edge technology.
VV

About

Graduated with a B.Tech in AI and ML from Usha Rama College of Engineering (May 2025). I specialize in deep learning, computer vision, and large language models. I've published research on pose-guided image generation and assistive technology for visually impaired individuals. I enjoy participating in hackathons and won 1st prize in a nationwide AI Agent Hackathon conducted by the Andhra Pradesh Police.

Skills

PyTorch
TensorFlow
Hugging Face Transformers
LangChain
Diffusers
PyTorch3D
YOLO
ControlNet
Python
C++
JavaScript
TypeScript
React.js
Next.js
FastAPI
Flask
Gradio
Docker
Kubernetes
MLflow
Weights & Biases
ONNX Runtime
TensorRT
Model Quantization
Model Parallelism
Data Parallelism
Azure
Google Cloud Platform (GCP)
AWS
MongoDB
MySQL
PostgreSQL
Vector Databases
FAISS
Milvus
Pinecone
CUDA
Flash Attention
Gradient Checkpointing
Mixed Precision Training
Kernel Fusion
Git
Prometheus
Grafana
CI/CD Pipelines
Retrieval-Augmented Generation (RAG)
Instruction Tuning
Fine-tuning Large Language Models
GANs
Diffusion Models
3D Reconstruction
Pose Estimation
My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

wingman-AI

Developed a stealth desktop app for AI-powered coding/interview assistance with advanced process hiding. Integrated Google Gemini and OCR for real-time speech/screen analysis and instant responses. Designed modular, cross-platform (Win/macOS/Linux) architecture and intelligent caching.

TypeScript
Electron
Next.js
Gemini AI
OCR

GSPO-DeepSeek-R1-Distill-Qwen-1.5B

Implemented Group Sequence Policy Optimization (GSPO) algorithm from Qwen Team research, achieving superior stability with 50-75% clipping rates vs 0.01-0.02% for baseline PPO/GRPO methods on reasoning tasks. Engineered complete knowledge distillation pipeline from DeepSeek-R1 to Qwen-1.5B architecture, incorporating 8-bit optimization and gradient checkpointing for memory-efficient training on H100/RTX hardware.

PyTorch
Transformers
Wandb
Flash Attention

Dial 112 AI: Emergency Call Intelligence

Developed production AI system for Andhra Pradesh Police processing 1000+ emergency calls daily, implementing speech-to-text, sentiment analysis, and priority classification. Built real-time geospatial analysis and emergency dispatch optimization, reducing average response time by 25% through intelligent resource allocation.

Speech Recognition
NLP
Real-time Processing
Python

CADify: AI-Powered 3D CAD Generation

Developed multimodal AI system transforming 2D engineering diagrams into 3D CAD models with 92% geometric accuracy using computer vision and LLM integration. Built robust feature extraction pipeline handling technical drawings, sketches, and annotations with advanced OCR and geometric reasoning capabilities.

PyTorch3D
OpenAI
OpenCV
Computer Vision

DeepRE: Deep Reinforcement Learning for Self-Verification

Reproduced DeepSeek R1 Zero achieving 90% functional parity, implementing advanced RL techniques for LLM self-verification on complex reasoning tasks. Engineered distributed training pipeline using VLLM backend and Flash Attention 2, enabling cost-effective training of 3B parameter models on consumer hardware with 40% cost reduction.

PyTorch
VLLM
Ray
Flash Attention

PPAG: Pose-Guided Image Generation

Engineered production-ready pose transfer system integrating 5 ControlNet models, achieving 90% pose accuracy on COCO-Pose benchmark with real-time inference. Implemented advanced prompt engineering pipeline with dynamic negative prompting and attention guidance, improving generation quality by 40% and reducing artifacts by 65%.

PyTorch
ControlNet
Gradio
Diffusers

Owl CLI: OS-Level AI Agent

Built intelligent CLI agent leveraging Google Gemini LLM for natural language to system command translation, enabling conversational OS interaction with 95% command accuracy. Engineered autonomous security auditing system with real-time monitoring, policy violation detection, and automated threat response capabilities.

LLMs
System Integration
Windows APIs
Python
Hackathons

I like building things

During my time in university, I attended 1+ hackathons. People from around the country would come together and build incredible things in 2-3 days. It was eye-opening to see the endless possibilities brought to life by a group of motivated and passionate individuals.

  • A

    AI Agent Hackathon by Andhra Pradesh Police

    Andhra Pradesh, India

    Won 1st prize in nationwide AI Agent Hackathon conducted by Andhra Pradesh Police for developing emergency call intelligence system.
Contact

Get in Touch

Want to chat? Just shoot me a dm with a direct question on twitter and I'll respond whenever I can. I will ignore all soliciting.