Vivek Varikuti

Hi, I'm Vivek 👋

AI Engineer and Researcher. I love building intelligent systems and solving complex problems with cutting-edge technology.

About

Graduated with a B.Tech in AI and ML from Usha Rama College of Engineering (May 2025). I specialize in deep learning, computer vision, and large language models. I've published research on pose-guided image generation and assistive technology for visually impaired individuals. I enjoy participating in hackathons and won 1st prize in a nationwide AI Agent Hackathon conducted by the Andhra Pradesh Police.

Work Experience

ai4APPolice

Aug 2025 - Present

AI Engineer

Building cutting-edge AI tools for public safety and law enforcement to enhance community protection and improve police operations. Developing intelligent systems like Dial 112 AI for emergency response optimization, implementing advanced machine learning algorithms for crime prediction and resource allocation. Creating AI-powered solutions that bridge the gap between technology and public service, making communities safer through innovative applications of artificial intelligence and data science.

GGS Information Services

Dec 2024 - Feb 2025

Machine Learning Intern

Engineered production-ready 3D compression algorithms achieving 60% model size reduction while maintaining 95% geometric accuracy, deployed across 15+ enterprise clients with 40% rendering speed improvement. Built end-to-end AI pipeline for 2D-to-3D conversion using custom GANs, processing 500+ STEP files daily with 85% accuracy and reducing manual CAD modeling time by 70% for engineering teams. Implemented MLOps infrastructure with real-time monitoring dashboards tracking model drift, performance degradation, and inference latency, achieving 99.5% uptime in production.

Education

Usha Rama College of Engineering

Dec 2021 - May 2025

Bachelor of Technology in AI and ML

Skills

LangChain

LangGraph

RAG Pipelines

Agent Orchestration

Finetuning

VLLM

Python

FastAPI

Microservices

REST APIs

MongoDB

PostgreSQL

Redis

Vector Databases

Pinecone

Azure

CI/CD

Weights & Biases

Docker

Kubernetes

PyTorch

TensorFlow

scikit-learn

XGBoost

Hugging Face

Diffusers

My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

wingman-AI

2025

Developed a stealth desktop app for AI-powered coding/interview assistance with advanced process hiding. Integrated Google Gemini and OCR for real-time speech/screen analysis and instant responses. Designed modular, cross-platform (Win/macOS/Linux) architecture and intelligent caching.

TypeScript

Electron

Next.js

Gemini AI

OCR

Source

GSPO-DeepSeek-R1-Distill-Qwen-1.5B

2025

Implemented Group Sequence Policy Optimization (GSPO) algorithm from Qwen Team research, achieving superior stability with 50-75% clipping rates vs 0.01-0.02% for baseline PPO/GRPO methods on reasoning tasks. Engineered complete knowledge distillation pipeline from DeepSeek-R1 to Qwen-1.5B architecture, incorporating 8-bit optimization and gradient checkpointing for memory-efficient training on H100/RTX hardware.

PyTorch

Transformers

Wandb

Flash Attention

Source

Dial 112 AI: Emergency Call Intelligence

2025

Developed production AI system for Andhra Pradesh Police processing 1000+ emergency calls daily, implementing speech-to-text, sentiment analysis, and priority classification. Built real-time geospatial analysis and emergency dispatch optimization, reducing average response time by 25% through intelligent resource allocation.

Speech Recognition

NLP

Real-time Processing

Python

Source

CADify: AI-Powered 3D CAD Generation

2024

Developed multimodal AI system transforming 2D engineering diagrams into 3D CAD models with 92% geometric accuracy using computer vision and LLM integration. Built robust feature extraction pipeline handling technical drawings, sketches, and annotations with advanced OCR and geometric reasoning capabilities.

PyTorch3D

OpenAI

OpenCV

Computer Vision

Source

DeepRE: Deep Reinforcement Learning for Self-Verification

2025

Reproduced DeepSeek R1 Zero achieving 90% functional parity, implementing advanced RL techniques for LLM self-verification on complex reasoning tasks. Engineered distributed training pipeline using VLLM backend and Flash Attention 2, enabling cost-effective training of 3B parameter models on consumer hardware with 40% cost reduction.

PyTorch

VLLM

Ray

Flash Attention

Source

PPAG: Pose-Guided Image Generation

2025

Engineered production-ready pose transfer system integrating 5 ControlNet models, achieving 90% pose accuracy on COCO-Pose benchmark with real-time inference. Implemented advanced prompt engineering pipeline with dynamic negative prompting and attention guidance, improving generation quality by 40% and reducing artifacts by 65%.

PyTorch

ControlNet

Gradio

Diffusers

Owl CLI: OS-Level AI Agent

2025

Built intelligent CLI agent leveraging Google Gemini LLM for natural language to system command translation, enabling conversational OS interaction with 95% command accuracy. Engineered autonomous security auditing system with real-time monitoring, policy violation detection, and automated threat response capabilities.

LLMs

System Integration

Windows APIs

Python

Source

Hackathons

I like building things

During my time in university, I attended 1+ hackathons. People from around the country would come together and build incredible things in 2-3 days. It was eye-opening to see the endless possibilities brought to life by a group of motivated and passionate individuals.

2025

AI Agent Hackathon by Andhra Pradesh Police

Andhra Pradesh, India

Won 1st prize in nationwide AI Agent Hackathon conducted by Andhra Pradesh Police for developing emergency call intelligence system.

Contact

Get in Touch

Want to chat? Just shoot me a dm with a direct question on twitter and I'll respond whenever I can. I will ignore all soliciting.