About

Prajwal Ganugula

Software Engineer and ML Systems Engineer

Tempe, Arizona (Open to relocation) | pganugul@asu.edu

I build dependable software and ML systems end to end. My work spans backend services, data pipelines, and applied ML, including computer vision and LLM/RAG systems. I have shipped production systems in Python, C++, and Java, optimized models for edge devices, and published research at ICCV. I focus on clean system design, rigorous evaluation, and reliable delivery.

Experience

AI Engineer - Data Infrastructure and Platforms

Everest Global Insurance | Remote, US

Oct 2025 - Present
  • Owned a production ML ranking service in Python and C++, exposing gRPC/REST APIs used across internal workflows and increasing straight-through processing of low-risk claims by 18%.
  • Designed and operated batch and online ML pipelines using Airflow, Spark, and Redis, adding metrics, alerts, and SLOs to maintain latency and error budgets during peak traffic.

Machine Learning Engineer

ASU Decision Theater | Tempe, AZ

Dec 2024 - Sep 2025
  • Built and deployed recommendation and retrieval services over 100K+ documents, integrating PyTorch models behind APIs and improving click-through rate by 23% via online experiments.
  • Developed evaluation and experimentation pipelines with dashboards and A/B testing to iterate on ranking quality, feature changes, and user engagement signals.

Software Engineer - Deep Learning Research and Innovation

OPLUS (OPPO-OnePlus) Research and Development | Hyderabad, India

Jun 2022 - Aug 2024
  • Optimized on-device LLM inference pipelines for Edge SoCs using FlashAttention, QLoRA, and INT8 quantization, achieving sub-100ms latency under strict memory and power constraints
  • Delivered production mobile ML features by compressing Stable Diffusion models via distillation and pruning, reducing model size by 50% and accelerating inference by 8x.

Education

Arizona State University

M.S. in Computer Science

Aug 2024 - May 2026

Indian Institute of Technology, Hyderabad

B.Tech in Computer Science and Engineering

Jul 2018 - May 2022

Projects

Real-Time Interactive Video Segmentation (EdgeSAM)

Jan 2026

  • Engineered a zero-shot segmentation pipeline with EdgeSAM + ONNX Runtime, achieving <40ms CPU latency by decoupling the image encoder.
  • Deployed an interactive demo with Streamlit/OpenCV supporting text prompts and click-based tracking for tool isolation and privacy filtering.

AI Eraser for Mobile Photos

OPLUS | 2023

  • On-device object removal with Edge-SAM selection and diffusion inpainting.

Multi-Agent RAG Copilot for Cloud Operations

Personal Project | Aug 2024 - Nov 2024

  • Agentic RAG copilot integrating runbooks, tickets, and live logs.
  • Built with LangChain/LangGraph, Python, FastAPI, and Docker.

Structured Image Captioning with Distributed Training

Mar 2025 - May 2025

  • BLIP-2 + ViT-G pipeline fine-tuned with QLoRA and scaled with DeepSpeed.
  • Multi-GPU training with NCCL and memory-efficient attention.

Feature Store & ML Experimentation Platform

Aug 2024 - Nov 2024

  • Built a shared feature store on PostgreSQL and Redis with versioned schemas and validation, enabling consistent online and offline features across ML services.
  • Developed an experimentation service exposing Python APIs and reusable Airflow jobs, allowing teams to launch ML A/B tests with minimal integration overhead.

Personalized Feed Ranking System

Mar 2025 - May 2025

  • Implemented a two-tower ranking model with user and item embeddings and sampled softmax, improving NDCG@10 by 12% over a logistic-regression baseline.
  • Served models via Python/gRPC services with Redis feature caching and offline training jobs, integrating with an A/B framework to measure engagement impact

Research and Publications

Application for Produced Crop Price Forecasting Through Deep Learning

IRJMETS | Nov 2023

Skills

Languages

Python, C++, Java, SQL, Bash, JavaScript

ML and CV

PyTorch, TensorFlow, Diffusers, TensorRT, ONNX, Segmentation

LLMs and RAG

QLoRA, RAG, LangChain, LangGraph, Evaluation

Platforms

FastAPI, REST, gRPC, Docker, Kubernetes, GitHub Actions

Data and ETL

Airflow, Kafka, Spark, PostgreSQL, Redshift

Cloud

AWS, Azure, Observability, CI/CD

Certificates

Awards

Global SES Code Jam - 3rd Place

OPLUS (OPPO-OnePlus)

Contact