M.S. Applied Data Science @ USC. Building production-ready AI systems for autonomous mobility and LLM-driven analytics.
๐ About Me
I'm Aryan Maheshwari, an AI Engineer currently pursuing an M.S. in Applied Data Science at the University of Southern California, after completing a B.Tech in Artificial Intelligence and Data Science at K.J. Somaiya Institute of Technology.
My work sits at the intersection of machine learning engineering, data workflows, and applied research. At USC Marshall School of Business, I build LLM-based NLP pipelines over nuclear energy and SMR datasets; at USC AutoDrive Lab, I work on perception, planning, and deep reinforcement learning for autonomous driving. I have also delivered production-focused AI systems at Convexia (YC'25) and Easley Dunn Productions.Inc.
What drives me is converting complex ideas into measurable impact: reducing research-analysis effort with agent-style data pipelines, accelerating model training on CUDA clusters, and building reproducible ML workflows for high-stakes domains.
My technical stack spans Python-first ML engineering with TensorFlow and PyTorch, MLOps and experiment tracking, cloud-native deployment, and modern LLM tooling including LangChain and retrieval pipelines.
I enjoy collaborating across technical and non-technical teams, and I continue to build depth through research publications and certifications in deep learning, GANs, machine learning, and data analytics.
Let's build the future of AI together.
๐ Education
๐๏ธ University of Southern California
2025 โ 2026 Los Angeles, California
Master of Science in Applied Data Science
Focused on advanced machine learning, statistical modeling, and scalable AI systems for real-world applications.
๐ซ K.J. Somaiya Institute of Technology
2020 โ 2024 Mumbai, India
Bachelor of Technology in Artificial Intelligence and Data Science
Comprehensive undergraduate program covering AI fundamentals, machine learning algorithms, data structures, and blockchain technology. Specialized coursework in deep learning, computer vision, and natural language processing.
๐ฏ Pace Junior Science College
2018 โ 2020 Mumbai, India
Higher Secondary Education
Completed higher secondary education with focus on science and mathematics, building the academic base for engineering studies.
๐ฑ VIBGYOR Group of Schools
2004 โ 2018 Mumbai, India
10th Grade, ICSE
Completed secondary education with a strong academic foundation in mathematics, sciences, and languages.
AI-powered educational platform with RAG-based Q&A, quiz/flashcard generation, and real-time chat. Integrates GPT-4, LangChain, Chroma, FastAPI, and Next.js, enabling 10,000+ pages of textbook ingestion with sub-2s query latency and scalable deployment on AWS Fargate.
Developed a scalable multi-level text retrieval framework integrating Gaussian Mixture Models, GPT-4-turbo, BM25, and semantic search (SPIDER), enabling granular content extraction from 300+ page academic textbooks while improving re-ranking precision by 30% with low-latency query execution.
End-to-end histopathology image classification pipeline using MobileNetV2, achieving 92% accuracy on IDC detection. Integrated Grad-CAM for interpretability, deployed DCGAN for synthetic data augmentation, and built interactive Streamlit app for real-time predictions with modular, production-ready architecture.
๐ผ Experience
๐ฌ AI Engineer โ USC Marshall School of Business
Dec 2025 โ Present Los Angeles, CA
Developed LLM-based NLP pipelines to extract, normalize, and semantically index insights from 500+ nuclear energy and SMR-related documents for structured analysis.
Instrumented agent-style data workflows across 10+ public sources with dataset versioning, quality checks, and prompt-level evaluation, reducing manual analysis time by 60%.
Created end-to-end toxicity evaluation pipeline integrating 6 ML models with MLflow tracking, SHAP-based feature visualization, and confidence/disagreement detection across organ-toxicity modules, ensuring 100% reproducibility.
Streamlined infrastructure by modularizing training and inference workflows, implementing structured logging, and reducing CI/CD runtime by 30% through optimized directory design and automated quick-start setup.
MLOpsMLflowSHAPCI/CDPython
๐ฎ Machine Learning Engineer โ Easley Dunn Productions.Inc
Jun 2025 โ Sep 2025 Los Angeles, CA
Built hybrid coach selection engine integrating rule-based constraints with AI-driven scoring models, dynamically evaluating 15+ attributes to improve team matching accuracy by 25% and reduce lineup imbalance by 40%.
Led cross-functional team of 8 interns to design AI-powered sports simulation platform, overseeing model architecture, data pipelines, and deployment to deliver a production-ready system.
Engineered perception, motion prediction, and planning models for autonomous driving; deployed transformer-based generative AI on NVIDIA CUDA clusters, achieving 4 times faster training throughput.
Programmed Deep RL algorithms (PPO, SAC) for vehicle navigation, attaining 0.4 m mean positional deviation across 500+ closed-loop test runs.
Deep RLPPOSACCUDAAutonomous Driving
๐ค AI Engineer โ AGIE AI
Jan 2024 โ Nov 2024 Mumbai, India
Developed and deployed Dialogflow-based chatbot with Vertex AI integration, reducing response latency by 23% (1.3s โ 1.0s) and improving engagement across 2 pilot client campaigns.
Directed a team of 3 interns to deliver Proof of Concept leveraging GPT-3.5 for semantic similarity scoring and NLP-based retrieval, achieving >80% relevance precision for mapping AI research insights to funded startups.
Optimized ML data pipelines for airborne communication systems, reducing latency by 20% (250ms โ 200ms) and improving experiment efficiency by 25% through API integrations with PostgreSQL.
Tuned feature store queries and indexing strategies, cutting execution time by 35% under high concurrency, ensuring scalability for training and batch inference.
Researched AI-driven interview systems, synthesizing insights from 20+ academic papers and open-source projects; engineered preprocessing pipelines boosting sentiment model accuracy by 15%.
Devised and deployed sentiment analysis models with BERT and VADER to classify polarity, leveraging TensorFlow/PyTorch for NLP workflows and advancing team-wide ML capability by 30%.
This publication explores the implementation impact of AI across developing industries, highlighting practical adoption patterns, opportunities, and challenges in real-world systems.
This work examines the Blockchain-Based Election Conducting and Management System (BCECMS) as a framework to improve election integrity, transparency, and process trust through auditable digital workflows.
My research interests span machine learning systems, autonomous mobility, and applied NLP. I am particularly interested in the intersection of reinforcement learning, LLM-driven analysis, and robust real-world deployment.
๐ Recommendations
๐ฌ Contact
Get in Touch
Have a project in mind or want to collaborate? I'd love to hear from you!
Connect With Me
Prefer direct contact? Reach out through any of these channels: