Hello, I'm Riyanshi Bohra

About Me

"Without data, you're just another person with an opinion."

— W. Edwards Deming

Hi! I'm Riyanshi, a Master's student in Data Science graduating in May 2025. I specialize in predictive analytics and machine learning, with expertise in LangChain, TensorFlow, PyTorch, and AWS Cloud. My project experience spans healthcare to fintech optimization, with a focus on building scalable ML solutions using advanced analytics.

Through my projects, I've built ML pipelines, LLM-powered applications, and AI-driven solutions for real-world challenges. From designing neural networks for computer vision to leveraging retrieval-augmented generation (RAG) for intelligent AI systems, I thrive on turning complex data into impactful models. My experience with LLMs and transformers has helped me push the boundaries of NLP, automation, and AI-driven insights.

3+ Years of Academic Research
15+ Projects Completed
8+ Certifications

What I Do

Predictive Analytics
Generative AI
Deep Learning

Current Focus

  • Generative AI Research: Experimenting with LLMs and exploring their applications in data analysis and automation.
  • Large-Scale ML Systems: Architecting end-to-end machine learning pipelines for real-time data processing and automated model deployment at scale.
  • Innovating in Public Health: Exploring data-driven techniques to uncover insights that can save lives.

My Journey Timeline

May 2025

University of Arizona (Expected)

Master of Science in Data Science

Expected graduation: May 2025

Specializing in Deep Learning & Generative AI

  • Current GPA: 3.8/4.0
  • Graduate Research Intern @ HUM Lab
  • Selected from a 100+ cohort to attend GHC 2024 and represent the department.
  • Presented research on AI-driven distracted driving detection at iShowcase 2024.

Jan 2024

HUM Lab, University of Arizona (Current)

Research Professional II

  • Presented research on urban heat equity and shade access at the Southwest Urban Integrated Field Laboratory (SW-IFL) conference.
  • Leading geospatial data analysis for public health research.
  • Publications: Co-authoring 2 impactful research papers, currently under review, addressing public health and equity.
  • Research contributed to a 20% increase in funding for urban shade structures in low-income neighborhoods.

Aug 2023

University of Arizona

Started my Master's in Data Science Program

  • Engaged in coursework on AI, Advanced Machine Learning, and Data Mining.
  • Selected to present at the first MS DS Lightning Talks in my first semester.
  • Collaborated with peers on projects exploring AI and data engineering solutions.

May 2023

Manipal University Jaipur

Bachelor of Technology (B.Tech) in Information Technology

Minor in Data Science

  • Graduated with distinction (Top 3%)
  • Published research on sentiment analysis for early mental illness detection, presented at ICCIS 2022.
  • Recognized for academic excellence and selected for an exchange program at the University of Florida for Spring 2023.

Jan 2023

University of Florida

Senior Certificate Program in Computer Science & Engineering.

  • Achieved a perfect 4.0 GPA.
  • Gained global academic exposure, collaborating with esteemed professors and diverse peers.
  • Explored advanced concepts in UX design, RDBMS, and software engineering.

Jul 2022

PricewaterhouseCoopers (PwC)

Data Science Intern

  • Developed predictive models for pharmaceutical manufacturing, reducing variability by 32% with $75K annual savings.
  • Built a Python/SQL data pipeline that reduced processing time and boosted accuracy by 40%.
  • Created a Power BI dashboard for real-time monitoring, driving data-based decision making.
  • Secured company-wide model deployment through an executive presentation.

Jul 2019

Manipal University Jaipur

Started my B.Tech in Information Technology

  • Laid the foundation in programming, data structures, and software engineering.
  • Joined the Computing Society, initiating a passion for coding and innovation.
  • Set my sights on becoming a data scientist, taking the first steps toward my dream with a Minor in Data Science.

Featured Projects

DataLens: Intelligent Chart Generator

A chart generator that converts natural language queries into dynamic visualizations using LangGraph, OpenAI's GPT models, and web scraping with Tavily and BeautifulSoup.

GitHub

VideoMind AI: Videos to Insights

A RAG-based video analysis system built with Python, integrating OpenAI's APIs, Whisper transcription, and Pinecone vector storage for intelligent video content exploration.

GitHub

DocTalk: Your AI Document Assistant

A voice-enabled research assistant that integrates PDF parsing, LangChain RAG with Pinecone, Whisper, and ElevenLabs API for interactive academic insights.

GitHub

Biometric Predictors of Emotional States

A deep learning project that classifies emotional states from biometric data using advanced ML models.

GitHub

Portfolio+: Virtual Trading Platform

A comprehensive stock portfolio management system with real-time data tracking, user authentication, and performance analytics using Flask and Yahoo Finance API.

GitHub

SafeDrive-AI: Distracted Driving Detection

AI-powered system for real-time detection of distracted driving behavior.

GitHub

Metropolitan Climate Profiling

Analysis of Urban Heat Islands using advanced data science techniques.

GitHub

Mental Health Analysis on Social Media

Sentiment Analysis and visualization of mental health indicators in social media data.

Publication

Fetal Health Classification

ML model to classify fetal health states using cardiotocogram data, aiding in prevention of child and maternal mortality.

GitHub

My Resume

Education

University of Arizona

Master of Science (M.S.)

Data Science

August 2023 - May 2025

GPA: 3.8 / 4.0

Tucson, Arizona

Deep Learning & Neural Networks
Applied NLP
Data Warehousing in the Cloud
Artificial Intelligence (AI)

University of Florida

Senior Certificate Program

Computer Science

January 2023 - May 2023

GPA: 4.0 / 4.0

Gainesville, Florida

Advanced Data Structures
Algorithm Design & Analysis
Software Engineering
Database Management Systems

Manipal University Jaipur

Bachelor of Technology (B.Tech)

Information Technology

July 2019 - May 2023

GPA: 3.8 / 4.0

Jaipur, India

Data Mining & Warehousing
Big Data Analytics
Object-Oriented Programming
Natural Language Processing

Experience

Jan 2024 - May 2025

Research Intern

HUM Lab, Tucson, Arizona
University of Arizona, Mel and Enid Zuckerman College of Public Health

Key Achievements

  • Developed a Random Forest-based shade classification model achieving 87% accuracy using 500+ image training samples.
  • Engineered NDVI, NISI, and GLCM-based features, boosting classification accuracy by 20% across 50+ metrics.
  • Integrated demographic and socioeconomic census data for 1,000+ schools, applying geospatial analysis to identify disparities.
  • Correlated 30+ geospatial and demographic factors to model shade access disparities with actionable insights.

Tools & Technologies

R, Google Earth Engine (GEE), Python, ArcGIS, JavaScript, SQL, Excel, API Integration

Key Skills

Geospatial Modeling, Spatial Analysis, Data Integration, Image Classification, Machine Learning

Jul 2022 - Sep 2022

Data Science Intern

PricewaterhouseCoopers (PwC), Mumbai, India

Key Achievements

  • Developed predictive models using regression techniques, reducing batch variability by 32% and stabilizing production across 12 units.
  • Optimized yield with machine learning, increasing production efficiency by 20% and saving $75,000 annually.
  • Built dynamic dashboards for real-time KPI tracking, reducing decision latency by 35% and boosting operational insights.
  • Improved defect detection accuracy by 25% through advanced feature engineering, minimizing product defects to < 3% per batch.

Tools & Technologies

Python, Power BI, Apache Hadoop, ETL, Docker, Jupyter Notebook, SQL, Git/GitHub

Key Skills

Predictive Modeling, Data Pipeline Design, Data Visualization, Process Optimization, Workflow Automation

Professional Certifications

LangChain & Vector Databases in Production

Activeloop (Issued Feb 2025)
LangChain, Generative AI, Vector Databases, LLMs, OpenAI APIs

Multi AI Agent Systems with crewAI

CrewAI (Issued Feb 2025)
Agentic AI, Generative AI, LangChain, Large Language Models, Multi-Agent Systems

Machine Learning with Python

IBM (Issued Sep 2021)
Machine Learning Algorithms, Scikit-learn, Supervised Learning, Model Evaluation, Python Libraries

Python for Data Science, AI & Development

IBM (Issued Jul 2021)
Data Analysis, NumPy, Pandas, APIs, Data Visualization

AI For Everyone

DeepLearning.AI (Issued Jun 2021)
AI Strategy, Deep Learning, Neural Networks, AI Ethics, AI Applications

Crash Course on Python

Google (Issued Jun 2021)
Python Fundamentals, Object-Oriented Programming, Data Structures, Automation

Programming for Everybody (Getting Started with Python)

University of Michigan (Issued Jun 2021)
Python Programming Basics

Python (Basic)

HackerRank (Issued Jun 2021)
Problem Solving, Algorithms, Data Structures, Code Optimization

Building AI Powered Chatbots

IBM (Issued Jun 2021)
Natural Language Processing, Watson Assistant, Conversational AI, Dialog Design

Technical Expertise

🐍

Python

Expert
NumPy, Pandas, Scikit-learn, Matplotlib, Seaborn, Plotly

💾

SQL

Advanced
MySQL, PostgreSQL, SQLite, SQLAlchemy, Snowflake

📈

R

Intermediate
tidyverse, ggplot2, tidycensus, sf, tidymodels, plotly, shiny

💻

JavaScript

Intermediate
Node.js, Next.js, React.js, RESTful APIs

🧠

Deep Learning

Expert
TensorFlow, PyTorch, Keras, Hugging Face Transformers, CNNs, RNNs, Transformer Models, Attention Mechanisms

🤖

Large Language Models (LLMs)

Advanced
LangChain, LangGraph, Retrieval-Augmented Generation (RAG), Pinecone, Deep Lake, ChromaDB

⚙️

Model Deployment & DevOps

Advanced
FastAPI, Flask, Streamlit, MLflow, Docker, Kubernetes, Git, Jenkins, CI/CD

💾

Data Warehousing

Intermediate
Snowflake, Amazon Redshift, Google BigQuery, Data Modeling, ETL Pipelines, Data Integration

🔄

Data Pipelines

Intermediate
Apache Airflow, Apache Beam, Apache Kafka, Apache NiFi, Data Streaming, Data Orchestration

☁️

Cloud Computing

Advanced
AWS, Azure, Google Cloud, Serverless Architecture, Cloud Security, Infrastructure as Code

📊

Business Intelligence & Dashboarding

Expert
Microsoft Power BI, Tableau, Looker, Interactive Dashboards, Data Storytelling, KPI Monitoring

📑

Spreadsheet & Data Processing

Advanced
Microsoft Excel, Pivot Tables, VBA Macros, Advanced Formulas, SQL Analytics, Window Functions, CTEs, Query Optimization

🔍

Statistical & Exploratory Analysis

Advanced
Hypothesis Testing, ANOVA, Chi-Square, Anomaly Detection, DBSCAN, Time Series Analysis, ARIMA

Latest Articles

Exploring data science, AI, and technology through in-depth articles

Featured Talks

MS DS Lightning Talks - 2023 & 2024

Presented at the MS DS Lightning Talks at the College of Information Science, University of Arizona for two consecutive years (2023-2024).

Get In Touch

Let's Connect!

If you're looking for a motivated, adaptable, and skilled team member, I'd love to hear from you.