Project Portfolio

A list of my top data science and data analysis projects.


Interactive Crime Map: Analyzing San Francisco Crime Trends

Tools: R, R Shiny App, APIs

Developed an interactive web application that uses the San Francisco Police Department's API to retrieve daily crime data, enabling users to visualize recent crimes on an interactive map and explore additional analytics to better understand when and where crimes are occurring in the city.

Link to Code


Supercenter Sales Analysis and Customer Segmentation

Tools: Python, MySQL, Power BI

Created a MySQL database and analyzed supercenter sales data, creating insights on customer spending habits. Employed clustering algorithms to cluster customers into distinct segments, providing insights for targeted marketing and business strategy.



Econometric Analysis: The Impact of Prescription Drug Monitoring Programs (PDMPs) on Opioid Overdose Rates

Tools: R

PDMPs are electronic databases that track controlled substance prescriptions, and are accessible by healthcare providers. I utilized panel data regression modeling to examine the association between PDMPs and opioid overdose rates.

Wine Quality Prediction Using Machine Learning Regression Analysis

Tools: Python, scikit-learn

Created machine learning regression models using decision tree and random forest regression to predict wine quality based on alcohol percentage, density, color, and pH. Evaluated model effectiveness and provided insights for improving prediction accuracy.


Creating the K-Means Clustering Algorithm from Scratch

Tools: R

I created the K-means clustering algorithm without using any built in clustering functions, to demonstrate how the algorithm works. I also create animated visualization of the clustering algorithm iteration by iteration.


Interactive App for Exploratory Data Analysis on Netflix Shows and Movies

Tools: R, R Shiny App

Created an interactive app that conducts an Exploratory Data Analysis on the movies on Netflix, allowing users to select inputs to analyze various characteristics, enabling a deeper understanding of the types of movies on the platform.

Link to Data Cleaning

Link to Code for App