Project Portfolio
A list of my top data science and data analysis projects.
Interactive Crime Map: Analyzing San Francisco Crime Trends
Tools: R, R Shiny App, APIs
Developed an interactive web application that retrieves daily crime data from the San Francisco Police Department's API. Users can view recent crimes on an interactive map and explore additional analytics that show when and where crimes are occurring in the city.
Supercenter Sales Analysis and Customer Segmentation
Tools: Python, MySQL, Power BI
Built a MySQL database of supercenter sales data and analyzed it to uncover customer spending habits. Used clustering algorithms to group customers into distinct segments, informing targeted marketing and business strategy.
Econometric Analysis: The Impact of Prescription Drug Monitoring Programs (PDMPs) on Opioid Overdose Rates
Tools: R
PDMPs are electronic databases that track controlled substance prescriptions and are accessible to healthcare providers. I used panel data regression modeling to examine the association between PDMPs and opioid overdose rates.
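The description above does not say which panel specification was used. As an illustration only, one common setup for this kind of question is a two-way fixed effects model. Every symbol below is an assumption introduced for this sketch, not taken from the project:

```latex
y_{it} = \alpha_i + \gamma_t + \beta \, \mathrm{PDMP}_{it} + \varepsilon_{it}
```

Here $y_{it}$ would be the opioid overdose rate for unit $i$ in period $t$, $\mathrm{PDMP}_{it}$ an indicator equal to 1 when a PDMP is in effect, $\alpha_i$ and $\gamma_t$ unit and time fixed effects, and $\beta$ the association of interest.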
Wine Quality Prediction Using Machine Learning Regression Analysis
Tools: Python, scikit-learn
Built decision tree and random forest regression models to predict wine quality from alcohol percentage, density, color, and pH. Evaluated model performance and provided insights for improving prediction accuracy.
Creating the K-Means Clustering Algorithm from Scratch
Tools: R
I implemented the K-means clustering algorithm without using any built-in clustering functions, to demonstrate how the algorithm works. I also created an animated visualization of the algorithm, iteration by iteration.
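To give a rough idea of what a from-scratch implementation involves, here is a minimal sketch in Python using only the standard library. The project itself is written in R, so this is not the project's code. The function names, the random initialization, the exact-equality convergence check, and the `history` list (the kind of per-iteration record an animation could be drawn from) are all assumptions made for illustration.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def kmeans(points, k, max_iter=100, seed=0):
    """Basic K-means (Lloyd's algorithm) on a list of tuples.

    Returns (centroids, labels, history), where history stores the
    centroids and labels used at each iteration.
    """
    rng = random.Random(seed)
    # Assumption: initialize centroids by sampling k of the input points.
    centroids = rng.sample(points, k)
    labels = []
    history = []

    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        history.append((list(centroids), list(labels)))

        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # An empty cluster keeps its previous centroid.
            new_centroids.append(mean_point(members) if members else centroids[j])

        # Stop once the centroids no longer move.
        if new_centroids == centroids:
            break
        centroids = new_centroids

    return centroids, labels, history
```

Calling `kmeans(points, k=2)` on a list of 2-D tuples returns the final centroids, one cluster label per point, and the per-iteration history. Plotting each entry of `history` in turn is one way to produce a frame-by-frame view of the algorithm.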
Interactive App for Exploratory Data Analysis on Netflix Shows and Movies
Tools: R, R Shiny App
Created an interactive app for exploratory data analysis of the movies on Netflix. Users select inputs to analyze various characteristics, gaining a deeper understanding of the types of movies on the platform.