Projects
A selection of projects demonstrating my experience in data analysis, machine learning, and data-driven problem solving. Each project reflects my ability to work with real-world datasets, build analytical pipelines, and generate actionable insights.
Spotify Data Analytics
Built a comprehensive data analysis pipeline covering data collection, cleaning, database design, statistical analysis, and machine learning. The project explored audio features and user behaviour patterns across multiple countries and time periods, enabling insight generation and visualization.Problem
Understand patterns in music data and identify factors influencing song popularity and user listening behaviour.
Tools
Python, SQL, Pandas, Machine Learning (Regression, Clustering, Classification), Power BI.
Outcome
Developed an end-to-end analytics pipeline and interactive dashboards to generate insights and support data-driven recommendations.
Chronic Disease Prediction & Progression Analysis
Analyzed anonymized patient health data to study chronic disease risk and progression using both exploratory analysis and predictive modeling. The project examined how variables such as smoking, diet quality, gender, biomarkers, and lifestyle factors relate to disease outcomes, while also building interpretable classification models for chronic disease prediction.Problem
Analyze how demographic and lifestyle factors influence chronic disease risk and identify meaningful patterns across patient populations.
Tools
R, tidyverse, ggplot2, caret, recipes, logistic regression, multinomial logistic regression.
Outcome
Combined exploratory healthcare analytics with predictive modeling to identify disease-related patterns and support data-driven interpretation.
Amazon Laptop Data Analysis
Collected real-world product data through web scraping, performed data cleaning and feature engineering, and developed machine learning models to predict laptop prices and analyze feature importance across different product categories.Problem
Analyze laptop pricing and specifications to understand market trends and key pricing factors.
Tools
Python, Selenium, BeautifulSoup, Pandas, Linear Regression, Random Forest.
Outcome
Built predictive models to estimate laptop prices and evaluate how technical specifications affect pricing.