Compilation of personal and online courses projects. This portfolio, as a whole, aims to demonstrate proven experience in Data Science principals including obtaining/cleaning data, building Extract, Transform, Load, (ETL) pipelines, Exploratory Data Analysis (EDA), and building and validating Machine Learning models.
In this project of the Power BI dashboard provides a comprehensive analysis of a business’s aggregated values over time, across regions, and by product type. Its interactive features allow stakeholders to explore trends in revenue, costs, and profit margins. This dashboard consolidates essential business metrics into one easy-to-use tool, offering multiple ways to view and compare financial data for smarter decision-making.
In this project of the Airline have been seen the high numbers of Dissatisfied Passengers. Satisfaction of Passengers are one of the concern of the airline. We are one of those airlines who focuses on both Business Travellers and Personal Travellers and our majority of the customers are Loyal Customers.
In this project, I will load and manipulate a music app dataset similar to Spotify with Spark to engineer relevant features for predicting churn. Where Churn is cancelling their service altogether. By identifying these customers before they churn, the business can offer discounts and incentives to stay thereby potentially saving the business revenue.
We will implement and test out different machine learning algorithms and the best methods have been used, here are:
For this project I was interested in predicting customer churn for a fictional music streaming company: Sparkify.
The project involved:
I was curious to look into the AirBnB dataset for Seattle. I needed to discover more about pricing patterns, customer feedback, and pricing forecasting. Some of the questions I’ve looked into are:
This project focuses on analyzing interactions between users and articles on the IBM Watson Studio platform. New article recommendations are made to users based on their interactions with articles. Based on the data available, we can use various methods to make these recommendations. The methods used here are Rank Based, Collaborative Filtering, and Matrix Factorization.
Data Analytics:
The findings show that the new and old pages have roughly equivalent chances of converting users, based on the statistical tests we conducted, the Z-test, logistic regression model, and actual difference identified. The null hypothesis is not rejected. I advise the e-commerce business to retain the old page. This would save you time and money by avoiding the need to establish a new website.
The dataset that I will be wrangling (and analyzing and visualizing) is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people’s dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because “they’re good dogs Brent.” WeRateDogs has over 4 million followers and has received international media coverage. The Tweet Image Predictions, i.e., what breed of dog (or other object, animal, etc.) is present in each tweet according to a neural network. This file (image_predictions.tsv) is hosted on Udacity’s servers and should be downloaded programmatically using the Requests library and the following URL: https://d17h27t6h515a5.cloudfront.net/topher/2017/August/599fd2ad_image-predictions/image-predictions.tsv