SQL, ETL Pentaho, Data Meodeling & PowerBI
๐ฌ Super Store Data Warehouse
๐ Overview: Powering data-driven decisions with insights into sales, revenue, profit, and order quantities across regions.
๐ฏ Objective: Build a scalable, efficient data warehouse using a Star Schema to support business intelligence and reporting.
๐พ Data Flow:
Source: Structured CSV files
ETL Tool: Pentaho (Extract, Transform, Load)
Database: MySQL
๐๏ธ Schema Design:
๐ Fact Tables: Sales & Returns
๐ Dimension Tables: Date, Territory, Customer, Products
๐ Key Metrics Tracked:
๐ฐ Total Revenue | ๐ Profit | ๐๏ธ Order Quantities | ๐ฅ Customer Segments
Driving smarter decisions through data! ๐ก๐
PowerBI, DAX & Data model
This project involves advanced data analysis using Power BI to gain insights from a Superstore dataset. ๐ After thorough data cleaning and processing, a dashboard was created using DAX and various visualizations. ๐ Key insights include:
1๏ธโฃ Total Customers ๐ฅ
2๏ธโฃ Total Orders ๐ฆ
3๏ธโฃ Products Sold ๐๏ธ
4๏ธโฃ Total Returns โฉ๏ธ
5๏ธโฃ Top-Performing Products and Regions ๐
Python, Seaborn & Matplotlib
This project focuses on advanced data analysis of an E-Commerce dataset using Python. ๐ The data was cleaned by handling null values, correcting datatypes, and removing outliers. ๐งน Visualization charts were created to reveal key insights, including:
1๏ธโฃ Top 5 Highest-Priced Products ๐ฐ
2๏ธโฃ Best Month for Sales ๐
3๏ธโฃ Top 5 Countries by Revenue ๐
This project focuses on analyzing an Apple App Store dataset using SQLite. Despite initial challenges with data import and SQLite's handling of large datasets, I successfully split the data for analysis. ๐ The project involved performing Exploratory Data Analysis (EDA) ๐ง, including checking for unique apps, missing values, app distribution by genre, and analyzing user ratings.
Key Insights ๐: 1๏ธโฃ App Type and Ratings: Examined whether paid apps have higher ratings than free apps.
2๏ธโฃ Apps with Multilingual Support: Explored if apps supporting more languages tend to have higher user ratings.
3๏ธโฃ Highest-Priced Categories: Identified which app categories have the highest prices.
4๏ธโฃ Low-Rated Genres: Checked for genres with low user ratings.
5๏ธโฃ Description Length and User Ratings: Investigated if there's a correlation between app description length and user ratings. ๐๐
This project involves web scraping the Horn Africa Jobs website to gather information on job postings. ๐ The goal is to extract essential job details, such as:
Job Name ๐ผ
Salary ๐ฐ
Location ๐
The extracted data is then saved into a CSV file for further analysis and accessibility. ๐๏ธ This process helps in tracking job opportunities and analyzing trends in the job market, making it easier for job seekers to find relevant positions.
Python, PowerBI & BeautifulSoup
This project involved three key stages: data collection, transformation, and visualization.
1๏ธโฃ Data Collection: Using web scraping, I gathered valuable table data from a website, akin to a pig foraging for truffles. ๐ฝ
2๏ธโฃ Data Transformation: I refined the dataset by removing null values and changing column data types.
3๏ธโฃ Data Visualization: I created three clear visuals showing the top 10 countries by population, countries with the highest median age, and the top 5 countries by land area. ๐๐
Additionally, I automated the process using Python to send the results via email. This project demonstrates how data analysis can uncover valuable insights effectively." ๐
๐ Problem Statement:
The goal is to analyze the dataset to extract meaningful insights regarding employee retention.
๐ Data Analysis
We meticulously cleaned the data by addressing duplicates, null values, and errors. Additionally, we categorized working years into four groups: (1-10 years), (11-20 years), (21-30 years), and (31-40 years). Using pivot tables and pivot charts, we created an interactive dashboard.
๐ก Insights
Employee Count: The current number of employees exceeds those who left. ๐ฅ
Education: Departing employees were more likely to hold bachelor's degrees compared to master's degrees. ๐
Departments: The Research and Development department had the highest turnover, yet it also housed the most employees. ๐ข
Job Roles: The Sales Executive role saw the highest number of departures. ๐
These insights highlight key factors influencing employee retention and turnover. ๐๐๐