__STYLES__
The objective of this Power BI project was to inspect its insights to explore if education level influences the success of data professionals, in a scenario that would capture the attention of business stakeholders.
Source of data set was found from this hyperlink:
Numerous business insightful scenarios can be conceptualized based upon the interactivity of the data professional survey dashboard. However, the business stakeholders were mainly interested in gaining insights into whether education level basis for a data professional to be successful is not a crucial factor.
To answer this primary question for the stakeholders, a range of values would be created for a particular survey question that centered on education levels. The range of values would start at the high school education level and progress to that of a PhD education level. Additionally, the top five earning occupations were brought into consideration which were dependent on the education level being observed. Survey participants also provided the country that they reside in.
Based on data professional survey, the following results:
In the United States, among 36 survey participants holding high school degrees, the average yearly salary of data architects was $85,000.
In the United States, among 16 survey participants holding associate’s degrees, the average yearly salary of data scientists was $115,000.
In the United States, among 329 survey participants holding bachelor‘s degrees, the average yearly salary of data scientists was $93,000.
In the United States, among 192 survey participants holding masters degrees, the average yearly salary of data scientists was $104,000.
In the United States, among 5 survey participants holding PhD‘s, the average yearly salary of data scientists was $206,000.
Special mention:
Survey participants chose not to select their education level; however, data results will be shown.
In the United States, among 52 survey participants holding undetermined education level, the average yearly salary of data engineers was $98,000.
Based on the insights derived from the data above, business stakeholders can be informed that the top five occupations on a education level basis for a data professional to be successful is not a crucial factor due to earning higher than average yearly salaries in the United States.
Other possible questions that stakeholders may think of based on the information given in the dashboard:
Special mention:
The yearly salary data found in the data set was configured to United States dollars. So, I used the following website to convert $93,000 United States dollars to 7,644,693.00 rupees
https://usd.currencyrate.today/convert/amount-93000-to-inr.html
Imported the data set into Microsoft Excel to preview the data columns. Afterwards, Microsoft Power BI was utilized for data analysis and data visualization purposes.
Raw data set contained 28 columns and 630 rows.
The following column names were found in the data set:
Upon loading the Microsoft Excel workbook into Microsoft Power BI, I selected the Power BI option to transform data. By doing this, I was directed to the Power Query Editor.
Special mention:
"Power BI Desktop also comes with Power Query Editor. Use Power Query Editor to connect to one or many data sources, shape and transform the data to meet your needs, then load that model into Power BI Desktop."
Source of this special mention is found in the hyperlink below:
Query overview in Power BI Desktop - Power BI | Microsoft Learn
The very first step in the data cleaning process involved selecting the entire data set and then used the format command followed by the trim and clean features.
By selecting the trim and clean features, I have ensured that no invisible character values are not found anywhere within the data set.
I deleted the following empty columns in Power Query Editor:
Six steps were used in the standardizing data process with column: Q1 - Which Title Best Fits your Current Role?
1.) Duplicate column:
2.) Split column by delimiter value:
3.) Split new column by delimiter value:
4.) Split new column by delimiter value:
5.) Renamed data values for Student/Looking/None:
6.) Viewed all data values within renamed column:
Three steps were used in the standardizing data process with the split column: Q1 - Which Title Best Fits your Current Role? - Copy
1.) Renamed new column:
2.) Standardize data values by renaming to proper job titles:
3.) Viewed all data values within renamed column:
The next column to be standardized was the Q3 - Current Yearly Salary (in USD).
Upon viewing this column, I discovered the data values were in a range rather than whole numbers. As a result, I decided to use DAX functionality within Power Query Editor to obtain the average values.
Special mention:
"Data Analysis Expressions (DAX) is a library of functions and operators that can be combined to build formulas and expressions in Power BI, Analysis Services, and Power Pivot in Excel data model.."
Source of this special mention is found in the hyperlink below:
Six steps were used in the standardizing data process with column: Q3 - Current Yearly Salary (in USD).
1.) Duplicate column:
2.) Split column by delimiter value:
3.) Replaced values within column Q3 - Current Yearly Salary (in USD):
4.) Created a new custom column:
Before new custom column was created, #Q3 - Current Yearly Salary (in USD) - Copy.1 and #Q3 - Current Yearly Salary (in USD) - Copy.2 columns were changed to a different data type. Selected whole as the new data type.
5.) Deleted columns
6.) Viewed all data values within renamed column:
Two steps were used in the standardizing data process with column: Q11 - Which Country do you live in?
I filtered for the countries that users specified. 21 rows were found. Examples of countries that survey participants listed were:
However, I also filtered for countries that were preselected from the survey. 26 rows were found. The preselected countries were:
Overall, I decided to only use the preselected countries due to more rows of data for analysis.
1.) Split column by delimiter value:
2.) Viewed all data values within column:
Overview of designing Power BI dashboard elements:
Dashboard header:
Card 1:
Card 2:
Chart 1:
Chart 2:
Chart 3:
Chart 4:
Chart 5:
When thinking of the color scheme for this Power BI Dashboard, I decided to select the accessible theme called Tidal. By choosing this particular theme, the colors utilized throughout the Power BI dashboard will be easy on the eyes to as many viewers as possible.
The Power BI dashboard data elements have been organized in a manner that illustrates helpful statistical information that was collected from the data professional survey.
Please feel free to reach out to me on LinkedIn if you have any comments or questions.
Lastly, thank you very much for viewing this Power BI data project.