Purpose: Preparing the table for processing in Power BI involves data cleaning, aligning data types, calculating the age and length of service in the company for each employee.
Data Structure:The Human table in SSMS contains data about employees, including their date of birth, first name, last name, date of hire and date of termination (if the employee has left), and personal information about the employee.
--checking data type
SELECT COLUMN_NAME, DATA_TYPE
FROM INFORMATION_SCHEMA.COLUMNS
WHERE TABLE_NAME = 'Human';
}
exec sp_rename '[projects].[dbo].[Human].last_name','Surname','Column';
exec sp_rename '[projects].[dbo].[Human].location','Squard','Column';
UPDATE Human
-- change data type for Birthdate
UPDATE Human
SET birthdate =
CASE
WHEN birthdate LIKE '%-%' THEN CONVERT(DATE, birthdate, 102)
ELSE NULL
END;
ALTER TABLE Human
ALTER COLUMN birthdate DATE;
--Let's do the same for term_date as well:
--change type of termDate
UPDATE Human
SET termdate = CONVERT(date, LEFT(termdate, 10), 126)
WHERE termdate IS NOT NULL AND termdate != '';
ALTER TABLE Human
ALTER COLUMN hire_date DATE;
--And change data type for Age column:
--age selection for PBI
alter table Human
add AgeRange INT;
--calculate age of employee
update Human
set Age = DATEDIFF(YEAR,birthdate,GETDATE());
and calculate the age type of employee as our client wants. For this reason, we can use case to divide all employees of to three age groups
- young, middle and senior:
--add Age type
alter table Human
add AgeType Varchar(20);
--add age type from table
UPDATE Human
SET AgeType =
CASE
WHEN Age BETWEEN 18 AND 30 THEN 'young'
WHEN Age BETWEEN 31 AND 45 THEN 'middle'
WHEN Age > 45 THEN 'senior'
ELSE NULL
END;
-year in the company
--add time column
alter table Human
add Duration VARCHAR(50);
ALTER TABLE Human
ALTER COLUMN Duration VARCHAR(50);
--calculate duration emplyeing in the company
UPDATE Human
SET Duration =
CASE
WHEN termdate IS NULL THEN 'Working'
WHEN DATEDIFF(year, hire_date, termdate) = 0 THEN 'Working'
ELSE CAST(DATEDIFF(year, hire_date, termdate) AS VARCHAR(50))
END;
Our data is ready and waiting to be imported into Power BI. Therefore, we can easily proceed with it.
Power Bi part
1.The dashboard is dedicated to studying hiring and firing trends within the organisation, as well as exploring the reasons why people leave the company. It also investigates whether there are signs of gender inequality or racism.
2. I will not describe simple procedures for calculating the average age of employees or sums. Instead, I want to highlight one transformation - the column "Duration," which describes the length of an employee's tenure at the company, contains both text and numerical values. Therefore, I calculated the average tenure using a calculated column:
Duration_int = FORMAT(
IF(
'Human'[Duration] = "Working",
0,
'Human'[Duration]
),
"0"
)
The findings from the report are as follows: