parastoorazavi/sql-challenge

sql-challenge

📝 Table of Contents

🧐 About

The company "Pewlett Hackard" is starting a research project on the people it employed during the 1980s and 1990s. All that remains of the employee database from that period is six CSV files. This project covers three tasks:

  1. Data Modeling
  2. Data Engineering
  3. Data Analysis

🏁 Getting Started

Data Modeling:
Inspect the CSVs and sketch out an ERD of the tables.

Data Engineering:
• Use the information to create a table schema for each of the six CSV files.
• Import each CSV file into the corresponding SQL table.
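The two Data Engineering steps can be sketched as follows. This is a minimal illustration, not the repository's actual schema: it uses Python's built-in `sqlite3` module as a stand-in for PostgreSQL so that it runs anywhere, and the table names, column names, and sample rows are all assumptions made for the example. In PostgreSQL itself the import step would more typically go through the `COPY` command or pgAdmin's import dialog.

```python
import csv
import io
import sqlite3

# Hypothetical schema: table and column names are assumptions for this sketch.
SCHEMA = """
CREATE TABLE departments (
    dept_no   TEXT PRIMARY KEY,
    dept_name TEXT NOT NULL UNIQUE
);
CREATE TABLE employees (
    emp_no     INTEGER PRIMARY KEY,
    first_name TEXT NOT NULL,
    last_name  TEXT NOT NULL,
    sex        TEXT NOT NULL,
    hire_date  TEXT NOT NULL
);
CREATE TABLE dept_emp (
    emp_no  INTEGER NOT NULL REFERENCES employees (emp_no),
    dept_no TEXT    NOT NULL REFERENCES departments (dept_no),
    PRIMARY KEY (emp_no, dept_no)
);
"""


def load_csv(conn, table, csv_text):
    """Insert every data row of a CSV (first row is the header) into `table`.

    The table and column names are interpolated directly, so this helper
    should only be used with trusted, known table names.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    placeholders = ", ".join("?" for _ in header)
    sql = f"INSERT INTO {table} ({', '.join(header)}) VALUES ({placeholders})"
    conn.executemany(sql, reader)


conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite needs this to enforce REFERENCES
conn.executescript(SCHEMA)

# Made-up sample rows standing in for the real CSV files.
# Parent tables are loaded before the junction table that references them.
load_csv(conn, "departments", "dept_no,dept_name\nd001,Sales\n")
load_csv(
    conn,
    "employees",
    "emp_no,first_name,last_name,sex,hire_date\n1,Ada,Byron,F,1986-03-01\n",
)
load_csv(conn, "dept_emp", "emp_no,dept_no\n1,d001\n")
conn.commit()

row_counts = {
    table: conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    for table in ("departments", "employees", "dept_emp")
}
print(row_counts)
```

The load order matters once foreign keys are enforced: tables that are referenced (here `departments` and `employees`) have to be populated before the tables that point at them.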

Data Analysis:

  1. List the following details of each employee: employee number, last name, first name, sex, and salary.
  2. List first name, last name, and hire date for employees who were hired in 1986.
  3. List the manager of each department with the following information: department number, department name, the manager's employee number, last name, first name.
  4. List the department of each employee with the following information: employee number, last name, first name, and department name.
  5. List first name, last name, and sex for employees whose first name is "Hercules" and whose last name begins with "B".
  6. List all employees in the Sales department, including their employee number, last name, first name, and department name.
  7. List all employees in the Sales and Development departments, including their employee number, last name, first name, and department name.
  8. In descending order, list the frequency count of employee last names, i.e., how many employees share each last name.
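A few of the analysis questions above (1, 2, 5, 7, and 8) can be sketched as queries. The example below is only an illustration under stated assumptions: the schema, column names, and rows are invented, and SQLite (via Python's `sqlite3`) stands in for PostgreSQL. The SQL is kept to constructs that behave the same in both, with one caveat: `LIKE` is case-sensitive in PostgreSQL but case-insensitive for ASCII in SQLite.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE employees (emp_no INTEGER PRIMARY KEY, first_name TEXT,
                        last_name TEXT, sex TEXT, hire_date TEXT);
CREATE TABLE salaries (emp_no INTEGER REFERENCES employees (emp_no), salary INTEGER);
CREATE TABLE departments (dept_no TEXT PRIMARY KEY, dept_name TEXT);
CREATE TABLE dept_emp (emp_no INTEGER, dept_no TEXT);
-- Made-up rows for illustration only.
INSERT INTO employees VALUES
    (1, 'Hercules', 'Baer',  'M', '1986-05-12'),
    (2, 'Hercules', 'Stone', 'M', '1991-02-03'),
    (3, 'Mary',     'Baer',  'F', '1986-11-30'),
    (4, 'Ivo',      'Klein', 'M', '1989-07-19');
INSERT INTO salaries VALUES (1, 60000), (2, 40000), (3, 52000), (4, 48000);
INSERT INTO departments VALUES ('d1', 'Sales'), ('d2', 'Development'), ('d3', 'Finance');
INSERT INTO dept_emp VALUES (1, 'd1'), (2, 'd2'), (3, 'd3'), (4, 'd1');
""")

QUERIES = {
    # Question 1: employee details joined with salary.
    "salary_details": """
        SELECT e.emp_no, e.last_name, e.first_name, e.sex, s.salary
        FROM employees AS e
        JOIN salaries AS s ON s.emp_no = e.emp_no
        ORDER BY e.emp_no
    """,
    # Question 2: a half-open date range selects everyone hired in 1986.
    "hired_1986": """
        SELECT first_name, last_name, hire_date
        FROM employees
        WHERE hire_date >= '1986-01-01' AND hire_date < '1987-01-01'
        ORDER BY emp_no
    """,
    # Question 5: exact first name, last name starting with B.
    "hercules_b": """
        SELECT first_name, last_name, sex
        FROM employees
        WHERE first_name = 'Hercules' AND last_name LIKE 'B%'
    """,
    # Question 7: employees in either Sales or Development.
    "sales_and_development": """
        SELECT e.emp_no, e.last_name, e.first_name, d.dept_name
        FROM employees AS e
        JOIN dept_emp AS de ON de.emp_no = e.emp_no
        JOIN departments AS d ON d.dept_no = de.dept_no
        WHERE d.dept_name IN ('Sales', 'Development')
        ORDER BY e.emp_no
    """,
    # Question 8: how many employees share each last name, most common first.
    "last_name_frequency": """
        SELECT last_name, COUNT(*) AS frequency
        FROM employees
        GROUP BY last_name
        ORDER BY frequency DESC, last_name
    """,
}


def run(name):
    """Execute one of the named queries and return all rows."""
    return conn.execute(QUERIES[name]).fetchall()


for name in QUERIES:
    print(name, run(name))
```

Question 6 is the same join as question 7 with the filter narrowed to `d.dept_name = 'Sales'`, and questions 3 and 4 follow the same join pattern through a department-manager or department-employee table.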

🍩 Bonus Part - Import the SQL database into pandas.

  1. Create a histogram to visualize the most common salary ranges for employees.
  2. Create a bar chart of average salary by title.
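The numbers behind both charts can be sketched without any plotting library. The example below deliberately swaps pandas for the Python standard library so that it runs with no extra installs: it computes the histogram bin counts and the average salary per title, then prints them as text bars. The schema, the sample rows, and the 10,000-wide bins are assumptions made for this illustration. In the pandas version, the same data would typically be loaded with `pandas.read_sql` and drawn with the DataFrame plotting methods.

```python
import sqlite3
from collections import Counter

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE titles (title_id TEXT PRIMARY KEY, title TEXT);
CREATE TABLE employees (emp_no INTEGER PRIMARY KEY, emp_title_id TEXT);
CREATE TABLE salaries (emp_no INTEGER, salary INTEGER);
-- Made-up rows for illustration only.
INSERT INTO titles VALUES ('e1', 'Engineer'), ('s1', 'Staff');
INSERT INTO employees VALUES (1, 'e1'), (2, 'e1'), (3, 's1'), (4, 's1');
INSERT INTO salaries VALUES (1, 40000), (2, 48000), (3, 52000), (4, 61000);
""")

BIN_WIDTH = 10_000  # assumed bin size for the salary histogram

# Histogram data: map each salary to the lower edge of its bin, then count.
salaries = [row[0] for row in conn.execute("SELECT salary FROM salaries")]
bin_counts = Counter((salary // BIN_WIDTH) * BIN_WIDTH for salary in salaries)
histogram = sorted(bin_counts.items())

# Bar chart data: average salary per job title.
avg_by_title = conn.execute("""
    SELECT t.title, AVG(s.salary) AS avg_salary
    FROM salaries AS s
    JOIN employees AS e ON e.emp_no = s.emp_no
    JOIN titles AS t ON t.title_id = e.emp_title_id
    GROUP BY t.title
    ORDER BY t.title
""").fetchall()

print("Salary ranges")
for lower_edge, count in histogram:
    print(f"{lower_edge:>7}-{lower_edge + BIN_WIDTH - 1:<7} {'#' * count}")

print("Average salary by title")
for title, avg_salary in avg_by_title:
    print(f"{title:<10} {avg_salary:>10.2f}")
```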

About

The purpose of this project is to model, engineer, and analyze a company's employee data using PostgreSQL, pgAdmin, and an ERD, and then to use pandas to add visualizations to the analysis.
