In this post, I’m going to show you a Python script that visualizes a dataset in order to answer specific questions, something you’d do very often as a data analyst or data scientist. While this is not an advanced topic, it should give you an idea of how to get things done using Python.
NOTE: This is neither a tutorial nor a “how-to” guide, just a completed exercise that I’m sharing.
In the script, I’ve used the following Python modules:
- Pandas
- NumPy
- Matplotlib
We’ll be working with a dataset on the job outcomes of students who graduated from college between 2010 and 2012. Each row in the dataset represents a different major in college and contains information on gender diversity, employment rates, median salaries, and more.
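To give a rough idea of what this kind of exploration can look like, here is a minimal sketch. It is not the code from the notebook: the rows, the column names (`major`, `share_women`, `median_salary`) and the question being asked are all made up for illustration, and the real dataset would normally be loaded from a file rather than typed in by hand.

```python
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Hypothetical stand-in data: one row per major, as in the real dataset.
# Column names and values are assumptions made for this sketch.
majors = pd.DataFrame({
    "major": ["Major A", "Major B", "Major C", "Major D"],
    "share_women": [0.12, 0.45, 0.80, np.nan],
    "median_salary": [60000, 45000, 35000, 40000],
})

# Drop rows with missing values so they don't cause trouble when plotting.
clean = majors.dropna()

# Example question: do majors with a larger share of women
# tend to have lower median salaries?
ax = clean.plot(
    kind="scatter",
    x="share_women",
    y="median_salary",
    title="Median salary vs. share of women",
)

# A single number to go with the picture: the Pearson correlation.
correlation = clean["share_women"].corr(clean["median_salary"])
print(f"Rows kept: {len(clean)}, correlation: {correlation:.2f}")
```

In a Jupyter notebook the scatter plot shows up right under the cell; in a plain script you would call `plt.show()` at the end to display it.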
To make this more practical and realistic, I’ve used a Jupyter notebook and uploaded it to my GitHub repository. You’ll be able to download the dataset from there as well, if you wish to practice with it.
Click to jump into the notebook.
—
Don’t forget to Like (if you like this) and share!
Follow me @iHijazi to stay updated with everything that I share.