Week 5 Friday – Connecting the Dots: Understanding the Correlation Between Variables in Police Shootings Data Sets

In this blog I mainly talk about the different variables and the correlation between them in the given dataset by the Washington Post. There are two datasets which are giving the information about the fatal police shootings.

The second data set is providing the information of the cops which of them are named for an encounter in an incident. The variables are ID, Name, Type (basically the position of the cops), States, ORI codes, and Total Shootings. As I hover the data, I can mention in my remark that most of them are local police and the highest number of killing by one cop is 129.

In my previous blog, I have given the glimpse of the first data set and mentioned that there are 12 variables and I am looking forward to find out the correlation in between them. I also marked out the key features by which I can relate the frameworks for both the data sets in future.

I also unveil and visualise the data counts as per the shooting happened in year from 2015 to till date. I plot the data on the bar graph where one can easily get the understanding that how much encounters has been happened in each year. I can say that the most killings happened in the year 2022 and the least shooting happened in 2023 till now as compared to previous years. One can refer the picture below for their understanding.

In my next blog, I will provide some extra information related to the variables and the correlation in between them.

Leave a Reply

Your email address will not be published. Required fields are marked *