I wrote python script to fetch Ethnicity and Race data of all counties for year 2015-2023 from data.census.gov using API key.
I transformed the data and calculated race distribution of races in each county per year.
In Washington police shooting data, there were 4692 records where county was missing. I impute the counties with the help of “geopy” library with city and state fields in data.
I transformed the Washington police shooting data and calculated shot race distribution of races in each county per year.
Next steps will be working on how to compare them to have a better idea of counties where a particular racial group experiences a disproportionate number of shootings despite not being the majority of the population.
I have to go through extensive the documentation of United States Census Bureau https://data.census.gov/ for fetching data.
It took me so much time, to understand the metadata and the specific codes by which the data is provided for races and ethnicities.
I have made a python program to fetch data using API, for specific county, year combination and also modifying the code to fetch data for all countries from 2015-2023 time period.
I want to identify counties where a particular racial group experiences a disproportionate number of shootings despite not being the majority of the population.