Big Data UK Crime Analysis

This was a university assignment to analyse, plot and identify trends in a provided dataset on UK crimes.

This project was done on Azure servers in Python via Jupyter Notebooks and used PySpark as well as serveral data science packages such as NumPy, Pandas, Seaborn, MatPlotLib and SKLearn.


This graph shows a clear downward trend in violent crimes between 2010 and 2013, however, when this data is included with the data from 2013 to 2021, there is a very strong, positive trend showing that violent crime has definitely been increasing.

Interestingly, 2013 was the year that the dataset changed the classification from "violent crime" to "violent crime and sexual offences". This means that the data could very possibly be misleading, with violent crime itself decreasing,  and non-violent sexual offences inflating the numbers due to the combined category, with no way to distinguish the two. However, I personally find this unlikely for the increase to be this drastic.