Capstone Project:Opioid Abuse Detection System

Detection System Against the U.S. Opioid Abuse Epidemic

During my senior year at Liberty University, I had the opportunity to work directly with the Department of Homeland Security (DHS) and their Criminal Investigations and Network Analysis (CINA) center on a project to help confront the escalating opioid abuse crisis that the United States is facing. This sudden rise in deaths caused by overdose has been rapidly accelerating ever since 2013, with 2020 alone having nearly 92,000 deaths caused by opioid overdose. With this annual rate steadily climbing, the DHS has been testing ideas for systems to help combat this epidemic.

One of these ideas proposed by the DHS CINA center was picked up by Liberty University, and a prototype was developed over the course of nearly 2 years. The idea was to create a system that would scrape data off of social media posts and comments in real time which confirmed the use of opioids in a given geolocation. Using this data, trends could be noticed to detect when an outbreak of opioid abuse was currently occurring and when a future outbreak is imminent. The ability to see current and potential future opioid outbreaks would help the DHS be able to more efficiently confront the opioid crisis.

When I joined the project for my capstone in August 2020, not much work had been accomplished in developing the prototype, but social media streams and data provider queries were fully developed and ready for use. The project team that I joined was therefore tasked with constructing the prototype so that it could be both used and presented for demonstration purposes. Additionally, we were to test the overall security of the developed prototype and report any found vulnerabilities as research for future development.

To create this prototype, me and my team set up over a dozen virtual machines that were connected on a private virtual network. All of these machines were hosted by a server on-site at the university, and they each had an important role in creating the desired product. Some of these machines were responsible for automatically scraping procedurally-generated data from a separate machine that was generating such data from a purchased, open-source, and false dataset. We used a dataset rather than real data so there would be no real identities or corresponding information discovered in our project should the server hosting the prototype ever be infiltrated. Other machines utilized distributed computing to process this data, which was passed off to other machines to generate graphs of the data and perform graph analysis. These graphs would reveal the traffic of individuals abusing opioids as well as their general location to confirm whether an opioid outbreak was currently occurring or was imminent in specific neighborhoods. I cannot go into too much detail concerning the specific technologies used to accomplish this as the DHS is planning on using the same software solutions that we implemented, so I have signed an NDA against disclosing more specific details.

Once the prototype was developed, research was performed to see how accurate a system like this could be in achieving its goal. The model using the generated data was able to accurately predict opioid outbreak zones in real time as the data was being scraped. Additionally, one of the project managers developed their own version of the prototype separate from ours, hosted on a separate server. We then performed penetration testing on both our own prototype and the project manager's prototype to report and outline any potential security vulnerabilities that the DHS may want to be cautious of when adapting their own version of this project.