Computer Science in the Majors: Public Policy, International Relations, and Government

Have you ever wanted to learn about web scraping? At this workshop, geared towards Public Policy, IR, and Government majors, we will explain how to use Python tools to extract the current keywords that describe a news topic.

Sep 24, 2020, 11:30 PM – Sep 25, 2020, 12:30 AM

17
RSVP'd

Key Themes

Web

About this event

Event Information

Web scraping is the process of using code to pull data from the internet automatically. In this workshop we will learn how to programmatically pull URLs from a Google search and extract each article's text, authors, publish date, and keywords (using Natural Language Processing). We will then create a graph that shows the most popular keywords describing a current news topic. The same technique can be applied to resources used for Public Policy, IR, and Government research.
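To give a feel for the last step described above (finding the most popular keywords), here is a minimal sketch. It is not the workshop's actual code: the event page does not name the libraries used, so this version relies only on Python's standard library and counts word frequencies instead of using a real Natural Language Processing package. The function names, the tiny stop-word list, and the sample sentences are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
# Real NLP libraries ship much larger lists.
STOP_WORDS = {
    "the", "a", "an", "and", "of", "to", "in", "on", "for", "is",
    "are", "that", "this", "with", "as", "by", "at", "it", "was", "be",
}

def top_keywords(texts, n=5):
    """Return the n most frequent non-stop-words across a list of article texts."""
    counts = Counter()
    for text in texts:
        # Lowercase the text and keep runs of letters (and apostrophes) as words.
        words = re.findall(r"[a-z']+", text.lower())
        # Skip stop words and very short tokens.
        counts.update(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return counts.most_common(n)

def text_bar_chart(pairs):
    """Render (word, count) pairs as a simple text bar chart."""
    return "\n".join(f"{word:<12} {'#' * count} ({count})" for word, count in pairs)

if __name__ == "__main__":
    # Made-up sample text standing in for scraped articles.
    sample_articles = [
        "The climate bill passed.",
        "Climate policy and the climate vote.",
    ]
    print(text_bar_chart(top_keywords(sample_articles, n=3)))
```

In a real scraping pipeline the sample sentences would be replaced by text extracted from news articles, and the text chart by a plotted graph.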

After coming to this workshop you will be able to:

  • Understand how to read from and write to files
  • Format URLs by editing strings with Python
  • Extract information from articles
  • Understand the basics of Natural Language Processing
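The first two skills in the list can be sketched in a few lines of Python. This is an illustrative example under stated assumptions, not material from the workshop: the helper names (build_search_url, save_lines, load_lines) and the file name are hypothetical, and the search URL follows the common pattern of a base address plus a q= query parameter.

```python
import tempfile
from pathlib import Path
from urllib.parse import urlencode

def build_search_url(topic, base="https://www.google.com/search"):
    """Format a search URL by encoding the topic into the query string."""
    # urlencode escapes spaces and special characters so the URL stays valid.
    return f"{base}?{urlencode({'q': topic})}"

def save_lines(path, lines):
    """Write one item per line to a text file."""
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")

def load_lines(path):
    """Read a text file back into a list, skipping blank lines."""
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]

if __name__ == "__main__":
    topics = ["climate policy", "trade agreement"]
    urls = [build_search_url(t) for t in topics]
    # Use a temporary folder so the demo leaves no files behind.
    with tempfile.TemporaryDirectory() as folder:
        url_file = Path(folder) / "urls.txt"  # hypothetical file name
        save_lines(url_file, urls)
        for url in load_lines(url_file):
            print(url)
```

Saving the URLs to a file first means the list can be reused later without repeating the search.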

Don't worry if you have no coding experience! This lesson assumes no background knowledge of coding, and we have a couple of intro videos (20 minutes total) that go over the basics of Python.


Before you attend

While no prior computer science experience is necessary, we have created two videos to help you get started with Kaggle and Python. They are twenty minutes in total.

Organizers

  • Charlie Dobson

    GDSC Lead

  • Alex Batts

    Co-Lead

  • Colleen Polka

    Vice Lead: Workshops

  • Jason LaPierre

    Vice Lead: Consulting

  • Aniedi Udo-Obong

    Regional Leader

  • Cheyenne Hwang

    Co-Lead