Data scraping with R
Dates: | 15 June 2022 |
Times: | 13:00 - 15:00 |
What is it: | Workshop |
Organiser: | methods@manchester |
Who is it for: | University staff, Current University students |
Speaker: | Renata Topinkova |
|
To sign up, click here: https://www.eventbrite.co.uk/e/web-scraping-with-r-tickets-265887024247
Overview
In recent years, there has been an increase in interest in collecting and analysing data from online sources among social scientists. Online data can have many shapes and forms – from traditionally offline data made online (e.g., newspaper articles, speeches) to new data (e.g., social media).
Despite the growing interest in the data and the online environment in general, learning to access the data is seldom a part of university curriculums. This workshop will provide an introduction to the two most prominent ways of collecting such data - APIs (application programming interfaces) and screen scraping.
The workshop will include hands-on exercises in R. To get the most out of the workshop, participants should ideally have some prior experience with R (installing and loading packages, assigning variables, using existing functions).
Participants will learn:
- About the characteristics of online data – What are the (dis)advantages?
- How to access the data with both APIs and screen scraping of static websites with R
- To process the data into a structured format
Prerequisites
About the instructor
Renata Topinkova is a PhD candidate in Sociology within the Czech Academy of Sciences. Her domains of interest include data-heavy quantitative research projects, amongst which the study of behaviour on online dating platforms, age homophily in partner preferences and the relation between capital and housework. You can find out more about her research and publications here: https://www.researchgate.net/profile/Renata-Topinkova
Travel and Contact Information