Create datasets from a variety of web platforms - including Reddit, Telegram, 4chan and others - and analyze them.
4CAT: Capture and Analysis Toolkit is a tool that can be used to scrape incoming posts on various forums, image boards and web platforms (including Reddit, Telegram, and 4chan) and then process them for further analysis.
It allows one to query a corpus of posts selectively by keyword, date range or other criteria, and then output the results for further analysis. The tool was inspired by the TCAT
, a tool with comparable functionality that can be used to scrape and analyse Twitter data.
Currently, access to DMI's 4CAT instance is restricted. Collaborators can request an account via the instance. It is also possible (and quite straightforward) to install 4CAT for yourself with Docker. The GitHub repository
has more instructions.
The Digital Methods Initiative About Us The Digital Methods Initiative (DMI) is one of Europe's leading Internet Studies research groups. Comprised of new media...
Tool Instagram Scraper How To
This tool interfaces with the API of Instagram to retrieve overviews of posts for a given set of usernames or hashtags. Unlike some of the other DMI tools, this t...
Winter School 2019 Cartoons
Cartoon Politics: How Trump’s Tweets Generate Memes – a Cross Platform Analysiscovfefe! Visual Responses to President Trump’s Infamous Tweet on Reddit, Tumblr and...
Winter School 2019 Normiefication
Understanding Normiefication A Cross Platform Analysis of the QAnon Conspiracy Theory The results from this project are also uploaded on this webpage. Team Membe...
Winter School 2021 Shadow Ban
Am I shadow banned? Studying online discussions on a contested form on content moderation Team Members Laura Savolainen, University of Helsinki Mea Lakso, Univ...