Author: Bernhard Rieder

Call for Participation: Internet Research with Foundation Models?

CAT4SMR is convening its closing workshop on the exploration of generative methods in internet research on September 3, 2024 in Amsterdam. Situated within the Digital Methods Initiative, CAT4SMR has been working on tools for capturing and analyzing social media data, such as 4CAT and the YouTube Data Tools, but also on how to develop and implement new and innovative research methods. 

As the incorporation of large pre-trained models (sometimes referred to as “foundation models”, including Large Language Models, Large Vision Models, etc.) into social sciences and humanities (SSH) research moves from novelty to standard, the need to exchange, evaluate, and critique is becoming pressing. This one-day workshop will serve as a forum for investigating how these advanced computational techniques are being integrated into digital research, what opportunities, limitations, and pitfalls they imply, and what this means for academic research.

Our conversations will center around several themes:

  • The evolution of digital methods in light of foundation models, exploring how tools like LLMs and LVMs augment and transform established methodologies and introduce new ones into digital research, from thematic textual and visual analysis to the study of misinformation;
  • The critical examination of best practices in the deployment of LLMs or LVMs, from the art of effective prompting to the nuances of model selection, fine-tuning, and evaluation, all within the context of internet research;

We are particularly interested in fostering discussions about new epistemologies, ethical implications, and practical issues related to the use of these models in research. This includes the careful examination of models’ black-boxed nature and their (potential) imprint on research, as well as the broader ethical landscape that researchers navigate.

We welcome short proposals (about 500 words) for brief presentations on empirical research projects, methodologies, tools, or critiques aligned with our workshop themes, especially those adopting experimental, explorative, or speculative approaches. Potential topics include, but are not limited to:

  • interesting new methods, advancing visual and textual analysis;
  • effective strategies for employing pre-trained models in internet research;
  • taming black boxes: observing, testing, and deploying pre-trained models for digital research;
  • the evolution of interpretive practices with the integration of LLMs;
  • the role of pre-trained models as surrogate expert readers;
  • the delegation of interpretative agency to pre-trained models;
  • reflections on the role of pre-trained models in, or next to, qualitative and quantitative research;
  • challenges and opportunities in the ‘traceability’ of digital actors and research procedures;
  • researching platformed models;
  • platform research with pre-trained models and their (mis)uses for medium-sensitive studies;

This will be a one-day meeting with presentations and ample room for discussion. We hope to make this an inspiring event by bringing together scholars and practitioners from diverse disciplines, resulting in fruitful collaborations. ​​We have some ability to fund transport and accommodation, and will provide lunch and dinner.

Deadline for submissions: June 7, 2024

Submit proposals to: cat4smr [diddalidoo] list.uva.nl

Notice of acceptance: June 21, 2024

Workshop date: September 3, 2024

Location: Amsterdam City Centre

Organizers: Erik Borra, Sal Hagen, Stijn Peeters, Bernhard Rieder

Invitation – Capture and Analysis Tools for Social Media Research Workshop (May 8, 2023)

When: May 8th, 13:00-16:00

Where: On site in Amsterdam, City Center Campus (location follows per email)

Interested in learning how to work with social media data? Join us for the third workshop of CAT4SMR, the initiative that builds and maintains Capture and Analysis Tools for Social Media Research! 

On May 8th in Amsterdam, we will introduce interested scholars to two powerful tools (4CAT and the YouTube Data Tools) for the data-driven analysis of online platforms such as 4chan, Instagram, Reddit, Telegram, Twitter, TikTok, and YouTube. The goal is to help students (Master’s, PhD) and researchers (all levels) integrate social media analysis in their research and teaching.

The first part (13:00-15:00) will be dedicated to the presentation and discussion of 4CAT and YouTube Data Tools. How and why to use them, how to get up and running, best practices and things to watch out for.

During the second part (15:00 – 16:00), we open the floor to discuss specific research questions and projects, and how to approach them in terms of methodology, logistics, ethics, and so forth. This part of the workshop is optional.

Entry is free, but registration is required. Please sign up here before April 24 to reserve a spot and help us plan the workshop. A limited amount of space is available.

The workshop is organized an facilitated by the CAT4SMR project and the team members Erik Borra, Stijn Peeters, and Bernhard Rieder.

Links:

Signup

Invitation – Capture and Analysis Tools for Social Media Research Workshop (November 22)

(update: due to limited capacity, sign-ups for the workshop are now closed)

When: Nov 22rd, 13:30-17:00 (CET)

Where: Online on Zoom

Interested in learning how to work with social media data? Join us for the second workshop of CAT4SMR, the initiative that builds and maintains Capture and Analysis Tools for Social Media Research! 

On Nov 22nd on Zoom, we will introduce interested scholars to two powerful tools (4CAT and the YouTube Data Tools) for the data-driven analysis of online platforms such as 4chan, Instagram, Reddit, Telegram, Twitter, TikTok, and YouTube. The goal is to help students (Master’s, PhD) and researchers (all levels) integrate social media analysis in their research and teaching.

The first part (13:30-15:30 CET) will be dedicated to the presentation and discussion of 4CAT and YouTube Data Tools. How and why to use them, how to get up and running, best practices and things to watch out for.

In second part (15:45-17:00 CET), we split into small groups and open the floor to discuss specific research questions and projects, and how to approach them in terms of methodology, logistics, ethics, and so forth. This part of the workshop is optional, but for those participating we are asking for a short project description to help us prepare.

Entry is free, but registration is required. Please sign up here before November 8th to reserve a spot and help us plan the workshop. A limited amount of space is available.

The workshop is organized and facilitated by the CAT4SMR project and the team members Erik Borra, Stijn Peeters, and Bernhard Rieder.

Links:

https://tinyurl.com/cat4smr-workshop-november

Location:

Zoom (link will be provided beforehand)            

Invitation – Capture and Analysis Tools for Social Media Research Workshop (May 22)

(update: due to limited capacity, sign-ups for the workshop are now closed)

When: May 23rd, 10:00-16:00

Where: On site in Amsterdam

Interested in learning how to work with social media data? Join us for the first workshop of CAT4SMR, the initiative that builds and maintains Capture and Analysis Tools for Social Media Research! 

On May 23rd in Amsterdam, we will introduce interested scholars to two powerful tools (4CAT and the YouTube Data Tools) for the data-driven analysis of online platforms such as 4chan, Instagram, Reddit, Telegram, Twitter, TikTok, and YouTube. The goal is to help students (Master’s, PhD) and researchers (all levels) integrate social media analysis in their research and teaching.

The morning (10:00-12:30) will be dedicated to the presentation and discussion of 4CAT and YouTube Data Tools. How and why to use them, how to get up and running, best practices and things to watch out for.

In the afternoon (13:30 – 16:00), we split into small groups and open the floor to discuss specific research questions and projects, and how to approach them in terms of methodology, logistics, ethics, and so forth. This part of the workshop is optional.

Entry is free, but registration is required. A catered lunch is included. Please sign up here before May 16th to reserve a spot and help us plan the workshop. A limited amount of space is available.

The workshop is organized an facilitated by the CAT4SMR project and the team members Erik Borra, Stijn Peeters, and Bernhard Rieder.

Links:

https://www.buzzhouse.co

https://tinyurl.com/cat4smr-workshop-signup

Location:

BuzzHouse (BG5)

Oudezijds Achterburgwal 233-237

1012 DL Amsterdam                               

Starting up the project

After years of working on research software development mostly in our spare time, we are very happy that CAT4SMR (Capture and Anaysis Tools for Social Media Research) was funded in May 2020 by the Dutch PDI- SSH (Platform Digital Infrastructure Social Science and Humanities).

We (Erik Borra, Stijn Peeters, and Bernhard Rieder) will be able to use this opportunity to improve and stabilize the social media analysis tools we have been working on over the years: DMI-TCAT, 4CAT, YouTube Data Tools, and others. Who knows, we may even be able to revive Netvizz, which had do close down after changes in Facebook’s API governance.

The project started work in September 2020, beginning with the usual practicalities and some initial improvements to code quality and documentation. 2021 will see much more activity on all fronts. Stay tuned.