ISCA's 2025 DML Walkthrough: How to Create a Discourse Dataset from X/Twitter (Challenge #1)

Author: ISCA Indiana University

Uploaded: 2025-07-29

Views: 104

Description: This tutorial offers a complete walkthrough of Challenge #1 of the 2025 Datathon and Machine Learning Competition on Antisemitism, hosted by Indiana University’s Institute for the Study of Contemporary Antisemitism (ISCA). It guides participants through the key steps for collecting, processing, and annotating social media data from X (formerly Twitter) as part of a hands-on hate speech detection task.

The recording covers all technical and methodological components, including how to search and scrape posts using Bright Data, pre-process the data in Google Colab, and structure your annotations using the Annotation Portal.
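As a rough illustration of what a pre-processing step between scraping and annotation can look like, here is a minimal Python sketch. It is not the Colab script linked below, whose contents are not reproduced here. The column names `id` and `text`, the helper names, and the sample data are all assumptions made for the example; an actual Bright Data export may be structured differently.

```python
import csv
import io
import re

# Hypothetical column names; a real Bright Data export may use different ones.
ID_COL = "id"
TEXT_COL = "text"


def normalize_text(text):
    """Collapse runs of whitespace (including newlines) into single spaces."""
    return re.sub(r"\s+", " ", text).strip()


def preprocess(rows):
    """Drop rows with a missing ID or empty text, and drop repeated IDs.

    The first occurrence of each ID is kept.
    """
    seen = set()
    cleaned = []
    for row in rows:
        post_id = (row.get(ID_COL) or "").strip()
        text = normalize_text(row.get(TEXT_COL) or "")
        if not post_id or not text or post_id in seen:
            continue
        seen.add(post_id)
        cleaned.append({ID_COL: post_id, TEXT_COL: text})
    return cleaned


# Tiny made-up sample standing in for a scraped CSV export.
SAMPLE_CSV = 'id,text\n1,"Hello\n  world"\n1,"Hello world"\n2,""\n3,  Another   post \n'

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
result = preprocess(rows)
for post in result:
    print(post)
```

In this sketch, deduplicating on the post ID rather than on the text keeps distinct posts that happen to share wording, which may or may not match what a given annotation task needs.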

All necessary tools, links, and templates are provided below to support an independent and structured workflow.

Chapters:
01:00 Agenda & Objectives

01:47 Prerequisites & Setup

02:26 What is the Datathon?

02:57 Instructions for Challenge #1

03:53 Working With X's Advanced Search Function

08:34 Working With Bright Data

11:28 Working With Google Colab

17:28 Annotation Portal Walkthrough

23:02 Setting Up An Annotation Scheme

29:02 Download Annotated Dataset

Useful Resources and Links

Annotation Portal: https://annotate.osome.iu.edu

Challenge Description (PDF): https://github.com/damieh1/datathon_2...

GitHub ReadMe (tools & setup): https://github.com/AnnotationPortal/D...

Bright Data Scraper (no code): https://brightdata.com/cp/scrapers/no...

Google Colab (preprocessing script): https://colab.research.google.com/git...

Download the full slide deck: https://github.com/damieh1/datathon_2...
