Asking For It: Question-Answering for Predicting Rule Infractions in Online Content Moderation

Mattia Samory; Diana Pamfile; Andrew To; Shruti Phadke

doi:10.48550/arxiv.2510.06350

Preprint

ArXiv.org

07 Oct 2025

DOI: https://doi.org/10.48550/arxiv.2510.06350

Files and links (1)

https://arxiv.org/pdf/2510.06350

Abstract

Computer Science - Artificial Intelligence

Computer Science - Computation and Language

Computer Science - Computers and Society

Computer Science - Human-Computer Interaction

Computer Science - Learning

Online communities rely on a mix of platform policies and community-authored rules to define acceptable behavior and maintain order. However, these rules vary widely across communities, evolve over time, and are enforced inconsistently, posing challenges for transparency, governance, and automation. In this paper, we model the relationship between rules and their enforcement at scale, introducing ModQ, a novel question-answering framework for rule-sensitive content moderation. Unlike prior classification or generation-based approaches, ModQ conditions on the full set of community rules at inference time and identifies which rule best applies to a given comment. We implement two model variants, extractive and multiple-choice QA, and train them on large-scale datasets from Reddit and Lemmy, the latter of which we construct from publicly available moderation logs and rule descriptions. Both models outperform state-of-the-art baselines in identifying moderation-relevant rule violations, while remaining lightweight and interpretable. Notably, ModQ models generalize effectively to unseen communities and rules, supporting low-resource moderation settings and dynamic governance environments.
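
To make the question-answering framing concrete, the following is a minimal sketch of how a comment and a community's full rule set might be arranged as inputs for the two variants named in the abstract. The record does not specify the paper's actual input format, so every name here (`build_extractive`, `build_multiple_choice`, `pick_rule`), the newline separator, and the example rules are illustrative assumptions, and the scores stand in for the output of a trained model that is not shown.

```python
from dataclasses import dataclass


@dataclass
class ExtractiveExample:
    """Hypothetical container for an extractive-QA style input."""
    question: str  # the comment under review
    context: str   # all community rules joined into one passage
    spans: list    # (start, end) character offsets of each rule in context


def build_extractive(comment: str, rules: list[str]) -> ExtractiveExample:
    """Join the rules into one context and record where each rule sits.

    Assumption: rules are separated by a single newline. An extractive
    model would then point at the span of the rule that best applies.
    """
    spans, cursor = [], 0
    for rule in rules:
        spans.append((cursor, cursor + len(rule)))
        cursor += len(rule) + 1  # +1 accounts for the newline separator
    return ExtractiveExample(question=comment, context="\n".join(rules), spans=spans)


def build_multiple_choice(comment: str, rules: list[str]) -> list[tuple[str, str]]:
    """Pair the comment with every rule; a model would score each pair."""
    return [(comment, rule) for rule in rules]


def pick_rule(scores: list[float], rules: list[str]) -> str:
    """Return the highest-scoring rule (argmax over the choices)."""
    best = max(range(len(rules)), key=lambda i: scores[i])
    return rules[best]


if __name__ == "__main__":
    rules = ["Be civil", "No spam", "Stay on topic"]  # made-up example rules
    comment = "buy cheap pills here"
    example = build_extractive(comment, rules)
    print(example.spans)
    # Placeholder scores; a real system would obtain these from a model.
    print(pick_rule([0.1, 0.8, 0.1], rules))
```

Because the rules are supplied as input rather than fixed as output labels, the same functions accept a rule list the model has never seen, which is the property the abstract attributes to conditioning on community rules at inference time.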

Details

Title: Asking For It: Question-Answering for Predicting Rule Infractions in Online Content Moderation
Creators: Mattia Samory; Diana Pamfile; Andrew To; Shruti Phadke
Publication Details: ArXiv.org
Resource Type: Preprint
Language: English
Academic Unit: Information Science
Other Identifier: 991022123438804721
