← heapsort-ai

content moderation

22 items

ARTICLE↑ trendingReddit r/LocalLLaMA·4/14/2026

Please stop using AI for posts and showcasing your completely vibe coded projects

The user expresses frustration with the overwhelming presence of fully AI-coded projects and AI-generated posts with minimal human input in an AI-focused community. They argue that while AI assistance is acceptable, the sub should not become an "AI slop sub" due to a lack of original human contribution.

53
RESEARCHarXiv CS.AI·5d ago

Consensus is Strategically Insufficient: Reasoning-Trace Disagreement as a Knowledge-Representation Signal

This paper argues that reducing disagreement in multi-agent systems is insufficient for value-laden tasks, proposing a knowledge-representation layer. This layer abstracts reasoning traces and agent decisions into symbolic disagreement states, distinguishing four types, with application in content moderation.

28
ARTICLEDEV.to AI·4/20/2026

ModSense Moderation Intelligence System

ModSense is an AI-assisted moderation intelligence system, a production-grade prototype designed for large communities like Reddit. It combines real-time anomaly detection and graph-based community health modeling with an agentic AI layer (Gemini 3 Flash) to identify and respond to evolving issues like toxicity, brigading, and misinformation.

27
RESEARCHarXiv CS.AI·4/25/2026

Escaping the Agreement Trap: Defensibility Signals for Evaluating Rule-Governed AI

This paper proposes a new framework for evaluating rule-governed AI, particularly in content moderation, by moving beyond simple agreement metrics. It introduces the Defensibility Index (DI), Ambiguity Index (AI), and Probabilistic Defensibility Signal (PDS) to assess policy-grounded correctness and reasoning stability, using LLM traces to verify logical derivability from governing rules.

27