RESEARCHarXiv CS.CL·19d ago
CR4T: Rewrite-Based Guardrails for Adolescent LLM Safety
Current LLM safety mechanisms for adolescents are often adult-centric and refusal-based, which can create conversational dead-ends and fail to address developmental vulnerabilities. This paper introduces CR4T, a model-agnostic safeguarding framework designed to transform unsafe or refusal-style outputs into age-appropriate, guidance-oriented responses for teenagers.
28