Large language models (LLMs) often generate hallucinations, unsupported content that undermines reliability. While most prior works frame hallucination detection as a binary task, many real-world applications require identifying hallucinated spans, which is a multi-step decision-making process. This naturally raises the question of whether explicit reasoning can help the complex task of detecting hallucination spans. To answer this question, we first evaluate pretrained models with and without Chain-of-Thought (CoT) reasoning, and show that CoT reasoning has the potential to generate at least one correct answer when sampled multiple times. Motivated by this, we propose RL4HS, a reinforcement learning framework that incentivizes reasoning with a span-level reward function. RL4HS builds on Group Relative Policy Optimization and introduces Class-Aware Policy Optimization to mitigate the reward imbalance issue. Experiments on the RAGTruth benchmark (summarization, question answering, data-to-text) show that RL4HS surpasses pretrained reasoning models and supervised fine-tuning, demonstrating the necessity of reinforcement learning with span-level rewards for detecting hallucination spans.
- † National Taiwan University, Taiwan
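To make the notion of a span-level reward concrete, the sketch below shows one plausible instantiation: an F1-style overlap score between predicted and gold hallucination spans over character offsets. This is an illustrative assumption, not the paper's actual reward definition; the function name `span_f1_reward` and the half-open `(start, end)` span convention are hypothetical.

```python
# Hypothetical sketch of a span-level reward for hallucination span detection.
# Assumes an F1-style character-overlap score; the paper's exact reward
# function may differ.

def span_f1_reward(pred_spans, gold_spans):
    """Overlap F1 between predicted and gold spans.

    Each span is a (start, end) half-open interval over character offsets.
    Returns a scalar reward in [0, 1].
    """
    # Expand each span list into the set of characters it covers.
    pred_chars = set()
    for start, end in pred_spans:
        pred_chars.update(range(start, end))
    gold_chars = set()
    for start, end in gold_spans:
        gold_chars.update(range(start, end))

    # Both empty: the model correctly predicted "no hallucination".
    if not pred_chars and not gold_chars:
        return 1.0

    overlap = len(pred_chars & gold_chars)
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_chars)
    recall = overlap / len(gold_chars)
    return 2 * precision * recall / (precision + recall)
```

A reward of this shape rewards partial credit for overlapping but imperfect spans, which a binary detection reward cannot express; e.g. `span_f1_reward([(0, 5)], [(3, 9)])` returns 4/11 for a prediction that covers part of the gold span.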

