The common approach to communicating a large language model's (LLM) uncertainty is to add a percentage number or a hedging word to its response. But is this all we can do? Instead of generating a single answer and then hedging it, an LLM that is fully transparent to the user needs to be able to reflect on its internal belief distribution and output a summary of all options it deems possible, and how likely they are. To test whether LLMs possess this capability, we develop the SelfReflect metric, an information-theoretic distance between a given summary and a distribution over answers. In interventional and human studies, we find that SelfReflect detects even slight deviations, yielding a fine-grained measure of faithfulness between a summary string and an LLM's actual internal distribution over answers. With SelfReflect, we make a convincing negative observation: modern LLMs are, across the board, incapable of revealing what they are uncertain about, neither via reasoning, nor chains-of-thought, nor explicit finetuning. However, we do find that LLMs are able to generate faithful summaries of their uncertainties if we help them by sampling multiple outputs and feeding them back into the context. This simple approach points toward a universal way of communicating LLM uncertainties, whose future development the SelfReflect score enables.
- † Independent Researcher
- ‡ Tübingen AI Center
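The sample-and-summarize idea from the abstract — drawing multiple answers and feeding them back into the context — can be sketched as follows. This is a minimal illustration, not the paper's implementation: `generate` is a hypothetical stand-in for any LLM call, and the prompt wording is an assumption.

```python
from collections import Counter


def sample_answers(generate, question, n=10):
    # Draw n answers from the model; the spread of these samples
    # approximates the model's internal distribution over answers.
    return [generate(question) for _ in range(n)]


def uncertainty_prompt(question, answers):
    # Feed the sampled answers back into the context and ask the model
    # to summarize all options it deems possible and how likely they are.
    listing = "\n".join(f"- {a}" for a in answers)
    return (
        f"Question: {question}\n"
        f"Here are {len(answers)} answers you previously sampled:\n"
        f"{listing}\n"
        "Summarize every distinct option above and state how likely each one is."
    )


def empirical_distribution(answers):
    # Frequency-based view of the same samples; a summary can be scored
    # for faithfulness against such a distribution over answers.
    counts = Counter(answers)
    total = len(answers)
    return {a: c / total for a, c in counts.items()}
```

For example, with a toy model that answers "Paris" three times and "Lyon" once, `empirical_distribution(["Paris", "Paris", "Lyon", "Paris"])` yields `{"Paris": 0.75, "Lyon": 0.25}`, and a faithful summary would need to mention both options with roughly those weights.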

