The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation Paper • 2604.16830 • Published 4 days ago • 10
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation Paper • 2604.16830 • Published 4 days ago • 10
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation Paper • 2604.16830 • Published 4 days ago • 10
From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models Paper • 2601.15690 • Published Jan 22 • 4
From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models Paper • 2601.15690 • Published Jan 22 • 4
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models Paper • 2410.09629 • Published Oct 12, 2024 • 1
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models Paper • 2401.02132 • Published Jan 4, 2024 • 3
Holistic Evaluation for Interleaved Text-and-Image Generation Paper • 2406.14643 • Published Jun 20, 2024
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models Paper • 2410.09629 • Published Oct 12, 2024 • 1
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer Paper • 2506.06952 • Published Jun 8, 2025 • 9
MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion Paper • 2510.22768 • Published Oct 26, 2025 • 8
SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency Paper • 2311.01740 • Published Nov 3, 2023 • 1
SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency Paper • 2311.01740 • Published Nov 3, 2023 • 1