Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 3 days ago • 46
The Role of Computing Resources in Publishing Foundation Model Research Paper • 2510.13621 • Published Oct 15, 2025 • 17
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data Paper • 2510.09781 • Published Oct 10, 2025 • 27
Breaking Focus: Contextual Distraction Curse in Large Language Models Paper • 2502.01609 • Published Feb 3, 2025 • 1
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published Feb 20, 2025 • 45
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge Paper • 2410.02736 • Published Oct 3, 2024 • 1
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? Paper • 2410.21259 • Published Oct 28, 2024 • 1
Breaking Focus: Contextual Distraction Curse in Large Language Models Paper • 2502.01609 • Published Feb 3, 2025 • 1
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3, 2025 • 40