Open Legal Data Collection A collection of our favorite open-source legal datasets on Hugging Face. • 13 items • Updated about 17 hours ago • 5
Explainable AI through a Democratic Lens: DhondtXAI for Proportional Feature Importance Using the D'Hondt Method Paper • 2411.05196 • Published Nov 7, 2024 • 1
Open-AgentRL Collection RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated 20 days ago • 6
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 31 items • Updated 13 days ago • 67
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning Paper • 2509.24650 • Published Sep 29, 2025 • 3
view article Article The Explicit State Loop in LLM-Based Systems: How AI and the FIX Protocol Work Together Jan 13 • 2
Turkish Subwords Research Collection Collection models, tokenizers and testsets for the research work "Optimal Turkish Subword Strategies at Scale". The models are experimental models. • 35 items • Updated 12 days ago • 2
Earth-2 Collection Open, state of the art models for Climate and Weather forecasting. Nowcasting, Medium range, S2S range, Downscaling. • 7 items • Updated 18 days ago • 20
view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 25 days ago • 101
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated Jan 21 • 209
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published Nov 20, 2025 • 24
view article Article Explainability and Trustworthiness in Large Language Models: Implications for Industry and Policy Dec 28, 2025 • 1