view article Article Introducing Cohere-transcribe: state-of-the-art speech recognition 5 days ago • 28
view article Article Konkani LLM: Bringing a Multi-Script Low-Resource Language to the AI Era 24 days ago • 8
konkani-gemma-3 Collection Finetuned Gemma-3 model for konkani • 6 items • Updated about 1 month ago • 1
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated 29 days ago • 706
World events Collection Dataset containing real world events from 2023 till present • 3 items • Updated Jan 26 • 5
Konkani-Bench Collection contains the dataset used to evaluate the model • 1 item • Updated 24 days ago • 1
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 19 days ago • 31
arabic datasets Collection datasets related to Arabic-tunisian dialect • 17 items • Updated Nov 22, 2025 • 3
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance Paper • 2504.09753 • Published Apr 13, 2025 • 6