mirth/chonky_distilbert_base_uncased_1 Token Classification • 66.4M • Updated Jan 17 • 58.4k • • 15
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Jan 17 • 163 • 23
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.45M • • 1.47k
mamei16/chonky_distilbert-base-multilingual-cased Token Classification • 0.1B • Updated Nov 14, 2025 • 37 • 4
Text chunking / splitting models Collection It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module. • 4 items • Updated 25 days ago • 1
Text chunking / splitting models Collection It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module. • 4 items • Updated 25 days ago • 1
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Jan 17 • 163 • 23
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Jan 17 • 163 • 23
mamei16/chonky_distilbert_base_uncased_1.1 Token Classification • 66.4M • Updated Nov 13, 2025 • 3 • 2