Multilingual for Translation Corpus Helsinki-NLP/opus_books Viewer • Updated Mar 29, 2024 • 1.25M • 19k • 87
models rasa/LaBSE Feature Extraction • Updated May 20, 2021 • 20.6k • • 22 nomic-ai/nomic-embed-text-v1.5 Sentence Similarity • Updated Jul 21, 2025 • 3.91M • 773 NovaSearch/stella_en_1.5B_v5 Sentence Similarity • Updated Jul 28, 2025 • 24.4k • 261 llmware/llama-3.2-1b-gguf 1B • Updated Feb 8, 2025 • 17 • 1
Vietnamese ngtoanrob/vien-translation Translation • Updated Feb 24, 2023 • 6 • 1 ngtoanrob/envi-translation Updated Apr 1, 2023 • 2 • 1 gozu888/Envit5-tuned Translation • 0.3B • Updated Jun 28, 2023 • 16 • 3 IWSLT/mt_eng_vietnamese Updated Jan 18, 2024 • 176 • 29
Wish list HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 34.3k • 660 bookcorpus/bookcorpus Updated May 3, 2024 • 9.18k • 349 sentence-transformers/wikipedia-en-sentences Viewer • Updated Apr 25, 2024 • 7.87M • 152 • 7 sentence-transformers/paq Viewer • Updated May 1, 2024 • 64.4M • 971 • 2
LLMs TheBloke/Llama-2-13B-chat-GGML Text Generation • Updated Sep 27, 2023 • 350 • 696 TheBloke/Llama-2-7B-32K-Instruct-GGML Updated Sep 27, 2023 • 1 • 8 openchat/openchat-3.6-8b-20240522 Text Generation • 8B • Updated May 28, 2024 • 9.64k • • 157
corpuses Skylion007/openwebtext Viewer • Updated Dec 26, 2025 • 8.01M • 60.3k • 490 humarin/chatgpt-paraphrases Viewer • Updated Apr 5, 2023 • 419k • 125 • 59 stanford-oval/ccnews Viewer • Updated Aug 31, 2024 • 893M • 3.08k • 32 stanford-oval/wikipedia Viewer • Updated Apr 29, 2025 • 345M • 2.57k • 14
Multilingual for Translation Corpus Helsinki-NLP/opus_books Viewer • Updated Mar 29, 2024 • 1.25M • 19k • 87
Wish list HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 34.3k • 660 bookcorpus/bookcorpus Updated May 3, 2024 • 9.18k • 349 sentence-transformers/wikipedia-en-sentences Viewer • Updated Apr 25, 2024 • 7.87M • 152 • 7 sentence-transformers/paq Viewer • Updated May 1, 2024 • 64.4M • 971 • 2
models rasa/LaBSE Feature Extraction • Updated May 20, 2021 • 20.6k • • 22 nomic-ai/nomic-embed-text-v1.5 Sentence Similarity • Updated Jul 21, 2025 • 3.91M • 773 NovaSearch/stella_en_1.5B_v5 Sentence Similarity • Updated Jul 28, 2025 • 24.4k • 261 llmware/llama-3.2-1b-gguf 1B • Updated Feb 8, 2025 • 17 • 1
LLMs TheBloke/Llama-2-13B-chat-GGML Text Generation • Updated Sep 27, 2023 • 350 • 696 TheBloke/Llama-2-7B-32K-Instruct-GGML Updated Sep 27, 2023 • 1 • 8 openchat/openchat-3.6-8b-20240522 Text Generation • 8B • Updated May 28, 2024 • 9.64k • • 157
Vietnamese ngtoanrob/vien-translation Translation • Updated Feb 24, 2023 • 6 • 1 ngtoanrob/envi-translation Updated Apr 1, 2023 • 2 • 1 gozu888/Envit5-tuned Translation • 0.3B • Updated Jun 28, 2023 • 16 • 3 IWSLT/mt_eng_vietnamese Updated Jan 18, 2024 • 176 • 29
corpuses Skylion007/openwebtext Viewer • Updated Dec 26, 2025 • 8.01M • 60.3k • 490 humarin/chatgpt-paraphrases Viewer • Updated Apr 5, 2023 • 419k • 125 • 59 stanford-oval/ccnews Viewer • Updated Aug 31, 2024 • 893M • 3.08k • 32 stanford-oval/wikipedia Viewer • Updated Apr 29, 2025 • 345M • 2.57k • 14