Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
wikilangs
/
en
like
0
Follow
WikiLangs
10
Text Generation
fastText
omarkamali/wikipedia-monthly
English
wikilangs
nlp
tokenizer
embeddings
n-gram
markov
wikipedia
feature-extraction
sentence-similarity
tokenization
n-grams
markov-chain
text-mining
babelvec
vocabulous
vocabulary
monolingual
family-germanic_west_anglofrisian
License:
mit
Model card
Files
Files and versions
xet
Community
Use this model
main
en
/
models
/
tokenizer
4.9 MB
1 contributor
History:
1 commit
omarkamali
Upload all models and assets for en (latest)
395ebaa
verified
4 days ago
en_tokenizer_16k.model
514 kB
xet
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_16k.vocab
239 kB
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_32k.model
794 kB
xet
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_32k.vocab
504 kB
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_64k.model
1.34 MB
xet
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_64k.vocab
1.02 MB
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_8k.model
376 kB
xet
Upload all models and assets for en (latest)
4 days ago
en_tokenizer_8k.vocab
112 kB
Upload all models and assets for en (latest)
4 days ago