·
AI & ML interests
multilingual NLP, tokenization
Recent Activity
Organizations
view article There is no such thing as a tokenizer-free lunch
view article An Analysis of Multilingual Models on Hugging Face
view article Best Practices for Open Multilingual LLM Evaluation
published an article over 1 year ago published an article over 1 year ago view article Releasing the largest multilingual open pretraining dataset
published an article over 1 year ago published an article over 1 year ago