Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
11
43
Mohammed Khalil
mohamed-khalil
Follow
adamm-hf's profile picture
X779's profile picture
21world's profile picture
5 followers
·
5 following
https://v3xlrm1nOwo1.github.io
v3xlrm1nOwo1
v3xlrm1nOwo1
AI & ML interests
ML Researcher || NLP || anime
Recent Activity
liked
a dataset
2 months ago
nyu-mll/glue
reacted
to
Jaward
's
post
with ❤️
7 months ago
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
upvoted
a
paper
9 months ago
Large Language Diffusion Models
View all activity
Organizations
mohamed-khalil
's models
None public yet