Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Yuqian Hong
lavinal712
AI & ML interests
Diffusion Models
Multimodal Models
Organizations
models 28
lavinal712/transfusion-7b
13B • Updated • 1
lavinal712/omini-kontext-viton-hd-kontext
Updated
lavinal712/NextStep-1-f8ch16-Tokenizer-diffusers
Updated • 3
lavinal712/omnitok_pretrain_vitamin_base_all_tokens_vae_embed_dim_32_wo_foundation_model
Updated
lavinal712/omnitok_pretrain_vitamin_base_all_tokens_vae_embed_dim_16
Updated
lavinal712/omnitok_pretrain_vitamin_base_all_tokens_vae_embed_dim_32
Updated
lavinal712/omnitok_pretrain_vitamin_base_all_tokens
Updated
lavinal712/omnitok_pretrain_vitamin_base_siglip
Updated
lavinal712/omnitok_pretrain_vitamin_base_wo_foundation_model
Updated
lavinal712/omnitok_pretrain_vitamin_base_w_augmentation
Updated