VISIONx @ NYU

university

https://www.sainingxie.com/

AI & ML interests

None defined yet.

Recent Activity

bytetriper updated a model about 6 hours ago

nyu-visionx/RAE-collections

junwann authored a paper 1 day ago

Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching

bytetriper updated a model 11 days ago

nyu-visionx/dinov2-large_decoder

View all activity

Papers

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

View all Papers

nyu-visionx 's models 38

nyu-visionx/RAE-collections

Unconditional Image Generation • Updated about 5 hours ago • 43

nyu-visionx/dinov2-large_decoder

Updated 11 days ago • 18

nyu-visionx/webmae_decoder

Updated 20 days ago • 13

nyu-visionx/siglip2_decoder

Image-to-Image • Updated 25 days ago • 1.24k

nyu-visionx/webssl300m_decoder

Image-to-Image • Updated 25 days ago • 82

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B-WebSSL

Text-to-Image • 4B • Updated 25 days ago • 145

nyu-visionx/Scale-RAE-Qwen7B_DiT9.8B

Text Generation • 17B • Updated Jan 8 • 254 • 1

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B

Text Generation • 4B • Updated Jan 8 • 1.21k

nyu-visionx/Cambrian-S-3B-S3

3B • Updated Jan 4 • 708

nyu-visionx/Cambrian-S-3B-S2

3B • Updated Jan 4 • 3

nyu-visionx/Cambrian-S-3B-S1

3B • Updated Jan 4 • 2

nyu-visionx/Cambrian-S-1.5B-S3

2B • Updated Jan 4 • 518

nyu-visionx/Cambrian-S-1.5B-S2

2B • Updated Jan 4 • 1

nyu-visionx/Cambrian-S-1.5B-S1

2B • Updated Jan 4 • 1

nyu-visionx/Cambrian-S-0.5B-S3

0.9B • Updated Jan 4 • 2.03k

nyu-visionx/Cambrian-S-0.5B-S2

0.9B • Updated Jan 4 • 44

nyu-visionx/Cambrian-S-0.5B-S1

0.9B • Updated Jan 4

nyu-visionx/Cambrian-S-7B-S1

8B • Updated Dec 24, 2025 • 3

nyu-visionx/Cambrian-S-7B-S2

8B • Updated Dec 24, 2025 • 2.33k

nyu-visionx/Cambrian-S-7B-S3

8B • Updated Dec 24, 2025 • 13.5k

nyu-visionx/FreeFlow

Unconditional Image Generation • Updated Nov 29, 2025 • 1

nyu-visionx/Cambrian-S-1.5B

Image-to-Text • 2B • Updated Nov 7, 2025 • 53 • 3

nyu-visionx/Cambrian-S-3B

Image-to-Text • 3B • Updated Nov 7, 2025 • 1.64k • 1

nyu-visionx/Cambrian-S-0.5B

Image-to-Text • 0.9B • Updated Nov 7, 2025 • 239 • 2

nyu-visionx/Cambrian-S-7B

Image-to-Text • 8B • Updated Nov 7, 2025 • 4.58k • 5

nyu-visionx/Cambrian-S-7B-LFP

8B • Updated Nov 6, 2025 • 12k • 3

nyu-visionx/SiT-collections

Updated Nov 5, 2025

nyu-visionx/DiffuseNNX-collections

Updated Nov 4, 2025

nyu-visionx/pyramid_flow_ft_ckpt

Updated Mar 30, 2025

nyu-visionx/cambrian-phi3-3b

Text Generation • 4B • Updated Jul 6, 2024 • 175 • 11