LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 18 days ago • 43
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107 • 5
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper • 2505.20793 • Published May 27, 2025 • 13
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation Paper • 2508.16763 • Published Aug 22, 2025 • 2
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
aarashfeizi/jean-francois-godbout-batch3-repeats4-rank8-snr5.0 Text-to-Image • Updated Apr 29, 2024 • 6 •
aarashfeizi/jean-francois-godbout-batch2-repeats4-rank16-snr5.0 Text-to-Image • Updated Apr 29, 2024 • 3 •
aarashfeizi/jean-francois-godbout-batch3-repeats4-rank32-snrNone Text-to-Image • Updated Apr 29, 2024 • 2 •
aarashfeizi/jean-francois-godbout-batch3-repeats3-rank32-snr5.0 Text-to-Image • Updated Apr 29, 2024 • 3 •
aarashfeizi/jean-francois-godbout-batch2-repeats3-rank16-snrNone Text-to-Image • Updated Apr 29, 2024 • 3 •
aarashfeizi/jean-francois-godbout-batch3-repeats3-rank8-snr5.0 Text-to-Image • Updated Apr 29, 2024 • 5 •
aarashfeizi/jean-francois-godbout-batch3-repeats3-rank32-snrNone Text-to-Image • Updated Apr 29, 2024 • 9 •