HTR ByteDance/Sa2VA-4B Image-Text-to-Text • Updated Sep 8, 2025 • 60.9k • 96 Finnish-NLP/Ahma-2-4B-Instruct Text Generation • 4B • Updated Nov 25, 2025 • 135 • 4 black-forest-labs/FLUX.2-dev Image-to-Image • Updated 23 days ago • 927k • • 1.43k mistralai/Mistral-Large-Instruct-2407 Updated Jul 28, 2025 • 8.01k • 857
Computer Vision Vision Grid Transformer for Document Layout Analysis Paper • 2308.14978 • Published Aug 29, 2023 • 4
HTR ByteDance/Sa2VA-4B Image-Text-to-Text • Updated Sep 8, 2025 • 60.9k • 96 Finnish-NLP/Ahma-2-4B-Instruct Text Generation • 4B • Updated Nov 25, 2025 • 135 • 4 black-forest-labs/FLUX.2-dev Image-to-Image • Updated 23 days ago • 927k • • 1.43k mistralai/Mistral-Large-Instruct-2407 Updated Jul 28, 2025 • 8.01k • 857
Computer Vision Vision Grid Transformer for Document Layout Analysis Paper • 2308.14978 • Published Aug 29, 2023 • 4