FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation Paper • 2601.13976 • Published Jan 20 • 21
Running on Zero 1.38k FLUX Prompt Generator 😻 1.38k Launch an interactive demo interface for the tool
Running on Zero Featured 826 Florence 2 📉 826 Perform image captioning, detection, OCR and more with Florence‑2