Rookienovice

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 5 days ago

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published 13 days ago • 137

upvoted a paper 13 days ago

From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

Paper • 2603.00141 • Published 20 days ago • 134

upvoted a paper 17 days ago

MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios

Paper • 2602.22638 • Published 18 days ago • 106

upvoted 2 papers about 1 month ago

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 200

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 150

upvoted 3 papers about 2 months ago

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Paper • 2601.20354 • Published Jan 28 • 111

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published Jan 28 • 120

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Paper • 2601.05138 • Published Jan 8 • 18

liked a model about 2 months ago

dx8152/Qwen-Image-Edit-2511-Gaussian-Splash

Image-to-Image • Updated Jan 28 • 1.37k • 166

upvoted a paper about 2 months ago

Urban Socio-Semantic Segmentation with Vision-Language Reasoning

Paper • 2601.10477 • Published Jan 15 • 155

upvoted 2 papers 2 months ago

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 169

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Paper • 2512.24146 • Published Dec 30, 2025 • 14

upvoted 3 papers 10 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 155

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12, 2025 • 30

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

liked a model about 2 years ago

IDEA-CCNL/Taiyi-Stable-Diffusion-1B-Chinese-v0.1

Text-to-Image • Updated May 25, 2023 • 434 • 442

liked a Space about 2 years ago

Stable Diffusion XL on TPUv5e

🏋 • 2.04k • Generate images from text prompts
