
jasonjiang

mikinyaa

jasonjiang8866

AI & ML interests

None yet


Organizations

None yet

liked 2 models about 15 hours ago

khazarai/Qwen3-4B-Qwen3.6-plus-Reasoning-Distilled-GGUF

Text Generation • 4B • Updated less than a minute ago • 6.6k • 7

zai-org/GLM-5.1

Text Generation • 754B • Updated about 8 hours ago • 1.3k • 721

upvoted a paper about 19 hours ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 3 days ago • 164

liked a model about 20 hours ago

kai-os/Carnice-9b

Text Generation • 9B • Updated 5 days ago • 2.33k • 124

upvoted a paper 1 day ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 6 days ago • 136

upvoted a paper 3 days ago

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published 8 days ago • 11

liked a model 3 days ago

mudler/Qwen3.5-35B-A3B-APEX-GGUF

Text Generation • 35B • Updated 7 days ago • 55.7k • 73

upvoted a paper 4 days ago

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

Paper • 2604.01658 • Published 7 days ago • 46

liked a model 4 days ago

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated 3 days ago • 11.9k • 150

upvoted a paper 5 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 13 days ago • 243

upvoted a paper 12 days ago

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 16 days ago • 25

upvoted a paper 13 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published 15 days ago • 35

upvoted 2 papers 15 days ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 16 days ago • 29

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 18 days ago • 77

liked a model 21 days ago

Rakuten/RakutenAI-3.0

Text Generation • 671B • Updated 23 days ago • 17.5k • 71

upvoted a paper 24 days ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

upvoted 4 papers about 1 month ago

Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

Paper • 2603.04257 • Published Mar 4 • 19

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 44

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 193

From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

Paper • 2603.00141 • Published Feb 24 • 138
