Kaicheng Yang

Kaichengalex

https://kaichengyang0828.github.io/Kaicheng-Yang0828.github.io/

kaichengyang0828

AI & ML interests

Multimodal Representation Learning/ Vision-Language Pretraining/DeepResearch

Recent Activity

authored a paper 1 day ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

updated a collection 1 day ago

UniDoc-RL

updated a collection 1 day ago

UniDoc-RL

View all activity

Organizations

authored a paper 1 day ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

Paper • 2604.14967 • Published 2 days ago • 8

updated a collection 1 day ago

UniDoc-RL

Collection

4 items • Updated 1 day ago

published a dataset 1 day ago

DeepGlint-AI/UniDoc-RL

Viewer • Updated 2 days ago • 3.76k • 10

published 2 models 1 day ago

DeepGlint-AI/UniDoc-RL-3B

4B • Updated 2 days ago • 20

DeepGlint-AI/UniDoc-RL-7B

8B • Updated 2 days ago • 20

upvoted a paper 1 day ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

Paper • 2604.14967 • Published 2 days ago • 8

submitted a paper to Daily Papers 1 day ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

Paper • 2604.14967 • Published 2 days ago • 8

upvoted a paper 2 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 3 days ago • 135

upvoted a paper 10 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 12 days ago • 233

upvoted a paper 17 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 20 days ago • 143

updated a dataset 24 days ago

DeepGlint-AI/DanQing100M

Viewer • Updated 24 days ago • 99.9M • 2.31k • 50

updated a collection about 1 month ago

SFT Dataset

Collection

8 items • Updated Mar 16

upvoted 2 papers about 1 month ago

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

Phi-4-reasoning-vision-15B Technical Report

Paper • 2603.03975 • Published Mar 4 • 20

upvoted a paper about 2 months ago

Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension

Paper • 2602.13310 • Published Feb 10 • 8

upvoted a paper 2 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

liked a model 2 months ago

zai-org/GLM-OCR

Image-to-Text • Updated 4 days ago • 7.13M • • 1.63k

Kaicheng Yang

AI & ML interests

Recent Activity

Organizations

Kaichengalex's activity