Hejun Dong

fickle1101

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 122

commented on a paper about 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 136

upvoted a paper about 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 136

updated a Space about 2 months ago

MinerU Diffusion V1 0320 2.5B

🦀

demo of MinerU-Diffusion

published a Space about 2 months ago

MinerU Diffusion V1 0320 2.5B

🦀

demo of MinerU-Diffusion

updated a model 3 months ago

fickle1101/nolayout_final_108k

3B • Updated Jan 30 • 1

published a model 3 months ago

fickle1101/nolayout_final_108k

3B • Updated Jan 30 • 1

updated a model 4 months ago

fickle1101/no_merger_pm2x_custom_lr_best_results

Updated Jan 26

published a model 4 months ago

fickle1101/no_merger_pm2x_custom_lr_best_results

Updated Jan 26

updated a model 4 months ago

fickle1101/no_merger_pm2x_s2_4e_best

3B • Updated Jan 20 • 8

published a model 4 months ago

fickle1101/no_merger_pm2x_s2_4e_best

3B • Updated Jan 20 • 8

liked a model 8 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Apr 9 • 1.33M • 356

upvoted a paper 8 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 161

published a model 11 months ago

fickle1101/native_qwen2_5_vit_ocr

1B • Updated Jun 16, 2025 • 1

updated a model 11 months ago

fickle1101/native_qwen2_5_vit_ocr

1B • Updated Jun 16, 2025 • 1

upvoted a paper about 1 year ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published Apr 14, 2025 • 39
