arxiv:2312.11370
ZHANG Jipeng
2003pro
AI & ML interests
NLP
Recent Activity
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted a paper about 2 months ago
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
upvoted a paper about 2 months ago
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
Organizations
None yet