Data-Juicer

community

https://github.com/datajuicer/data-juicer

datajuicer

AI & ML interests

None defined yet.

Recent Activity

xiaoniqiu authored a paper 3 days ago

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

xiaoniqiu authored a paper 3 days ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

xiaoniqiu authored a paper 3 days ago

Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation Models

xiaoniqiu

authored 5 papers 3 days ago

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Paper • 2407.08583 • Published Jul 11, 2024 • 13

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23, 2025 • 10

Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation Models

Paper • 2501.14755 • Published Dec 23, 2024

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Paper • 2509.24203 • Published Sep 29, 2025 • 8

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published 10 days ago • 52

hiyuchang

authored 7 papers 4 days ago

Exploring Selective Layer Fine-Tuning in Federated Learning

Paper • 2408.15600 • Published Aug 28, 2024

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23, 2025 • 10

Enhancing Latent Computation in Transformers with Latent Tokens

Paper • 2505.12629 • Published May 19, 2025

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Paper • 2509.24203 • Published Sep 29, 2025 • 8

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Paper • 2601.03715 • Published Jan 7 • 1

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published 10 days ago • 52

liyuyi

in datajuicer/VeriSciQA 4 days ago

v2: Migrate to Parquet format with embedded images

#1 opened 4 days ago by liyuyi

LuckyBanana

updated a dataset 2 months ago

datajuicer/Img-Diff

Updated Dec 3, 2025 • 29 • 6

yxdyc

published a dataset 3 months ago

datajuicer/VeriSciQA

Viewer • Updated 4 days ago • 20.3k • 70

liyuyi

updated a dataset 3 months ago

datajuicer/VeriSciQA

Viewer • Updated 4 days ago • 20.3k • 70

SarahZhout

updated a dataset 3 months ago

datajuicer/HumanVBench

Viewer • Updated Nov 21, 2025 • 2.18k • 4.2k • 3

yxdyc

updated a dataset 3 months ago

datajuicer/RealMedConv

Viewer • Updated Nov 10, 2025 • 2k • 13

yxdyc

published a dataset 3 months ago

datajuicer/RealMedConv

Viewer • Updated Nov 10, 2025 • 2k • 13

xiaoniqiu

updated a dataset 4 months ago

datajuicer/geometry_sft

Viewer • Updated Oct 27, 2025 • 300 • 57

Team members 18
