AI2 Adapt Dev

community

AI & ML interests

Open science can (maybe) save the world

Recent Activity

DongfuJiang authored a paper 5 days ago

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

DongfuJiang authored a paper 5 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

DongfuJiang authored a paper 5 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

authored 3 papers 5 days ago

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

Paper • 2603.12698 • Published 17 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 11 days ago • 62

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 12 days ago • 90

authored a paper about 1 month ago

References Improve LLM Alignment in Non-Verifiable Domains

Paper • 2602.16802 • Published Feb 18 • 2

authored 11 papers about 2 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14, 2025 • 3

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Paper • 2502.18443 • Published Feb 25, 2025 • 11

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Paper • 2504.11393 • Published Apr 15, 2025 • 18

Teaching Models to Understand (but not Generate) High-risk Data

Paper • 2505.03052 • Published May 5, 2025 • 6

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 60

FlexOlmo: Open Language Models for Flexible Data Use

Paper • 2507.07024 • Published Jul 9, 2025 • 10

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 16

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 30

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published Dec 17, 2025 • 17

authored a paper 3 months ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 30

submitted a paper to Daily Papers 3 months ago

Feedforward 3D Editing via Text-Steerable Image-to-3D

Paper • 2512.13678 • Published Dec 15, 2025 • 14

authored a paper 4 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63
