Harvard-DCML/boomerang-qwen3-2.3B
Text Generation
•
3B
•
Updated
•
138
•
1
Data-Centric ML
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
Boomerang Distillation Enables Zero-Shot Model Size Interpolation