DECS NRP Detector

This repository contains the NRP detector model used in the DECS algorithm. It is designed to determine whether a given reasoning chunk contains the ground truth signal.

Citation

If you use this model, please cite:

@inproceedings{jiang2026overthinking,
title={Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling},
author={Shuyang Jiang and Yusheng Liao and Ya Zhang and Yanfeng Wang and Yu Wang},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=kdeiRledV6}
}
Downloads last month
18
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pixas/DECS_NRP_DETECTOR

Base model

Qwen/Qwen2.5-1.5B
Finetuned
(1432)
this model
Quantizations
1 model