-
PretrainZero: Reinforcement Active Pretraining
Paper • 2512.03442 • Published • 48 -
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Paper • 2512.03383 • Published • 5 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 125 -
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
Paper • 2511.18890 • Published • 35
Flavius Burca
flaviusburca
AI & ML interests
None yet
Recent Activity
updated
a model 3 days ago
surogate/Qwen3-0.6B-NVFP4 published
a model 3 days ago
surogate/Qwen3-0.6B-NVFP4