A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret
Shu Zhao
TreezzZ
AI & ML interests
None yet
Organizations
models 14
TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo
15B • Updated
TreezzZ/Ferret_ParallelSearch_Qwen3-30b-a3b-instruct_ppo
31B • Updated
TreezzZ/Ferret_ParallelSearch_Qwen2.5-3b-instruct_ppo
3B • Updated
• 3
TreezzZ/Ferret_ParallelSearch_Qwen3-4b-instruct_ppo
4B • Updated
TreezzZ/Ferret_ExpandSearch_Qwen2.5-3b-instruct_Llama4-Maverick-17b-128e-instruct_ppo
3B • Updated
TreezzZ/Ferret_ParallelSearch_Qwen2.5-7b-instruct_ppo
8B • Updated
• 2
TreezzZ/Ferret_Search-R1_Qwen2.5-3b-instruct_ppo
3B • Updated
• 42
TreezzZ/ExpandSearch-3b-instruct-Squeezer-LLaMA4-Maverick
3B • Updated
TreezzZ/ParallelSearch-7b-base
8B • Updated
• 1 • 1
TreezzZ/ParallelSearch-7b-instruct
8B • Updated
• 1 • 2