eshmoideas 's Collections Training
updated
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn
Tool-Integrated Reasoning
Paper
• 2509.02479
• Published
• 84
scikit-learn/sklearn-transformers
Text Classification
• Updated
• 25
keras-io/swin-transformers
Image Classification
• Updated
• 19
• 4
keras-io/structured-data-classification-grn-vsn
Tabular Classification
• Updated
• 26
• 9
keras-io/timeseries_transformer_classification
Time Series Forecasting
• Updated
• 14
• 13
nvidia/Llama-4-Maverick-17B-128E-Eagle3
Updated
• 11
• 9
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
• 397B • Updated
• 15.9k
• 42
EnvX: Agentize Everything with Agentic AI
Paper
• 2509.08088
• Published
• 8
MachineLearningLM/MachineLearningLM-7B-v1
Text Generation
• 8B • Updated
• 54
• 34
mradermacher/MachineLearningLM-7B-v1-GGUF
8B • Updated
• 338
• 5
nvidia/DirectDiscriminativeOptimization
Text Classification
• 73B • Updated
• 23
• 10
Qwen/WorldPM-72B-UltraFeedback
Text Classification
• 73B • Updated
• 1.25k
• 7
Qwen/WorldPM-72B-HelpSteer2
Text Classification
• Updated
• 816
• 10
Text Classification
• 73B • Updated
• 47
• 81