eshmoideas's Collections: Training
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper
• 2509.02479
• Published • 84
scikit-learn/sklearn-transformers
Text Classification
• Updated • 25
keras-io/swin-transformers
Image Classification
• Updated • 20
• 4
keras-io/structured-data-classification-grn-vsn
Tabular Classification
• Updated • 28
• 9
keras-io/timeseries_transformer_classification
Time Series Forecasting
• Updated • 20
• 14
nvidia/Llama-4-Maverick-17B-128E-Eagle3
Updated • 47
• 10
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
• 397B • Updated • 6.9k
• 44
EnvX: Agentize Everything with Agentic AI
Paper
• 2509.08088
• Published • 8
MachineLearningLM/MachineLearningLM-7B-v1
Text Generation
• 8B • Updated • 20
• 14
mradermacher/MachineLearningLM-7B-v1-GGUF
8B • Updated • 118
• 5
nvidia/DirectDiscriminativeOptimization
Text Classification
• 73B • Updated • 105
• 11
Qwen/WorldPM-72B-UltraFeedback
Text Classification
• 73B • Updated • 1.66k
• 8
Qwen/WorldPM-72B-HelpSteer2
Text Classification
• 73B • Updated • 1.19k
• 11
Text Classification
• 73B • Updated • 41
• 82