RFTSystems: Agent Forensics Suite Collection Audit, prove, replay, and diff agent runsβturning βtrust meβ into verification. β’ 8 items β’ Updated Jan 18 β’ 3
Portuguese LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: β’ 18 items β’ Updated about 1 hour ago β’ 42
Growing Transformers:Layer-wise Expansion Comparative Study Collection Paper: 2507.07129 'Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate' (4.2.2, 5.2. βResults) β’ 8 items β’ Updated Jan 4 β’ 1
Qwen/Qwen3-Next-80B-A3B-Thinking-GGUF Text Generation β’ 80B β’ Updated Dec 3, 2025 β’ 2.36k β’ 29
view post Post 2302 We just released TRL v0.26.0!It comes packed with updates:> Agent training with tools in GRPO> New CISPO & SAPO losses + reasoning rewards> vLLM quantization in colocate mode> Dataset shuffling in SFT> Lots of NEW examples> Tons of fixes and documentation improvements See translation 3 replies Β· π€ 7 7 + Reply
view post Post 2585 Currently having a blast learning the transformers library.I noticed that model cards usually have Transformers code as usage examples.So I tried to figure out how to load a model just using the transformers library without using ollama, lmstudio, or llamacpp.Learned how to install dependencies required to make it work like pytorch and CUDA. I also used Conda for python environment dependencies. Once I got the model loaded and sample inference working, I made an API to serve it.I know it's very basic stuff for machine learning experts here in HF but I'm completely new to this so I'm happy to get it working!Model used: Qwen/Qwen3-VL-8B-InstructGPU: NVIDIA GeForce RTX 3090Here's the result of my experimentation See translation 6 replies Β· π 6 6 + Reply