-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43 -
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Paper • 2412.14161 • Published • 51 -
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments
Paper • 2408.10945 • Published • 10 -
PDFTriage: Question Answering over Long, Structured Documents
Paper • 2309.08872 • Published • 55
Ron Wolf
ron-wolf
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 hour ago
unsloth/Qwen3.5-397B-A17B-GGUF
liked
a model
about 1 hour ago
Qwen/Qwen3.5-397B-A17B
liked
a model
2 days ago
mradermacher/Magidonia-24B-v4.3-i1-GGUF
Organizations
None yet