Llama 3.2 Evals Collection • This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.2 models, including the configurations • 4 items • Updated Dec 6, 2024 • 26
Llama 3.2 Collection • This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 657
Llama 3.3 Evals Collection • This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.3 models, including the configurations • 1 item • Updated Dec 6, 2024 • 23
Llama 3.3 Collection • This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 195
LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context Paper • 2412.17596 • Published Dec 23, 2024 • 6