FlashHead

updated about 11 hours ago

FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference

  • embedl/Qwen3-0.6B-FlashHead

    Updated Dec 18, 2025 • 7 • 4

  • embedl/gemma-3-270m-it-FlashHead

    Updated Dec 20, 2025 • 25 • 4

  • embedl/Qwen3-1.7B-FlashHead

    2B • Updated Dec 9, 2025 • 28 • 2

  • embedl/Llama-3.2-1B-Instruct-FlashHead

    1B • Updated Dec 16, 2025 • 24 • 4

  • embedl/Llama-3.2-3B-Instruct-FlashHead

    3B • Updated Dec 16, 2025 • 22 • 4

  • embedl/Llama-3.2-3B-Instruct-FlashHead-W4A16

    1B • Updated Dec 16, 2025 • 17 • 4

  • embedl/Llama-3.2-1B-Instruct-FlashHead-W4A16

    0.7B • Updated Dec 16, 2025 • 15 • 6

  • embedl/gemma-3-1b-it-FlashHead

    1.0B • Updated Dec 16, 2025 • 5 • 2

  • embedl/Qwen3-1.7B-FlashHead-W4A16

    0.8B • Updated Dec 16, 2025 • 1 • 3

  • embedl/gemma-3-1b-it-FlashHead-W4A16

    0.4B • Updated Dec 16, 2025 • 3