TheStageAI/thewhisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated about 18 hours ago • 8.4k • 21
TheStageAI/Elastic-whisper-large-v3-turbo Automatic Speech Recognition • Updated about 18 hours ago • 300 • 2
TheStageAI/Elastic-whisper-large-v3 Automatic Speech Recognition • Updated about 18 hours ago • 196 • 2
Post: We thought it would be easier, but we have finally integrated cuDNN Paged Attention into our models!
Read the article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8
Llama-8B with cuDNN paged attention, including B200 support: TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with cuDNN paged attention, including B200 support: TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503
TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503 Text Generation • Updated Jan 15 • 9 • 3