OpenResearcher/OpenResearcher-30B-A3B (Feature Extraction, 32B)
We collaborated with Hugging Face to enable you to train MoE models 12× faster with 35% less VRAM via our new Triton kernels (no accuracy loss). 🤗 Train gpt-oss locally on 12.8GB VRAM with our free notebooks: https://unsloth.ai/docs/new/faster-moe
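The post points to notebooks rather than spelling out the recipe, so below is a minimal sketch of what an Unsloth fine-tune of gpt-oss can look like. The model id, LoRA settings, and toy dataset are illustrative assumptions, not taken from the linked notebooks.

```python
# Minimal Unsloth fine-tuning sketch (assumed model id and hyperparameters).
from unsloth import FastLanguageModel
from datasets import Dataset
from trl import SFTConfig, SFTTrainer

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gpt-oss-20b",  # assumption: an Unsloth-hosted gpt-oss checkpoint
    max_seq_length=2048,
    load_in_4bit=True,  # 4-bit loading is what keeps VRAM in the ~12.8GB range
)

# Attach LoRA adapters so only a small fraction of the weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_alpha=16,
)

# Tiny placeholder dataset; substitute your own instruction data.
dataset = Dataset.from_dict({"text": ["### Question: What is 2+2?\n### Answer: 4"]})

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    args=SFTConfig(output_dir="outputs", per_device_train_batch_size=1, max_steps=30),
)
trainer.train()
```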
I just released NovaSR, a tiny 52 KB audio upsampler that can enhance 3600 seconds of muffled 16 kHz audio into clearer 48 kHz audio in just 1 second! NovaSR can:
- Enhance TTS model quality.
- Restore poor-quality datasets.
- Work on any device (at 52 KB it is smaller than a 3-second audio file!).
Model: YatharthS/NovaSR
Space to try it: YatharthS/NovaSR
GitHub repo: https://github.com/ysharma3501/NovaSR
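Since the post does not show the API, here is a purely hypothetical usage sketch of a 16 kHz to 48 kHz upsampler like NovaSR; the TorchScript loading and the (1, T) to (1, 3T) forward signature are my assumptions, so check the GitHub repo for the real interface.

```python
# Hypothetical inference sketch for a tiny speech upsampler (assumed API).
import torch
import torchaudio

waveform, sr = torchaudio.load("muffled.wav")  # e.g. mono 16 kHz speech
assert sr == 16000, "model expects 16 kHz input"

model = torch.jit.load("novasr.pt").eval()  # assumption: a TorchScript checkpoint

with torch.inference_mode():
    enhanced = model(waveform)  # assumption: maps (1, T) at 16 kHz to (1, 3T) at 48 kHz

torchaudio.save("enhanced.wav", enhanced, 48000)
```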
Agentic capability is the new battleground 🔥 LongCat-Flash-Thinking-2601 is the latest reasoning model from Meituan LongCat:
✨ MoE: 560B total / 27B active parameters
✨ MIT license
✨ Agentic tool use
✨ Multi-environment RL
✨ Parallel + iterative reasoning
meituan-longcat/LongCat-Flash-Thinking-2601
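To make "agentic tool use" concrete, here is one common way such a model is exercised through an OpenAI-compatible endpoint (for example a vLLM server); the endpoint URL and the get_weather tool are illustrative assumptions, not from the announcement.

```python
# Sketch of a tool-use request against an OpenAI-compatible server (assumed setup).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")  # assumed local server

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool for illustration
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

resp = client.chat.completions.create(
    model="meituan-longcat/LongCat-Flash-Thinking-2601",
    messages=[{"role": "user", "content": "What's the weather in Beijing right now?"}],
    tools=tools,
)

# The model either answers directly or emits a tool call for the client to execute.
msg = resp.choices[0].message
print(msg.tool_calls or msg.content)
```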
You can now do reinforcement learning training with 7× longer context and no accuracy loss, via our new batching algorithms. Long reasoning chains in RL are costly, but now we enable you to train gpt-oss with GRPO and reach 380K context on a 192GB GPU. Blog: https://unsloth.ai/docs/new/grpo-long-context
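For readers new to GRPO, a minimal TRL-style run looks roughly like the sketch below; the toy dataset, length-based reward, and the modest max_completion_length are illustrative assumptions, not the 380K-context recipe from the blog.

```python
# Minimal GRPO sketch with TRL (toy reward and assumed model id).
from datasets import Dataset
from trl import GRPOConfig, GRPOTrainer

train_dataset = Dataset.from_dict({"prompt": ["Solve step by step: 12 * 7 = ?"]})

def reward_len(completions, **kwargs):
    # Toy reward preferring shorter completions; real setups score correctness.
    return [-float(len(c)) for c in completions]

trainer = GRPOTrainer(
    model="unsloth/gpt-oss-20b",  # assumption: an Unsloth-hosted gpt-oss checkpoint
    reward_funcs=reward_len,
    args=GRPOConfig(
        output_dir="grpo-outputs",
        num_generations=4,              # GRPO group size for relative advantages
        per_device_train_batch_size=4,  # must be divisible by num_generations
        max_completion_length=4096,     # long reasoning chains are where batching gains land
    ),
    train_dataset=train_dataset,
)
trainer.train()
```

GRPO scores groups of sampled completions against each other, so the reward only needs to rank completions for the same prompt rather than be calibrated absolutely.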