NEW RELEASE: it's here! Meet the newest member of the Valiant crew: Guardpoint, our new medical reasoning model!
- Trained on medical knowledge, management, diagnosis, and tasks from DeepSeek-V3.2-Speciale!
- Structured medical reasoning responses are efficient and informative, cutting token costs for faster inference!
- Wide-ranging knowledge base: trained on a broad variety of medical disciplines, patient types, and query structures!
- High-quality medical responses emphasize performance, brevity, specificity, statistical rationality, and openness.
Just sharing a result of a homelab infrastructure experiment:
I've managed to set up a distributed inference infrastructure at home using a DGX Spark (128GB unified memory) and a Linux workstation with an RTX 6000 Pro (96GB GDDR7), connected via 100Gbps RoCEv2. The model I used (https://lnkd.in/gx6J7YuB) is about 140GB, so it could not fit on either GPU alone. Full setup and tutorial coming soon on devquasar.com
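The memory math above can be sketched quickly: neither device holds the ~140GB of weights alone, but a pipeline-style split across both does. A back-of-envelope sketch below, assuming a proportional-to-memory layer split; the layer count (61) and helper names are illustrative, not taken from the actual model or the author's setup:

```python
# Sketch: why the ~140 GB model needs both machines, and a simple
# proportional layer split for pipeline-style distributed inference.
# Memory figures come from the post; the layer count is an assumption.

MODEL_GB = 140
DEVICES = {"dgx_spark": 128, "rtx6000_pro": 96}  # GB of memory per device

def fits(model_gb, mem_gb):
    """A model fits on a single device only if its weights fit in memory."""
    return model_gb <= mem_gb

def split_layers(n_layers, devices, model_gb):
    """Assign transformer layers to devices proportionally to their memory."""
    total = sum(devices.values())
    assert model_gb <= total, "model does not fit even across all devices"
    shares, assigned = {}, 0
    items = list(devices.items())
    for name, mem in items[:-1]:
        k = round(n_layers * mem / total)
        shares[name] = k
        assigned += k
    shares[items[-1][0]] = n_layers - assigned  # remainder goes to the last device
    return shares

print(any(fits(MODEL_GB, m) for m in DEVICES.values()))  # → False: no single GPU holds it
print(fits(MODEL_GB, sum(DEVICES.values())))             # → True: 224 GB combined is enough
print(split_layers(61, DEVICES, MODEL_GB))               # → {'dgx_spark': 35, 'rtx6000_pro': 26}
```

In practice the split also has to budget for KV cache and activations, so the usable share per device is somewhat smaller than raw memory; the sketch only covers the weight placement.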
First, our new Raiden-Mini dataset, powered by DeepSeek's newest deepseek-ai/DeepSeek-V3.2-Speciale model!
- A V3.2-Speciale reasoning showcase: the Raiden prompts test the model's creative, analytic, and general reasoning skills!
- HEAD TO HEAD: a comparison subset pits V3.2-Speciale against V3.2 on the same prompts, providing a direct look at each model's advantages!
On the model side, we've also brought Shining Valiant 3 to Ministral 3!
- Science reasoning: sequelbox/Celestia3-DeepSeek-R1-0528 for physics, biology, chemistry, compsci, astronomy, Earth science, and information theory.
- AI to build AI: the sequelbox/Mitakihara-DeepSeek-R1-0528 dataset for high-quality reasoning performance on AI, MLOps, math and CUDA, complex adaptive and agentic systems, cognition, logic, linguistics, simulation, knowledge management, and more!
- Creative reasoning and general chat performance supplemented with sequelbox/Raiden-DeepSeek-R1