Yazan Agha-Schrader PRO

phi0112358

AI & ML interests

Brain, EEG, BCI, consciousness, autism, octopus, automation, a.i., etymology, numbers, spirituality, astronomy

Recent Activity

upvoted a collection 5 days ago

🔮 Mixture of Experts

liked a model 5 days ago

Qwen/Qwen3.5-35B-A3B

liked a dataset 5 days ago

nebius/SWE-rebench

View all activity

Organizations

upvoted a collection 5 days ago

🔮 Mixture of Experts

Collection

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 13 items • Updated Aug 16, 2024 • 24

upvoted a collection 7 days ago

Qwen3 Voice Embedding

Collection

Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B). • 4 items • Updated 3 days ago • 26

upvoted an article 7 days ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

10 days ago

•

471

upvoted 2 articles 3 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

605

Article

Norm-Preserving Biprojected Abliteration

Nov 6, 2025

•

upvoted 2 collections 6 months ago

💧 LFM2

Collection

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 31 items • Updated 6 days ago • 145

Multimodal GGUFs

Collection

Vision and audio models compatible with llama-server and llama-mtmd-cli • 16 items • Updated Dec 18, 2025 • 18

upvoted a collection 7 months ago

Draft Models

Collection

Tiny "draft" models for speculative decoding. • 36 items • Updated Oct 29, 2025 • 6

upvoted a paper 7 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 151

upvoted a collection 8 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 183

upvoted 2 papers 8 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 141

Turning large language models into cognitive models

Paper • 2306.03917 • Published Jun 6, 2023 • 5

upvoted 2 collections 9 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 76 items • Updated 5 days ago • 402

Granite Quantized Models

Collection

Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 44 items • Updated Nov 21, 2025 • 32

upvoted 2 collections 10 months ago

Text-to-Speech (TTS) models

Collection

A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now! • 16 items • Updated 6 days ago • 26

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 6 days ago • 261

upvoted a collection 11 months ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 216

upvoted a collection about 1 year ago

GGUF LoRA adapters

Collection

Adapters extracted from fine tuned models, using mergekit-extract-lora • 16 items • Updated Dec 16, 2025 • 4

upvoted 2 collections over 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Dec 31, 2025 • 696

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

Yazan Agha-Schrader PRO

AI & ML interests

Recent Activity

Organizations

phi0112358's activity

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

We Got Claude to Fine-Tune an Open Source LLM

Norm-Preserving Biprojected Abliteration