mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1,800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance. • 16 items • Updated Sep 9, 2025 • 53
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 33 items • Updated Mar 2 • 59
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 19 days ago • 218
Falcon Edge series Collection A series of powerful, universal, and fine-tunable small language models. • 7 items • Updated Nov 6, 2025 • 25
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 191
Qwen3 Collection Qwen's new Qwen3 models, in Unsloth Dynamic 2.0, GGUF, 4-bit, and 16-bit Safetensors formats. Includes 128K context-length variants. • 70 items • Updated 5 days ago • 269
BitNet Collection 🔥 BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1, 2025 • 63
Granite Experiments Collection Experimental projects under consideration for the Granite family. • 26 items • Updated 5 days ago • 16
Granite 3.3 Language Models Collection Language models with improved reasoning and instruction-following capabilities. • 4 items • Updated 20 days ago • 45
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization Paper • 2502.02631 • Published Feb 4, 2025 • 4
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 170