Qwen3.5 Collection Qwen3.5 is Qwen's new model family, including Qwen3.5-35B-A3B, 27B, 122B-A10B, and 397B-A17B. • 20 items • Updated about 8 hours ago • 39
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI • Published 10 days ago • 471
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published 17 days ago • 53
CASA Collection CASA: Cross-Attention as Self-Attention for Efficient Vision-Language Fusion on long context streaming inputs • 6 items • Updated Dec 23, 2025 • 7
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 91
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published Sep 28, 2025 • 118
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24, 2025 • 42
SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published May 27, 2025 • 45
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing Paper • 2504.07964 • Published Apr 10, 2025 • 62
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11, 2025 • 130