6 16 217

Tripp Lyons

tripplyons

https://tripplyons.com

AI & ML interests

None yet

Recent Activity

liked a model about 4 hours ago

Qwen/Qwen3.5-35B-A3B

liked a model 9 days ago

Qwen/Qwen3.5-397B-A17B

liked a model 11 days ago

Edge-Quant/Nanbeige4.1-3B-Q4_K_M-GGUF

View all activity

Organizations

upvoted a collection about 1 year ago

DeepSeek R1 (All Versions)

Collection

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 1 day ago • 262

upvoted an article over 1 year ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14, 2024

•

103

upvoted a paper almost 2 years ago

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 20

upvoted a collection about 2 years ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Jul 10, 2025 • 63

upvoted a paper about 2 years ago

Diffusion Model with Perceptual Loss

Paper • 2401.00110 • Published Dec 30, 2023 • 13

upvoted 11 papers over 2 years ago

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 24

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 84

RMT: Retentive Networks Meet Vision Transformers

Paper • 2309.11523 • Published Sep 20, 2023 • 34

Tripp Lyons

AI & ML interests

Recent Activity

Organizations

tripplyons's activity

Model2Vec: Distill a Small Fast Model from any Sentence Transformer