
Yichen

YichenLLM

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago
Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation
authored a paper 2 days ago
NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time
authored a paper 2 days ago
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion

Organizations

None yet

upvoted a paper about 20 hours ago

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

Paper • 2603.04971 • Published 2 days ago • 3
upvoted a paper 2 days ago

Mixture of Hidden-Dimensions Transformer

Paper • 2412.05644 • Published Dec 7, 2024 • 1
upvoted a paper 30 days ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published about 1 month ago • 262
upvoted a paper about 1 year ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11, 2025 • 40