Yichen's picture

4

Yichen

YichenLLM

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

authored a paper 1 day ago

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

authored a paper 1 day ago

DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion

View all activity

Organizations

None yet

Papers 4

arxiv:2502.13842

arxiv:2412.05644

arxiv:2408.03675

arxiv:2406.06567

models 0

None public yet

datasets 0

None public yet