RetMask - a maym15 Collection

maym15 's Collections

updated 22 days ago

Trained checkpoints for the paper "From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models"