Jelena Mitrović
Jecovit
AI & ML interests
NLP, LLMs
Recent Activity
upvoted an article 1 day ago
KV Caching Explained: Optimizing Transformer Inference Efficiency liked a dataset 8 days ago
mteb/WebFAQRetrieval upvoted an article 8 months ago
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval