Open to Collab

Ezzaldeen Mousa

ezzaldeen

AI & ML interests

Deep Cooking :)

Recent Activity

liked a model 4 days ago

declare-lab/TangoFlux

liked a Space 4 days ago

declare-lab/TangoFlux

liked a Space 4 days ago

fantaxy/Sound-AI-SFX

upvoted an article 23 days ago

Article

How I contributed a new model to the Transformers library using Codex

24 days ago • 48

upvoted 2 collections 9 months ago

🤗 SmolLM2 Automatic Essay Grading

Automatic Essay Grading - SmolLM2 • 15 items • Updated Jun 9, 2025 • 1

🪅 Qwen2.5 Automatic Essay Grading

Automatic Essay Grading - Qwen2.5 • 15 items • Updated Jun 9, 2025 • 1

upvoted a paper 11 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 129

upvoted 2 articles 12 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024 • 70

Article

Open R1: Update #3

Mar 11, 2025 • 297

upvoted 2 articles about 1 year ago

Article

Mixture of Experts Explained

Dec 11, 2023 • 1.12k

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 289

upvoted a collection about 1 year ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31, 2025 • 32

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025 • 887

upvoted an article over 1 year ago

Article

Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline:

Nov 30, 2024 • 28

upvoted a paper about 2 years ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23

upvoted a collection over 2 years ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in this space to help you get started with it! • 30 items • Updated Jun 12, 2024 • 253

upvoted 7 papers over 2 years ago

Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models

Paper • 2312.17661 • Published Dec 29, 2023 • 15

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264

Object Recognition as Next Token Prediction

Paper • 2312.02142 • Published Dec 4, 2023 • 13

Distributed Representations of Words and Phrases and their Compositionality

Paper • 1310.4546 • Published Oct 16, 2013 • 3

Efficient Estimation of Word Representations in Vector Space

Paper • 1301.3781 • Published Jan 16, 2013 • 8

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 60

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

Paper • 2012.13255 • Published Dec 22, 2020 • 5