AlphaSue

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Jupyter Agents: training LLMs to reason with notebooks

upvoted a paper 3 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

upvoted a paper 4 months ago

ReCode: Unify Plan and Action for Universal Granularity Control

Organizations

None yet

liked 3 models 11 months ago

liked a Space about 1 year ago

TxT360: Trillion Extracted Text

📖

133

Explore and download the TxT360 LLM pre‑training dataset

liked a model about 1 year ago

jinaai/ReaderLM-v2

Text Generation • 2B • Updated Mar 4, 2025 • 9.66k • 768

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.72k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset about 1 year ago

microsoft/RedStone

Updated Dec 5, 2024 • 15 • 35

liked a model about 1 year ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 9

liked a dataset about 1 year ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 1.22M • 107

liked 2 models over 1 year ago

nvidia/quality-classifier-deberta

Updated Sep 22, 2025 • 4.34k • 75

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • Updated Nov 16, 2023 • 634k • 174

liked a dataset over 1 year ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 8.12k • 799

liked a model over 1 year ago

Snowflake/snowflake-arctic-embed-m

liked a Space almost 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Generate a curated web‑text dataset for LLM training

liked 4 datasets almost 2 years ago

liwu/MNBVC

Updated Dec 3, 2025 • 136k • 590

togethercomputer/RedPajama-Data-1T

Viewer • Updated Jun 17, 2024 • 1.73M • 2.32k • 1.14k

allenai/dolma

Updated Apr 17, 2024 • 7.16k • 996

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 164k • 2.69k

liked a Space over 2 years ago

ControlNet V1.1

📉

1.18k

Generate images from sketches, edges, or poses

liked a model over 2 years ago

TheBloke/Llama-2-7B-Chat-GGML

Text Generation • Updated Sep 27, 2023 • 423 • 872