ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-4096-SFT-Tulu3-decontaminated Text Generation • 2B • Updated about 9 hours ago
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-SFT-Tulu3-decontaminated Text Generation • 2B • Updated about 9 hours ago
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-SFT-Tulu3-decontaminated Text Generation • 0.4B • Updated about 9 hours ago
ali-elganzory/1.7b-Comma0.1-300BT-longsft_16k Text Generation • 2B • Updated about 19 hours ago • 146
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-4096-longsft_16k Text Generation • 2B • Updated about 19 hours ago • 142 • 1
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated about 20 hours ago • 187
ali-elganzory/Baguettotron-SFT-Tulu3-decontaminated Text Generation • 0.3B • Updated about 21 hours ago • 198
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16384-rope_theta-1M-long_sft_16k Text Generation • 2B • Updated 3 days ago • 178
ali-elganzory/ablation-model-fineweb-edu-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Feb 1 • 2
ali-elganzory/ablation-model-fineweb-edu-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 31 • 2
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k Feature Extraction • 2B • Updated Jan 26 • 1
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 4
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated Jan 26 • 1
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated Jan 26 • 1
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 3
ali-elganzory/1.7b-MixtureVitae-100BT-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 2
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 2