PhysiQuanty PRO
PhysiQuanty
AI & ML interests
Theoretical Physics, Meta Deep Learning
Recent Activity
reacted to Shrijanagain's post with 🔥 about 8 hours ago
Surya-1.1T: Scaling Beyond Human-Level Reasoning via 146 Trillion Token Pre-training
Author: Shrijan Kumar Tiwari
Affiliation: SKT AI Labs / Project Surya
Model Architecture: Optimized Dense Transformer
Parameters: 1.1 Trillion
Training Tokens: 146 Trillion
Wanna collaborate us Friends let's Start Journey we have Collected 146 trillon tokens and done pre training but we need to made more powerfull
Whitepaper - https://github.com/SHRIJANAGAIN/PROFF reacted to Shrijanagain's post with ❤️ about 8 hours ago
Surya-1.1T: Scaling Beyond Human-Level Reasoning via 146 Trillion Token Pre-training
Author: Shrijan Kumar Tiwari
Affiliation: SKT AI Labs / Project Surya
Model Architecture: Optimized Dense Transformer
Parameters: 1.1 Trillion
Training Tokens: 146 Trillion
Wanna collaborate us Friends let's Start Journey we have Collected 146 trillon tokens and done pre training but we need to made more powerfull
Whitepaper - https://github.com/SHRIJANAGAIN/PROFF upvoted a paper about 12 hours ago
Quantifying the Carbon Emissions of Machine Learning