CyberNative-AI/CreativeSocialAgent_GLM_4.7_Flash Text Generation • 31B • Updated about 16 hours ago • 1
GRPO would be dope! Btw, did we ever find out if diffusion LLMs learn from their output? Like understanding the context of an answer and applying it in reverse? Example: if A = B and B = C, does C = A (given that B = A)? I thought this was something diffusion LLMs improve at.