alanayu lee
alanayu
AI & ML interests
None yet
Organizations
None yet
请问一下,使用megatron微调Qwen3-Next时,设置--target_modules为"all-linear"能否训练到Qwen3NextGatedDeltaNet部分?
👀 2
#41 opened 3 months ago
by
alanayu
这个模型是不是还不能用VLLM推理?
🚀 1
#9 opened 6 months ago
by
alanayu
How to train the Qwen3-30B-A3B using Reinforcement Learning?
#34 opened 9 months ago
by
alanayu
Not compatible with transformers library
4
#8 opened 10 months ago
by
Xeenxavier007