mlx-lm v0.28.4 failure

#1
by Epistates - opened

This model looks really interesting for coding! I tried to use:
mlx_lm.convert --hf-path DavidAU/Qwen3-48B-A4B-Deadpan-Savant-12x-Closed-Open-Source-Distill --mlx-path /mypath/Qwen3-48B-A4B-Deadpan-Savant-12x-Closed-Open-Source-Distill

Normally, Qwen3MoeForCausalLM converts without issue, but this model fails with:

mlx/nn/layers/base.py", line 191, in load_weights
    raise ValueError(f"Missing {num_missing} parameters: \n{missing}.")
ValueError: Missing 1 parameters:
lm_head.weight.

https://github.com/ml-explore/mlx/blob/main/python/mlx/nn/layers/base.py#L188-L191
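For context, the check that raises here can be sketched roughly like this. This is a simplified stand-in for mlx's strict `load_weights` behavior, not the actual implementation: with tied embeddings, the checkpoint stores only the embedding matrix, so the separate `lm_head.weight` key the model expects is absent.

```python
def load_weights_strict(model_params, checkpoint):
    """Simplified sketch of a strict weight loader: every parameter the
    model defines must be present in the checkpoint, or we raise."""
    missing = [name for name in model_params if name not in checkpoint]
    if missing:
        raise ValueError(f"Missing {len(missing)} parameters: \n{missing}.")
    return {name: checkpoint[name] for name in model_params}

# The model defines a separate output head, but a tied checkpoint
# only ships the embedding matrix:
params = ["model.embed_tokens.weight", "lm_head.weight"]
ckpt = {"model.embed_tokens.weight": [[0.1, 0.2]]}  # no lm_head.weight key

try:
    load_weights_strict(params, ckpt)
except ValueError as e:
    print(e)  # reports lm_head.weight as the missing parameter
```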

Hey,

The head and embedding weights are tied on this MoE, which is the cause of this error.
I will attempt to "untie" them in the next version; however, all the expert models must also be "untied" too.

I don't know of a way to bypass this MLX issue.
If you find a way, please share, as this will help with other models too.
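One untested idea along those lines: if the conversion only needs the tensor to exist, and the tied head is just a copy of the embedding matrix, then materializing `lm_head.weight` from `model.embed_tokens.weight` in the checkpoint before converting might unblock it. The sketch below works on a plain dict; on a real checkpoint you would load and re-save the shards (e.g. with safetensors) and also set `tie_word_embeddings` to `false` in `config.json`. The key names are the standard ones from the traceback; anything beyond that is an assumption.

```python
def untie_lm_head(state_dict,
                  embed_key="model.embed_tokens.weight",
                  head_key="lm_head.weight"):
    """If the checkpoint lacks a separate lm_head.weight (tied embeddings),
    add one as a copy of the embedding matrix. Sketch only: assumes the
    tied head and the embedding share the same values."""
    if head_key not in state_dict and embed_key in state_dict:
        state_dict = dict(state_dict)  # leave the original dict untouched
        state_dict[head_key] = state_dict[embed_key]
    return state_dict

# Toy checkpoint with tied weights (no lm_head.weight entry):
ckpt = {"model.embed_tokens.weight": [[0.1, 0.2], [0.3, 0.4]]}
fixed = untie_lm_head(ckpt)
assert "lm_head.weight" in fixed
```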

Thanks,
David

Epistates changed discussion status to closed
