ValueError: Missing 285 parameters

#1
by finding1 - opened

Loading this model with mlx-lm 0.31.3 I get ValueError: Missing 285 parameters. Has anyone else encountered this?

Yes same here

Found solution, PR #1410 is necessary for running the model not just converting it.

MLX Community org

@finding1 @EVCIA
Hi,

The released version of mlx-lm is not quite ready for this model. You will also need PR #1410 at https://github.com/ml-explore/mlx-lm/pull/1410 as this has not been merged yet.

The easiest would be to to download the python file for the GLM-5.2 model from https://github.com/pcuenca/mlx-lm/blob/glm-moe-dsa-indexer-sharing/mlx_lm/models/glm_moe_dsa.py, and copy it into the models folder of your mlx-lm installation. Let me know if you need more details.

Or else wait a few days until the pull request has been merged with mlx-lm.

Thank you!
If using oMLX App, is it possible to add the python file to that?

I'm going to try this too. it might be part of what I'm running into because I didn't do anything specifically to handle the load of this model. It's getting loaded through EXO's integration with mlx-lm.

MLX Community org

Thank you!
If using oMLX App, is it possible to add the python file to that?

@EVCIA

oMLX is still on my to-do list, so I don't know. Something new to explore...

bibproj changed discussion status to closed
MLX Community org

@EVCIA
Hi Chris

I see that oMLX already has this python file in their GitHub repo at https://github.com/jundot/omlx/tree/main/omlx/patches/glm_moe_dsa

oMLX seems quite up to date, even with non-merged patches like this one for GLM-5.2 and the one for MiniMax M3.

Sign up or log in to comment