MeroMero-31B Style Swap

An expirimental tensor swap merge targeting only one tensor: lm_head.weight
The merge consist of two models:

The theory behind this is that since Gryphe's tune touches what your typical fine-tune doesn't: meging through tensor swapping should be practically loseless.
It's also possible that it would make for good merging fodder post-FTing, but that's a theory for someone more knowledgeable in training to dive into.

If you're interested in Gryphe's tuning method, I'd suggest reading the model card of the tune.

Downloads last month
19
Safetensors
Model size
33B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Casual-Autopsy/G4-MeroMero-31B-StyleSwap

Merge model
this model
Quantizations
4 models