30b model

by breawerawer - opened Nov 19, 2022

Nov 19, 2022

Hi, I have been able to get the 125m and 6.7b model versions to run with nothing more than the sample code in the README. However, the 30b model gives errors such as:

KeyError: 'decoder.layers.13.self_attn_layer_norm.weight'
KeyError: 'decoder.layers.31.self_attn.q_proj.bias'
KeyError: 'decoder.layers.27.fc1.bias'

Is this something silly I am doing wrong, or is this a bug?

Thanks so much!

lukaemon

Nov 21, 2022

Facing the same problem right now.

Jackmin108

Nov 22, 2022

Same. Let's continue the discussion here:
https://huggingface.co/facebook/galactica-30b/discussions/4#637c8606d55081513c5679ef

The more hashsums of the blobs we have the better :)
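In case it helps anyone collect those, here is a minimal sketch for computing SHA-256 sums of locally downloaded weight files. The helper names (`sha256_of_file`, `hash_checkpoint_dir`) and the `*.bin` pattern are my own assumptions for illustration, not something from the model repo, so adjust the pattern to whatever files you actually have on disk.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 of a file, read in chunks to keep memory low."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read until f.read() returns the empty bytes sentinel.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def hash_checkpoint_dir(directory, pattern="*.bin"):
    """Map each matching file name in `directory` to its SHA-256 hex digest.

    `pattern` is an assumption; change it to match your shard file names.
    """
    return {
        p.name: sha256_of_file(p)
        for p in sorted(Path(directory).glob(pattern))
        if p.is_file()
    }
```

Calling `hash_checkpoint_dir` on the folder that holds your downloaded shards gives a dict you can paste here. If two people get different sums for the same shard, that would point to a corrupted or incomplete download rather than a problem in the loading code.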
