--- base_model: mistralai/Mistral-7B-Instruct-v0.3 library_name: peft pipeline_tag: text-generation tags: - lora - peft - transformers - mistral - grpo - molecule-optimization - cde license: mit --- # C-MORAL Mistral GRPO CDE Adapter LoRA adapter for molecule optimization trained with GRPO on the `cde` task. ## Base Model - `mistralai/Mistral-7B-Instruct-v0.3` ## Task - task alias: `cde` - property combination: `carc+drd2+herg` ## Method - algorithm: `GRPO` - adapter type: `LoRA` ## Load With PEFT ```python from transformers import AutoModelForCausalLM, AutoTokenizer from peft import PeftModel base_model_id = "mistralai/Mistral-7B-Instruct-v0.3" adapter_id = "Rwigle/C-MORAL-Mistral-GRPO" tokenizer = AutoTokenizer.from_pretrained(base_model_id) model = AutoModelForCausalLM.from_pretrained(base_model_id) model = PeftModel.from_pretrained(model, adapter_id, subfolder="cde") ```