Pradheep1647/smollm2-135m-instruct-paper-cited-chunks-v1-sft-lr2e-5-ep8-lora32a64-seq4096-mbs8 Updated 8 days ago
Pradheep1647/gpt-oss-20b-paper-cited-chunks-v1-sft-lr8e-6-ep4-lora16a32-seq2048-mbs1 Updated 7 days ago
Pradheep1647/smollm2-135m-instruct-paper-cited-chunks-v1-sft-lr2e-5-ep8-lora32a64-seq4096-mbs8-merged 0.1B • Updated 7 days ago • 19
Pradheep1647/gpt-oss-20b-paper-cited-chunks-v1-sft-lr8e-6-ep4-lora16a32-seq2048-mbs1-adapter Updated 7 days ago
Pradheep1647/gpt-oss-20b-paper-preference-150k-v1-dpo-lr5e-6-ep1-beta0-1-lora16a32-seq1024 Updated 6 days ago
Pradheep1647/smollm2-135m-instruct-paper-preference-150k-v1-dpo-lr5e-6-ep1-beta0-1-lora16a32-seq1024 Updated 6 days ago
Pradheep1647/gpt-oss-20b-paper-preference-150k-v1-sft-dpo-lr5e-6-ep1-beta0-1-lora16a32-seq1024 Updated 5 days ago
Pradheep1647/smollm2-135m-instruct-paper-preference-150k-v1-sft-dpo-lr5e-6-ep1-beta0-1-lora16a32-seq1024 Updated 5 days ago