--- license: apache-2.0 base_model: Qwen/Qwen3-8B tags: - limo - mathematical-reasoning - full-precision language: - en pipeline_tag: text-generation --- # LIMO-Qwen3-8B-Math-Full-Precision **Full-precision** (bfloat16) merged model trained with LIMO methodology. ## Model Details - **Size**: ~16GB (full precision) - **Base**: Qwen/Qwen3-8B - **Training**: LIMO dataset (817 samples) - **Method**: LoRA → Full merge ## Usage ```python from transformers import AutoModelForCausalLM, AutoTokenizer model = AutoModelForCausalLM.from_pretrained( "Cbgcbg/limo-qwen3-8b-math-full-precision", torch_dtype=torch.bfloat16, device_map="auto" ) tokenizer = AutoTokenizer.from_pretrained("Cbgcbg/limo-qwen3-8b-math-full-precision") ``` ## Comparison - Original Gasing: 15.26 GB ✅ - Previous LIMO: 5.55 GB ❌ (quantized) - **This model: ~16 GB ✅ (full precision)**