Text Generation
GGUF
English
qwen2
legal
indian-law
BNS
BNSS
BSA
criminal-law
qwen
qwen2.5
llama.cpp
ollama
qlora
unsloth
domain-adaptation
instruction-tuning
question-answering
law
india
conversational
Instructions to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF", filename="qwen2.5-1.5b-instruct.Q4_K_M.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh # Start a local OpenAI-compatible server with a web UI: llama serve -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M # Run inference directly in the terminal: llama cli -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama serve -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M # Run inference directly in the terminal: llama cli -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Use Docker
docker model run hf.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
- Ollama
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Ollama:
ollama run hf.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
- Unsloth Studio
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF to start chatting
- Pi
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Run Hermes
hermes
- Atomic Chat new
- Docker Model Runner
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Docker Model Runner:
docker model run hf.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
- Lemonade
How to use GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.Indian-Legal-Qwen2.5-1.5B-GGUF-Q4_K_M
List all available models
lemonade list
File size: 7,223 Bytes
5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc 5348deb af16dcc | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 | ---
language:
- en
license: apache-2.0
base_model: unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit
tags:
- legal
- indian-law
- BNS
- BNSS
- BSA
- criminal-law
- qwen
- qwen2.5
- gguf
- llama.cpp
- ollama
- qlora
- unsloth
- domain-adaptation
- instruction-tuning
- question-answering
- law
- india
datasets:
- GSMS-B/Indian-Legal-QA-BNS-BNSS-BSA
pipeline_tag: text-generation
---
# βοΈπ Indian Legal Qwen 2.5 β 1.5B (GGUF)
<p align="center">
<img src="https://img.shields.io/badge/Base%20Model-Qwen%202.5%201.5B-6366F1?style=for-the-badge" alt="Base Model"/>
<img src="https://img.shields.io/badge/Type-GGUF%20Quantized-A855F7?style=for-the-badge" alt="Type"/>
<img src="https://img.shields.io/badge/Domain-Indian%20Criminal%20Law-DC2626?style=for-the-badge" alt="Domain"/>
<img src="https://img.shields.io/badge/Method-QLoRA-2563EB?style=for-the-badge" alt="Method"/>
<img src="https://img.shields.io/badge/Acts-BNS%20%7C%20BNSS%20%7C%20BSA-16A34A?style=for-the-badge" alt="Acts"/>
<img src="https://img.shields.io/badge/License-Apache%202.0-F59E0B?style=for-the-badge" alt="License"/>
</p>
> π‘ **This is the GGUF-quantized version** β for CPU inference via Ollama or llama.cpp. For full-precision inference see the [Merged Model](https://huggingface.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B) Β· For lightweight adapter loading see the [Adapter](https://huggingface.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter).
---
## π Model Description
**Indian Legal Qwen 2.5 β 1.5B (GGUF)** is a quantized, CPU-friendly version of [`GSMS-B/Indian-Legal-Qwen2.5-1.5B`](https://huggingface.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B), a domain-adapted model fine-tuned using QLoRA on a structured question-answer dataset covering all **1,059 sections** of India's three landmark 2023 criminal justice reform acts:
| Act | Full Name | Replaces | Sections |
|---|---|---|---|
| π **BNS 2023** | Bharatiya Nyaya Sanhita | IPC 1860 | 358 |
| π **BNSS 2023** | Bharatiya Nagarik Suraksha Sanhita | CrPC 1973 | 531 |
| π **BSA 2023** | Bharatiya Sakshya Adhiniyam | Indian Evidence Act 1872 | 170 |
Trained on **6,354 instruction-format QA pairs** β 6 question types per section covering definitions, scenarios, legal elements, exceptions, and consequences β giving it broad, structured coverage of India's reformed criminal law framework. As the smallest model in the family, this GGUF build is ideal for fast, fully offline CPU inference.
---
## π Model Family β Qwen 2.5 1.5B
| Variant | Repo | Best For |
|---|---|---|
| π’ **Merged** | [GSMS-B/Indian-Legal-Qwen2.5-1.5B](https://huggingface.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B) | Out-of-the-box inference, Gradio / API deployment |
| π΅ **LoRA Adapter** | [GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter](https://huggingface.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter) | Lightweight loading on top of base model |
| π‘ **GGUF (this repo)** | `GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF` | CPU inference via Ollama / llama.cpp |
---
## π Quick Start
### π» Run with Ollama
```bash
ollama run hf.co/GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF
```
### βοΈ Run with llama.cpp
```bash
./llama-cli \
-hf GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF \
-p "What is a Zero FIR under BNSS 2023?" \
-n 300 \
--temp 0.1
```
### π Run with llama-cpp-python
```python
from llama_cpp import Llama
llm = Llama.from_pretrained(
repo_id="GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF",
filename="*.gguf",
)
SYSTEM = "You are an expert legal assistant specializing in Indian criminal law β BNS, BNSS, and BSA 2023."
response = llm.create_chat_completion(
messages=[
{"role": "system", "content": SYSTEM},
{"role": "user", "content": "What is a Zero FIR under BNSS 2023?"}
],
temperature=0.1,
max_tokens=300
)
print(response["choices"][0]["message"]["content"])
```
---
## π― Recommended Use Cases
> β οΈ **Important Note:** This model has been domain-adapted on structured QA data and works best as a **component in a larger pipeline** rather than a standalone answer engine. Direct usage without retrieval context may produce incomplete or imprecise answers on complex legal queries.
### β
Where this model excels
| Use Case | π‘ How to Use |
|---|---|
| π **RAG Pipeline** | Pair with a BM25 or vector retriever over BNS/BNSS/BSA texts; feed retrieved sections as context for grounded, citation-backed answers |
| π€ **Legal Chatbot Backend** | Use as the generation backbone of a legal assistant app with a ChromaDB / FAISS document store |
| π **Legal Education Tool** | Build interactive Q&A apps for law students and practitioners learning the 2023 criminal justice reforms |
| π **Section Lookup Assistant** | Combine with a section index to surface the exact BNS / BNSS / BSA provision relevant to a given situation |
| π» **Offline / Edge Deployment** | Smallest model in the family, runnable on consumer CPUs without a GPU β ideal for local apps, kiosks, or low-resource environments |
| π **Structured Legal Summarization** | Summarize individual sections when the section text is supplied as input context |
| ποΈ **Legal NLP Research** | Benchmark Indian criminal law understanding across model families (Qwen vs Llama) |
| βοΈ **Comparative Law Analysis** | Highlight differences between old acts (IPC/CrPC/IEA) and their 2023 replacements |
### β Not recommended for
- Standalone legal advice without a retrieval component
- High-stakes legal decisions without qualified human review
- Jurisdictions or acts outside BNS / BNSS / BSA 2023
---
## ποΈ Training Details
| Property | Value |
|---|---|
| π€ Base model | `unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit` |
| π§ Fine-tuning method | QLoRA |
| ποΈ LoRA rank | 64 |
| ποΈ LoRA alpha | 128 |
| π§© Target modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj |
| π Training data | 6,354 QA pairs β 1,059 sections Γ 6 question types |
| π Epochs | 3 |
| π¦ Batch size (per device) | 4 |
| π Learning rate | 2e-4 |
| βοΈ Optimizer | adamw_8bit |
| π» Hardware | Google Colab T4 GPU |
| π οΈ Framework | Unsloth + TRL SFTTrainer |
| π¬ Prompt format | ChatML |
| ποΈ Quantization | GGUF (converted from merged FP16 model) |
---
## π Training Dataset
| π Dataset | π Link |
|---|---|
| Indian Legal QA β BNS + BNSS + BSA 2023 | [GSMS-B/Indian-Legal-QA-BNS-BNSS-BSA](https://huggingface.co/datasets/GSMS-B/Indian-Legal-QA-BNS-BNSS-BSA) |
**6 question types per section:**
`definitional_topic` Β· `definitional_section` Β· `scenario` Β· `elements` Β· `exceptions` Β· `consequence`
---
## π€ Author
**GSMS-B** β Bugatha Ganasyam Mani Sankar
π€ [Hugging Face Profile](https://huggingface.co/GSMS-B)
---
## β οΈ Disclaimer
This model is intended for **research and educational purposes only**. It does not constitute legal advice. Outputs should not be relied upon for any legal decision without review by a qualified legal professional. The model's responses reflect patterns in training data and may contain errors or omissions.
---
*β‘ Fine-tuned using [Unsloth](https://github.com/unslothai/unsloth) for training efficiency.* |