Instructions to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="lewiswatson/Frame2KG-InternVL3.5-2b-JSON",
	filename="Frame2KG-InternVL3.5-2b-JSON.IQ4_XS.gguf",
)

llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": [
				{
					"type": "text",
					"text": "Describe this image in one sentence."
				},
				{
					"type": "image_url",
					"image_url": {
						"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
					}
				}
			]
		}
	]
)

Notebooks
Google Colab
Kaggle
Local Apps Settings

llama.cpp

How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with llama.cpp:

Install (macOS, Linux)

curl -LsSf https://llama.app/install.sh | sh
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Use Docker

docker model run hf.co/lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

LM Studio
Jan

vLLM

How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "lewiswatson/Frame2KG-InternVL3.5-2b-JSON"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lewiswatson/Frame2KG-InternVL3.5-2b-JSON",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker

docker model run hf.co/lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Ollama
How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with Ollama:
```
ollama run hf.co/lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
```

Unsloth Studio

How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for lewiswatson/Frame2KG-InternVL3.5-2b-JSON to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for lewiswatson/Frame2KG-InternVL3.5-2b-JSON to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for lewiswatson/Frame2KG-InternVL3.5-2b-JSON to start chatting

Atomic Chat new
Docker Model Runner
How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with Docker Model Runner:
```
docker model run hf.co/lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M
```

Lemonade

How to use lewiswatson/Frame2KG-InternVL3.5-2b-JSON with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull lewiswatson/Frame2KG-InternVL3.5-2b-JSON:Q4_K_M

Run and chat with the model

lemonade run user.Frame2KG-InternVL3.5-2b-JSON-Q4_K_M

List all available models

lemonade list

Frame2KG-InternVL3.5-2b-JSON / README.md

lewiswatson

Upload README.md

3c13b87 verified 17 days ago

preview code

Raw

History Blame Contribute Delete

3.75 kB

	---
	library_name: llama.cpp
	pipeline_tag: image-text-to-text
	tags:
	- gguf
	- llama.cpp
	- vision-language
	- frame2kg
	- knowledge-graph
	- JSON
	datasets:
	- lewiswatson/Frame2KG-YC2
	base_model: OpenGVLab/InternVL3_5-2B-HF
	---

	# Frame2KG-InternVL3.5-2b-JSON

	This repository contains GGUF quantised files for the Frame2KG fine-tuned InternVL3.5 2b model.

	These GGUF releases are optimized deployment variants of Frame2KG models. They are provided for practical inference use and may not exactly match the original weights, checkpoints, or evaluation configuration reported in the Frame2KG paper.

	## Model Details

	- Family: InternVL3.5
	- Size: 2b
	- Output format: JSON Frame2KG graph output
	- Base model: OpenGVLab/InternVL3_5-2B-HF
	- Model type: internvl
	- Architecture: InternVLForConditionalGeneration
	- Fine-tuning method: PEFT LoRA
	- LoRA rank: 8
	- LoRA alpha: 16
	- Trainable added token count: 0

	## Files

	\| File \| Size \| Notes \|
	\|---\|---:\|---\|
	\| `Frame2KG-InternVL3.5-2b-JSON.f16.gguf` \| 3.79 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.IQ4_XS.gguf` \| 1.10 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q2_K.gguf` \| 839.13 MB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q3_K_L.gguf` \| 1.06 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q3_K_M.gguf` \| 1023.52 MB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q3_K_S.gguf` \| 954.58 MB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q4_K_M.gguf` \| 1.19 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q4_K_S.gguf` \| 1.15 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q5_K_M.gguf` \| 1.37 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q5_K_S.gguf` \| 1.35 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q6_K.gguf` \| 1.56 GB \| model weights \|
	\| `Frame2KG-InternVL3.5-2b-JSON.Q8_0.gguf` \| 2.02 GB \| model weights \|
	\| `mmproj-Frame2KG-InternVL3.5-2b-JSON.f16.gguf` \| 896 B \| multimodal projector \|
	\| `mmproj-Frame2KG-InternVL3.5-2b-JSON.Q8_0.gguf` \| 896 B \| multimodal projector \|



	## Usage Notes

	These files are intended for llama.cpp-compatible GGUF runtimes.

	## Scope

	Frame2KG models are intended to convert image or frame content into structured graph-style outputs. The exact output format depends on the variant:

	- `JSON` variants target JSON-formatted Frame2KG output.
	- `CT` variants target compressed Frame2KG graph tokens.

	## Citation

	If you use this model in your work, please cite the paper:

	```bibtex
	@inproceedings{watson2026frame2kg,
	title = {Frame2KG: A Benchmark and Evaluation Toolkit for Interpretable Frame-to-Graph Generation},
	author = {Watson, Lewis N. and Strathearn, Carl and Mitchell, Kenny and Yu, Yanchao},
	booktitle = {Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)},
	month = {May},
	year = {2026},
	pages = {10912--10926},
	address = {Palma, Mallorca, Spain},
	publisher = {European Language Resources Association (ELRA)},
	editor = {Piperidis, Stelios and Bel, Núria and van den Heuvel, Henk and Ide, Nancy and Krek, Simon and Toral, Antonio},
	doi = {10.63317/4ys6kofrzoc5},
	url = {https://doi.org/10.63317/4ys6kofrzoc5}
	}
	```

	## Disclaimer

	This model is provided as is, without warranties or guarantees of any kind, either express or implied. The authors make no representations regarding the accuracy, reliability, safety, suitability, or performance of the model or its outputs.

	The model may generate incorrect, incomplete, or misleading results and should not be relied upon for critical, safety-sensitive, legal, medical, financial, or other high-stakes decisions. Use of this model is entirely at your own risk.

	The authors accept no liability for damages, losses, or consequences arising from use, misuse, or inability to use the model.