How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="Ba2han/Qwen3-30B-A3B-Geminized-v0.2",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Use "You are an assistant with reasoning capabilities." system message to consistently trigger gemini-style thinking.

I'm working on improving the dataset & model and will release a new, full version.

Training Dataset

  • The fine-tuning dataset consists of ~450 diverse examples, 250 of which are directly from Gemini 2.5 Pro.

Trained on:

  • Unsloth version of Qwen3-30B-A3B (instruct).
  • 32k seq_len with examples ranging from 1k to ~20k tokens.
  • Up to 2 turns of conversations.

  • No benchmark data for now.

Keep in mind that it's slightly overfit since the training dataset was quite small. The model can be used to create more high quality examples for further training.

image/png

Downloads last month
16
GGUF
Model size
31B params
Architecture
qwen3moe
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Ba2han/Qwen3-30B-A3B-Geminized-v0.2

Quantized
(119)
this model

Collection including Ba2han/Qwen3-30B-A3B-Geminized-v0.2