How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="markldn/pagestorm-research-preview-14b-full-book-Q8_0-GGUF",
	filename="",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

PageStorm Research Preview 14B Full Book โ€” GGUF

GGUF quantizations of Pageshift-Entertainment/pagestorm-research-preview-14b-full-book, a ministral3 model trained to produce a full novel from a single prompt via a staged generation pipeline.

Files

  • pagestorm-research-preview-14b-full-book-Q8_0.gguf (~14 GB)
  • pagestorm-research-preview-14b-full-book-Q4_K_M.gguf (~7.7 GB)

Requirements

  • A llama.cpp build whose runtime supports the mistral3 architecture (llm_build_mistral3 / LLM_ARCH_MISTRAL3). Older builds will fail to load it.

Notes

  • The Q8_0 file was converted with convert_hf_to_gguf.py --outtype q8_0.
  • The Q4_K_M file was quantized from a temporary BF16 GGUF exported from the original BF16 Hugging Face checkpoint, not requantized from Q8_0.
  • The source config.json needed original_max_position_embeddings changed from 16384.0 to integer 16384 so the converter could write the int rope KV field.
  • The model uses a staged protocol with custom role headers (<|start_header_id|>โ€ฆ<|stop_header_id|>) and <|eot_id|> as the stage stop token โ€” it is not a plain chat model. See the base model card and its story_stage_generation.py for the prompt protocol.
  • Native context is 262144; KV at that length is large โ€” quantize the KV cache (--cache-type-k q8_0 --cache-type-v q8_0) and/or cap --ctx-size to fit VRAM.

Attribution

Base model ยฉ Pageshift Entertainment, Apache-2.0. This repo only redistributes a quantized copy of those weights.

Downloads last month
606
GGUF
Model size
14B params
Architecture
mistral3
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for markldn/pagestorm-research-preview-14b-full-book-Q8_0-GGUF