Update README.md

01befd9 verified about 2 months ago

5.07 kB

	---
	license: gemma
	base_model: google/gemma-4-e2b-it
	datasets:
	- junwatu/indonesian-recipes
	language:
	- id
	tags:
	- text-generation
	- recipes
	- indonesian
	- cooking
	- gemma
	- gemma-4
	- conversational
	pipeline_tag: text-generation
	---

	# Resep Masakan Indonesia Gemma 4

	A fine-tune of `google/gemma-4-e2b-it` that writes Indonesian recipes the way Indonesian home cooks actually write them — short, structured, and with the right bumbu.

	## What this model is for

	Give it a dish title in Bahasa Indonesia, get back a recipe in the standard Indonesian home-cook format:

	```
	Bahan:
	- 500 gr ikan kering
	- 5 siung bawang merah
	- 3 siung bawang putih
	- 1 buah tomat
	- ...

	Langkah:
	1. Goreng ikan kering hingga matang dan kering.
	2. Tumis bumbu halus hingga harum...
	6. Sajikan hangat.
	```

	## What it does better than the base Gemma 4 E2B

	I fine-tuned the base model on 66K real Indonesian recipes (mainly home cooking — ayam, sapi, ikan, kambing, tahu, tempe, telur, udang). Compared to the stock `google/gemma-4-e2b-it`:

	\| \| Stock Gemma 4 E2B \| This model \|
	\|---\|---\|---\|
	\| Recipe format consistency \| Mostly OK \| Always Bahan + Langkah \|
	\| Recipe length vs real recipes \| ~50% too long, rambles \| Matches real recipes — concise \|
	\| Knows Indonesian recipe vocabulary (`tumis`, `koreksi rasa`, `Sajikan`) \| Sometimes uses unnatural phrasing \| Uses the natural words home cooks use \|
	\| Common Indonesian dishes (Rendang, Pepes, Tumis, Sambal) \| Often misses key ingredients \| Better ingredient profiles for common dishes \|
	\| Indonesian fluency \| Good (5/5) \| Good (~4.6/5 — small artifact rate) \|

	Overall on a 50-recipe blind eval against real home-cook references: +12% quality vs the stock base.

	## What it does NOT do well

	Be honest with yourself about what to expect:

	- Regional / less-common dishes (e.g. Tahu Thek, Sosis Solo, Tempe Mendoan, Cabe Gendot) — the model will give you a generic recipe that ignores the specific dish identity. The training data didn't have enough examples.
	- Special diet modifiers (MPASI baby food, DEBM low-carb diet, etc.) — it ignores these qualifiers and gives a normal recipe.
	- Defining secret ingredients — Padang dishes might be missing santan; "kuning" dishes might miss kunyit/asam jawa.

	For these cases the base Gemma 4 isn't better either — they're hard problems that need either a much larger model or a curated specialty dataset.

	## How to use

	This model needs one important inference setting to work properly: `no_repeat_ngram_size=6`. Without it, the model can get stuck repeating ingredient lines on long recipes. This setting is a single inference-time argument; everything else works like any standard Gemma chat model.

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	import torch

	model_id = "junwatu/gemma-4-e2b-resep-id" # adjust to actual repo
	tok = AutoTokenizer.from_pretrained(model_id)
	model = AutoModelForCausalLM.from_pretrained(
	model_id,
	dtype=torch.bfloat16,
	device_map="auto",
	)

	messages = [{
	"role": "user",
	"content": (
	'Tulis resep masakan Indonesia berjudul: "Tumis Kangkung Tempe".\n'
	"Format jawaban:\n"
	"Bahan:\n- (daftar bahan, satu per baris)\n\n"
	"Langkah:\n1. (langkah pertama)\n2. (langkah kedua)\n...\n"
	"Gunakan Bahasa Indonesia."
	),
	}]
	inputs = tok.apply_chat_template(
	messages, add_generation_prompt=True,
	return_tensors="pt", return_dict=True,
	).to(model.device)

	out = model.generate(
	**inputs,
	max_new_tokens=1024,
	do_sample=False,
	repetition_penalty=1.05,
	no_repeat_ngram_size=6, # ← important
	pad_token_id=tok.eos_token_id,
	)
	print(tok.decode(out[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
	```

	## Training summary (for the curious)

	- Base: `google/gemma-4-e2b-it` (4.6B effective params, vision/audio paths frozen during training)
	- Data: ~66K Indonesian home recipes (HF parquet)
	- Method: Supervised fine-tune, prompt + completion format, 1 epoch
	- Compute: AMD MI300X (192 GB HBM3), ~1.5 hours, bf16
	- Final eval loss: 1.26

	Full technical details in the project repo.

	## Limitations & honest disclosures

	- This is a single-epoch run on a budget. Quality could be higher with more compute and better data curation.
	- Model occasionally produces odd fraction notation like `1/n sdt` — these are minor inference-time glitches from the n-gram repetition guard. Easy to clean up if you're using the output programmatically.
	- Some recipes will be plausible but not authentic to a specific region. Use it as a starting point, not a definitive cookbook.
	- License: inherits the [Gemma Terms of Use](https://ai.google.dev/gemma/terms) from the base model.

	## Acknowledgements

	- Base model: [google/gemma-4-e2b-it](https://huggingface.co/google/gemma-4-e2b-it)
	- Dataset: Indonesian home recipe corpus (cleaned/curated)
	- Compute: AMD Instinct MI300X

	If you find this useful or have suggestions, open an issue on the model page.