---
license: apache-2.0
base_model: Qwen/Qwen3-4B
tags:
- coreai
- apple
- aimodel
- apple-silicon
---

# Qwen3-4B — official Apple Core AI exports

Pre-converted **`.aimodel` bundles produced, unmodified, by Apple's official
[coreai-models](https://github.com/apple/coreai-models) export recipe**,
published together with the exact export environment, artifact hashes, and
measured performance. The recipe's export commands for this model:

```bash
uv run coreai.llm.export qwen3-4b              # macOS
uv run coreai.llm.export qwen3-4b --platform iOS
```

## Why pre-converted bundles?

1. **Conversion needs a Mac with a lot of RAM** (the 20B export was done on a
   128 GB machine); running a bundle only needs enough RAM to mmap the artifact.
2. **An `.aimodel` is a build artifact, not a pure function of the recipe.** The
   same export command produced a 2.2× slower artifact across the macOS 26 → 27β
   boundary ([forensics](https://github.com/john-rocky/apple-silicon-llm-bench/blob/main/methodology/coreai-export-lowering.md)).
   Hosted artifacts plus their hashes are the reproducible ground truth; every
   bundle here is exactly the one measured in
   [apple-silicon-llm-bench](https://github.com/john-rocky/apple-silicon-llm-bench).

## Bundles & integrity

| Bundle | Contents | SHA-256 (`main.mlirb`) |
|---|---|---|
| `macos/` | macOS dynamic, int4 | `e9d1e9e49e70e7abe0c9a24062603a827483ed6ffb9ac4f244c553156c8ce5e7` |
| `ios/` | iOS static ctx4096, mixed 4/8-bit palettized | `a1c45800117f2a0a00df5cd4196485a7ea6bc9d894b3d4cac3f3aa032a1c0a89` |
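
Because the hashes are what make a download reproducible, it can help to check them before running anything. The following is a minimal sketch, not part of the official tooling: it assumes `main.mlirb` sits somewhere below the `macos/` or `ios/` directory of your download, and the helper names `sha256_of` and `verify_bundle` are made up for illustration.

```python
import hashlib
from pathlib import Path

# Expected digests copied from the table above.
EXPECTED_SHA256 = {
    "macos": "e9d1e9e49e70e7abe0c9a24062603a827483ed6ffb9ac4f244c553156c8ce5e7",
    "ios": "a1c45800117f2a0a00df5cd4196485a7ea6bc9d894b3d4cac3f3aa032a1c0a89",
}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large artifacts are never fully loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_bundle(download_dir, bundle):
    """Return True if any main.mlirb under <download_dir>/<bundle> matches the published hash."""
    # Assumption: the exact layout inside the bundle is unknown here, so search recursively.
    candidates = sorted(Path(download_dir, bundle).rglob("main.mlirb"))
    if not candidates:
        raise FileNotFoundError(f"no main.mlirb found under {Path(download_dir, bundle)}")
    return any(sha256_of(p) == EXPECTED_SHA256[bundle] for p in candidates)
```

For example, `verify_bundle("path/to/download", "macos")` should return `True` for an intact copy of the macOS bundle.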

## Measured (Apple's official `llm-benchmark`, greedy)

| Bundle | Device, protocol | Decode tok/s | Prefill | Load (warm) |
|---|---|---:|---:|---:|
| macos | M4 Max, 512p/1024g | 145.4 | 1,635 | 0.36 s |
| ios (ANE, h18p) | iPhone 17 Pro, 512p/1024g | 13.2 | 546 | 0.46 s (cold spec ≈194 s) |
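
To put the decode numbers in wall-clock terms, here is a small back-of-the-envelope sketch. It assumes `1024g` means 1024 generated tokens and that decode throughput stays constant over the run; both are assumptions, not statements from the benchmark.

```python
# Decode throughput (tok/s) copied from the table above.
DECODE_TOK_PER_S = {
    "macos (M4 Max)": 145.4,
    "ios (iPhone 17 Pro, ANE)": 13.2,
}


def decode_seconds(tokens, tok_per_s):
    """Time to generate `tokens` tokens at a constant decode rate."""
    return tokens / tok_per_s


for name, rate in DECODE_TOK_PER_S.items():
    print(f"{name}: {decode_seconds(1024, rate):.1f} s for 1024 tokens")
```

Under those assumptions, the macOS bundle generates 1024 tokens in about 7 s and the iOS bundle in about 78 s, a ratio of roughly 11×.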

## Export environment

- macOS 27.0 beta (build 26A5353q) · Xcode 27.0 (27A5194q)
- `coreai-core 1.0.0b1` · `coreai-torch 0.4.0` · `coreai-opt 0.2.0` · `torch 2.9.0`
- apple/coreai-models @ `b1cb71b` (export code identical to upstream `0c1055f`)

## Run it

```bash
# CLI (from a coreai-models checkout)
swift run -c release llm-runner --model <downloaded-bundle-dir> --prompt "Hello"
swift run -c release llm-benchmark --model <downloaded-bundle-dir>
```

Or chat with it in [CoreAIChatMac](https://github.com/john-rocky/coreai-samples)
(point "Choose Models Folder…" at the download directory).

iOS static bundles must be AOT-compiled before they can run on a device:
`xcrun coreai-build compile <ir>.aimodel --platform iOS --preferred-compute neural-engine --architecture h18p`
(`h18p` = iPhone 17 Pro). Afterwards, set `assets.main` in `metadata.json` to the resulting `.aimodelc`.
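
The `metadata.json` edit can be scripted. This is a minimal sketch under one assumption: that `assets` is a top-level JSON object with a `main` key, as the `assets.main` path suggests. The helper name `set_main_asset` and the file names in the usage note are hypothetical.

```python
import json
from pathlib import Path


def set_main_asset(metadata_path, compiled_name):
    """Point assets.main at the compiled artifact and return the previous value."""
    path = Path(metadata_path)
    metadata = json.loads(path.read_text(encoding="utf-8"))
    # Assumption: "assets" is a top-level object; it is created if missing.
    assets = metadata.setdefault("assets", {})
    previous = assets.get("main")
    assets["main"] = compiled_name
    # All other keys are written back unchanged.
    path.write_text(json.dumps(metadata, indent=2) + "\n", encoding="utf-8")
    return previous
```

A call such as `set_main_asset("ios/metadata.json", "model.aimodelc")` (both names are placeholders) returns the old value, which is worth keeping in case you need to switch back to the uncompiled bundle.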


---
Maintained alongside [coreai-model-zoo](https://github.com/john-rocky/coreai-model-zoo)
(community models) and [coreai-samples](https://github.com/john-rocky/coreai-samples) (apps).