Status: draft (call for contributors)
Owner: OpenQuackKit/Transcription/ + community
Last updated: 2026-05-16
Depends on: SPEC-029
Collect storage and performance numbers across the Mac fleet we don’t own, so that SPEC-029 lands on evidence rather than one machine’s snapshot. Specifically, we need to know — across M1/M2/M3/M4, 8 / 16 / 24+ GB, multiple macOS builds — the answers to:
com.apple.e5rt.e5bundlecache)
vs the on-disk source weights, per Whisper variant?openquack-bench corpus run (bench/CONTRIBUTING.md).
This campaign is additional: it’s footprint + cold-warm timings, not
WER/RTF.Run one script, paste the output into a GitHub Discussion (or PR the JSON file). That’s the whole ask. The script:
OpenQuackBench already collects, so reports collate easily.bench/out/<host-tag>/cache-report.json and prints a
one-screen summary the user can paste.Output schema (one JSON file per host):
{
"host": { "chip": "Apple M4", "ram_gb": 16, "macos_build": "24G84" },
"openquack_version": "0.7.2",
"models_on_disk": [
{ "variant": "openai_whisper-medium", "bytes": 1530000000 }
],
"e5rt_cache": {
"path_os_build": "24G84",
"total_bytes": 107200000,
"bundles": 3
},
"timings": {
"cold_transcribe_ms": 14300,
"warm_transcribe_ms": 820,
"sample_clip": "samples/short_en_10s.wav"
},
"captured_at": "2026-05-16T19:42:00Z"
}
No audio is uploaded. No transcripts are uploaded. The sample clip is shipped with the script — public-domain reading, ~10 s.
After ~30 reports across hardware tiers, we can answer:
Path: scripts/bench_ane_cache.sh (this spec ships it alongside).
Requirements:
bench/out/.openquack-cli if installed for the timed transcribes;
otherwise prints copy-pasteable instructions for running them through
the menu-bar app and pasting the timing back.The full text of the report ends in a one-screen markdown table the user can paste verbatim — no JSON wrangling required for the casual contributor.
bench/out/<host-tag>/cache-report.json.
Mirrors the existing bench-result PR convention.We do not ask volunteers to run a paired before/after macOS-update test unless they happen to be updating anyway. Opportunistic data only — nobody should update their OS for our benefit.
We want at least the cells marked wanted filled by the campaign. Cells marked have are already covered by maintainer hardware or existing bench reports — extra data still welcome but lower priority.
| Chip family | 8 GB | 16 GB | 24 GB | 32 GB | 36–48 GB | 64 GB+ |
|---|---|---|---|---|---|---|
| M1 (base) | wanted | wanted | — | — | — | — |
| M1 Pro | — | wanted | wanted | wanted | — | — |
| M1 Max | — | — | wanted | wanted | wanted | wanted |
| M1 Ultra | — | — | — | — | wanted | wanted |
| M2 (base) | wanted | wanted | wanted | — | — | — |
| M2 Pro | — | wanted | wanted | wanted | — | — |
| M2 Max | — | — | wanted | wanted | wanted | wanted |
| M2 Ultra | — | — | — | — | wanted | wanted |
| M3 (base) | wanted | wanted | wanted | — | — | — |
| M3 Pro | — | wanted | wanted | wanted | — | — |
| M3 Max | — | — | wanted | wanted | wanted | wanted |
| M4 (base) | wanted | have (M4 / 16 GB) | wanted | — | — | — |
| M4 Pro | — | wanted | wanted | wanted | wanted | — |
| M4 Max | — | — | wanted | wanted | wanted | wanted |
| Intel (any) | wanted | wanted | wanted | wanted | wanted | wanted |
Intel Macs are included specifically because they have no ANE — those reports validate the “ANE refused / not used” branch of SPEC-029 and prevent us shipping a default that would regress them.
Form factor (MacBook Air / Pro / iMac / mini / Studio) does not go
in the matrix — the chip + RAM + OS build are the load-bearing fields.
But the script captures form factor anyway since system_profiler
reports it for free; useful for cross-referencing thermal-throttling
anomalies if any show up.
This spec is “done” (i.e. SPEC-029 can move past Step 1 with confidence) when:
scripts/bench_ane_cache.sh be promoted to a subcommand of
openquack-bench (e.g. openquack-bench cache-footprint)? Pro:
one entry point for contributors. Con: the script needs to run on
machines that haven’t built the bench binary.