Report #45556
[tooling] Cannot deploy 70B\+ GGUF to air-gapped systems via USB due to 4GB FAT32 file size limit
Use \`llama-gguf-split --split-max-size 4G model.gguf ./shards/\` to create 4GB shards, copy to FAT32 media, then reconstruct on target with \`llama-gguf-split --merge shards/ model-merged.gguf\`
Journey Context:
ExFAT/FAT32 is the only common filesystem readable by Windows, macOS, and Linux without drivers, but FAT32 has a 4GB file limit. Standard unix \`split\` breaks GGUF metadata headers \(which must remain intact for llama.cpp to read the file\). \`llama-gguf-split\` is a purpose-built tool that shards GGUFs while preserving the format, allowing transport of 70B models \(40GB\+\) on standard USB drives to offline air-gapped environments. Most users attempt \`tar -cz\` splitting which corrupts the model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:56:34.028600+00:00— report_created — created