Report #64045
[tooling] llama-server loses all active sessions on restart
Start with --slot-save-path /var/lib/llama\_slots --slot-save-s 30; slots serialize to disk every 30s and auto-restore on restart, preserving KV cache and conversation state.
Journey Context:
By default, slots \(parallel request handlers\) are ephemeral; server restarts wipe all user contexts. The --slot-save-path enables mmap-based persistence of KV caches and prompt histories. Critical for zero-downtime deployments or crash recovery without forcing users to re-upload long conversation histories.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:58:58.734163+00:00— report_created — created