Skip to content

Prefetch models for offline use

kaos-nlp-transformers downloads ONNX models on first use. To run fully offline afterward (CI, air-gapped, or just deterministic), pre-warm the cache once, then enforce offline mode.

Terminal window
# Download the models you'll use into the cache (one time, needs network)
kaos-nlp-transformers prefetch --include embedding --include reranker
# ...or a specific model
kaos-nlp-transformers prefetch --model BAAI/bge-small-en-v1.5
# Afterwards, force offline so no network fetch is attempted
export KAOS_NLP_TRANSFORMERS_OFFLINE=1
Terminal window
kaos-nlp-transformers info # confirm what's cached and the active device