Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Compatibility Matrix

Runtime requirements

ComponentMinimumTested
LinuxUbuntu 22.04Ubuntu 24.04
WSL2WSL2 on Windows 10/11Windows 10 22H2
Rust toolchain1.85.01.85.0 (pinned)
Python3.113.11
vLLM0.9.00.21.0 (resolved in uv.lock)
NVIDIA driver555.x555.x
CUDA toolkit12.612.6
CUDA Compute Capability8.0 (A100)8.0+

Build requirements

ToolVersion
cargo + rustupRust 1.85.0
maturinlatest (install via pip install maturin)
uv0.4+
nvcc12.6 (only for --features cuda)
mdbooklatest (only for docs)

Model compatibility

Models that have been verified to work with Meridian's phase detection:

Model familyBoundary detectionConfig
DeepSeek-R1Token IDs [128799, 128800]models/deepseek_r1.toml
Qwen3 / Qwen2.5Token IDs [151648, 151649]models/qwen3.toml
IBM Granite 3.2Prose markers (no distinct token IDs)models/granite_3_2.toml

Models that are not verified to work:

  • Models with non-standard <think> tokenisation not listed above — configure think_start_token_ids / think_end_token_ids manually and validate with a sample prompt before production use.
  • Models served through streaming APIs (e.g. Claude via Anthropic API) — Meridian requires direct access to the logit vector, which API-served models do not expose.

Feature flags

Feature flagRequiresStatus
(default — no flags)Linux, RustFully CI-tested
--features prometheusprometheus crateCI-tested
--features unstableRust nightly-gated APIsCI-tested
--features nixllibnixl.so on deploy hostCompiles; integration tested with synthetic mock
--features cudanvcc, CUDA 12.6, GPU at runtimeBuild-tested on GPU CI runner

CI coverage

JobPlatformGPUStatus
rust-coreubuntu-24.04NoCI
rust-kernels (stub)ubuntu-24.04NoSame badge
pythonubuntu-24.04NoSame badge
docsubuntu-24.04NoSame badge
supply-chainubuntu-24.04NoSame badge
cuda-buildself-hosted gpu runnerYesRuns only on angelnicolasc org pushes

The GPU jobs are gated to prevent arbitrary code execution on the self-hosted runner from fork PRs. See GPU CI runner setup.

Known incompatibilities

  • vLLM below 0.9.0: the dependency constraint is vllm>=0.9.0. Earlier versions are not supported and will be rejected at install time.
  • Windows (native): the Rust workspace builds on Windows (tested in development), but the Python extension and benchmarks require Linux or WSL2 for the CUDA and maturin paths.
  • macOS: not supported. CUDA is not available on macOS.