Models that have been verified to work with Meridian's phase detection:
| Model family | Boundary detection | Config |
|---|---|---|
| DeepSeek-R1 | Token IDs [128799, 128800] | `models/deepseek_r1.toml` |
| Qwen3 / Qwen2.5 | Token IDs [151648, 151649] | `models/qwen3.toml` |
| IBM Granite 3.2 | Prose markers (no distinct token IDs) | `models/granite_3_2.toml` |
Models and environments that are not verified or not supported:
- Models with non-standard `<think>` tokenisation that are not listed above: configure `think_start_token_ids` / `think_end_token_ids` manually and validate with a sample prompt before production use.
- Models served through streaming APIs (e.g. Claude via the Anthropic API): Meridian requires direct access to the logit vector, which API-served models do not expose.
- vLLM below 0.9.0: the dependency constraint is `vllm>=0.9.0`. Earlier versions are not supported and are rejected at install time.
- Windows (native): the Rust workspace builds on Windows (tested in development), but the Python extension and benchmarks require Linux or WSL2 for the CUDA and maturin paths.
- macOS: not supported, because CUDA is not available on macOS.
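
For models that need manually configured `think_start_token_ids` / `think_end_token_ids`, one way to validate the IDs is to generate from a sample prompt, capture the output token IDs, and check that a start ID is followed by an end ID. The sketch below is not part of Meridian: `find_think_span` is a hypothetical helper, and the IDs `1001` and `1002` are placeholders rather than values from any real tokenizer.

```python
from typing import Optional, Sequence, Tuple


def find_think_span(
    token_ids: Sequence[int],
    start_ids: Sequence[int],
    end_ids: Sequence[int],
) -> Optional[Tuple[int, int]]:
    """Return (start_index, end_index) of the first think span, or None.

    Hypothetical helper: start_ids / end_ids stand in for the values you
    would put in think_start_token_ids / think_end_token_ids.
    """
    start_set, end_set = set(start_ids), set(end_ids)

    # Position of the first token that opens a thinking phase.
    start_idx = next(
        (i for i, tok in enumerate(token_ids) if tok in start_set), None
    )
    if start_idx is None:
        return None

    # Position of the first closing token strictly after the opening one.
    end_idx = next(
        (i for i in range(start_idx + 1, len(token_ids)) if token_ids[i] in end_set),
        None,
    )
    if end_idx is None:
        return None
    return (start_idx, end_idx)


# Placeholder IDs, not taken from a real tokenizer.
sample_output = [5, 1001, 7, 8, 1002, 9]
print(find_think_span(sample_output, [1001], [1002]))  # -> (1, 4)
```

A `None` result on a prompt that is known to trigger reasoning suggests the configured IDs do not match the model's tokenizer and should be rechecked before production use.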