Update ctransformers.md

PSBigBig 2025-09-05 11:14:59 +08:00 committed by GitHub
parent d80b5bc290
commit dcc0515aef

@@ -1,5 +1,22 @@
# CTransformers: Guardrails and Fix Patterns
<details>
<summary><strong>🧭 Quick Return to Map</strong></summary>
<br>
> You are in a sub-page of **LocalDeploy_Inference**.
> To reorient, go back here:
>
> - [**LocalDeploy_Inference** — on-prem deployment and model inference](./README.md)
> - [**WFGY Global Fix Map** — main Emergency Room, 300+ structured fixes](../README.md)
> - [**WFGY Problem Map 1.0** — 16 reproducible failure modes](../../README.md)
>
> Think of this page as a desk within a ward.
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>

CTransformers provides lightweight Python bindings for Transformer models implemented in C/C++ on top of GGML, and it loads GGML/GGUF model files.
It is widely used in minimal local inference setups (often with quantized LLaMA-family models, including GPTQ variants), but it introduces specific risks: unstable JSON tool output, KV cache drift, and library version mismatches.
This page defines reproducible guardrails and WFGY-based fixes for those risks.
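
As a minimal sketch of two of the guardrails named above (unstable JSON tool output and library version mismatch), the following Python wraps any text generator in a parse-validate-retry loop and reports the installed package version. The helper names (`installed_version`, `extract_json_object`, `generate_tool_call`) and the required keys `tool` and `arguments` are hypothetical and chosen for illustration; they are not part of CTransformers or WFGY. The generator is passed in as a plain callable so the sketch runs without a model.

```python
import json
from importlib import metadata
from typing import Callable, Optional, Sequence


def installed_version(package: str = "ctransformers") -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def extract_json_object(text: str) -> Optional[dict]:
    """Return the first JSON object embedded in `text`, or None if there is none."""
    decoder = json.JSONDecoder()
    start = text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at `start` and ignores
            # whatever trailing text the model appended after it.
            value, _ = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            value = None
        if isinstance(value, dict):
            return value
        start = text.find("{", start + 1)
    return None


def generate_tool_call(
    generate: Callable[[str], str],
    prompt: str,
    required_keys: Sequence[str] = ("tool", "arguments"),  # hypothetical schema
    max_attempts: int = 3,
) -> dict:
    """Call `generate` until it yields a JSON object containing `required_keys`."""
    for _ in range(max_attempts):
        obj = extract_json_object(generate(prompt))
        if obj is not None and all(key in obj for key in required_keys):
            return obj
    raise ValueError(f"no valid tool call after {max_attempts} attempts")
```

With CTransformers, `generate` could be a thin wrapper such as `lambda p: llm(p, max_new_tokens=256)`, where `llm` comes from `AutoModelForCausalLM.from_pretrained(...)`. Logging `installed_version()` next to each failure makes it easier to tell whether a regression tracks a library upgrade rather than the prompt.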