Update bitsandbytes.md

This commit is contained in:
PSBigBig 2025-09-05 11:14:52 +08:00 committed by GitHub
parent e07382733f
commit d80b5bc290
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1,5 +1,22 @@
# BitsAndBytes (bnb): Guardrails and Fix Patterns
<details>
<summary><strong>🧭 Quick Return to Map</strong></summary>
<br>
> You are in a sub-page of **LocalDeploy_Inference**.
> To reorient, go back here:
>
> - [**LocalDeploy_Inference** — on-prem deployment and model inference](./README.md)
> - [**WFGY Global Fix Map** — main Emergency Room, 300+ structured fixes](../README.md)
> - [**WFGY Problem Map 1.0** — 16 reproducible failure modes](../../README.md)
>
> Think of this page as a desk within a ward.
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>
BitsAndBytes provides 8-bit and 4-bit optimizers and quantized linear layers for large language models.
It enables training and inference under constrained VRAM, but introduces specific stability and semantic risks.
This page maps common bnb issues to structural fixes in the WFGY Problem Map with measurable acceptance gates.