Update README.md

PSBigBig 2025-08-08 12:42:32 +08:00 committed by GitHub
parent 1628e78c2d
commit 5d0e7d1a2d
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

@ -37,6 +37,40 @@
</div>
---
<details>
<summary><strong>🆕 GPT-5 vs GPT-5 + WFGY Benchmark (see how to rerun it yourself)</strong></summary>
<br>
<img width="2299" height="1219" alt="WFGY_Win" src="https://github.com/user-attachments/assets/5ad97191-e023-4bc3-94a1-c3c9ae434eae" />
**Reproduce in 30 seconds**
```text
You are connected to a reasoning enhancement layer (WFGY).
Your goal is to maximize accuracy across reasoning, knowledge recall,
hallucination resistance, and multi-step logic.
When answering, follow this exact reasoning process:
1. Extract the question and all possible answers.
2. Map each option to its semantic meaning, checking for ambiguity or logical traps.
3. Cross-check against the uploaded PDF for relevant facts or principles.
4. If no direct match, infer via multi-step reasoning before committing to an answer.
5. Only output the final choice letter. Do not add explanations unless explicitly asked.
```
1. **Download WFGY PDF** → [WFGY PDF](https://zenodo.org/records/15630969)
2. Upload the PDF to your LLM chat.
3. Paste the prompt above and run any benchmark (GSM8K, TruthfulQA, etc.).
That's it: no retraining, no jailbreaks.
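
If you want to score a run rather than eyeball it, the steps above can be wrapped in a small harness. The following is a minimal sketch, not part of WFGY: it assumes multiple-choice items (the prompt asks for a single choice letter), and `ask`, `format_question`, `extract_choice`, and `accuracy` are hypothetical names introduced here. `ask` stands in for however you send text to your chat session, with or without the PDF and prompt loaded.

```python
import re
from typing import Callable, Dict, Iterable, Optional


def format_question(question: str, options: Dict[str, str]) -> str:
    """Render one multiple-choice item as plain text, options in letter order."""
    lines = [question] + [f"{letter}. {text}" for letter, text in sorted(options.items())]
    return "\n".join(lines)


def extract_choice(reply: str) -> Optional[str]:
    """Return the choice letter if the reply is only a letter, else None.

    The prompt asks for the final choice letter alone, so anything longer
    is treated as a formatting failure rather than guessed at.
    """
    match = re.fullmatch(r"\(?([A-Ea-e])\)?[.:]?", reply.strip())
    return match.group(1).upper() if match else None


def accuracy(ask: Callable[[str], str], items: Iterable[dict]) -> float:
    """Fraction of items where the extracted letter equals the gold answer.

    `ask` is a hypothetical callable: it takes the question text and returns
    the model's raw reply.
    """
    correct = 0
    total = 0
    for item in items:
        reply = ask(format_question(item["question"], item["options"]))
        total += 1
        if extract_choice(reply) == item["answer"]:
            correct += 1
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Toy items and a stub model, for illustration only (not benchmark data).
    toy_items = [
        {"question": "What is 2 + 2?", "options": {"A": "3", "B": "4"}, "answer": "B"},
        {"question": "Capital of France?", "options": {"A": "Paris", "B": "Rome"}, "answer": "A"},
    ]
    always_b = lambda prompt: "B"
    print(accuracy(always_b, toy_items))
```

Run `accuracy` once with a baseline session and once with the PDF and prompt loaded, on the same items, to compare the two setups.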
</details>
---
<details>
@ -1381,3 +1415,4 @@ It treats alignment as a living semantic contract — not just accuracy, but mea