# 💻 Hardware & Inference Tips

- **bf16 / fp16**: needs ~9 GB VRAM.
- **4-bit GPTQ**: < 3 GB; `bitsandbytes` works out of the box.
- Compile once (`torch.compile`) for **+10–15 %** throughput.
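The VRAM figures above can be sanity-checked with a back-of-the-envelope weight-memory estimate. The sketch below assumes the numbers correspond to a roughly 4 B-parameter checkpoint with ~1 GB of fixed runtime overhead; both the parameter count and the overhead term are assumptions, not stated in this README:

```python
def estimate_vram_gb(n_params: float, bits_per_param: float,
                     overhead_gb: float = 1.0) -> float:
    """Rough VRAM to hold the weights alone (params x bits / 8),
    plus a flat overhead term for activations / CUDA context.
    The 4e9 parameter count used below is a guess, not a spec."""
    return n_params * bits_per_param / 8 / 1e9 + overhead_gb

print(round(estimate_vram_gb(4e9, 16), 1))  # bf16 / fp16 → 9.0
print(round(estimate_vram_gb(4e9, 4), 1))   # 4-bit GPTQ  → 3.0
```

This is only a lower-bound heuristic: KV-cache growth with batch size and sequence length is not modeled.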

---

Formal **lighteval / MMLU / GSM-8K** runs are queued. Preliminary spot-checks sh…

---

## ⚙️ Limitations & Bias

- No reward-model alignment.
- Long-context (> 4 k) stability untested.
- Training data bias from public QA pairs; Spanish coverage favors Latin American variants.
- Minimal safety filters, so **you** must add your own guardrails for production use.
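Because the model ships with minimal safety filters, any deployment needs its own wrapper around generation. A minimal sketch of that wrapping pattern is below; the blocklist, the `guarded_generate` name, and the stand-in `generate` callable are all hypothetical, and a real deployment should use a proper moderation service rather than regexes:

```python
import re

# Hypothetical guardrail blocklist -- illustrative only, not a real policy.
BLOCKLIST = [r"\bcredit card number\b", r"\bssn\b"]

def guarded_generate(prompt: str, generate) -> str:
    """Refuse blocklisted prompts; otherwise defer to the model's generate fn."""
    for pattern in BLOCKLIST:
        if re.search(pattern, prompt, flags=re.IGNORECASE):
            return "[refused by guardrail]"
    return generate(prompt)

# Usage with a stand-in generate function:
echo = lambda p: f"model answer to: {p}"
print(guarded_generate("What is GPTQ?", echo))   # passes through
print(guarded_generate("Give me a SSN", echo))   # refused
```

The same check can be applied a second time to the model's output before it is returned to the user.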

---

# 🔮 Roadmap