Info
This model is an adapter model trained with the QLoRA technique.
- 📜 Model license: Llama 2 Community License Agreement
- 🏛️ Base Model: Llama-2-70b-hf
- 🖥️ Machine: Nvidia A100 (40 GB vRAM)
- 💵 Cost: $3.5
- ⌛ Training Time: 3 hours 22 minutes
- 📊 Dataset Used: vicgalle/alpaca-gpt4
You can access the Llama 2 paper by clicking here.
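Because this is a QLoRA adapter rather than a full set of weights, it has to be applied on top of the base model at load time. The following is a minimal sketch of how that might look with `transformers`, `peft`, and `bitsandbytes`. It makes several assumptions: `ADAPTER_ID` is a placeholder for this adapter's repository id, the 4-bit NF4 settings are common QLoRA defaults and not necessarily the ones used during training, and the base model is assumed to be the `meta-llama/Llama-2-70b-hf` repository, which requires approved access.

```python
# Sketch only: loads the base model in 4-bit and attaches a LoRA adapter.
# ADAPTER_ID is a placeholder (assumption), not the real repository name.
BASE_MODEL_ID = "meta-llama/Llama-2-70b-hf"
ADAPTER_ID = "your-username/your-adapter-repo"


def quantization_settings():
    """Typical QLoRA-style 4-bit settings (assumed, not confirmed by the card)."""
    return {
        "load_in_4bit": True,
        "bnb_4bit_quant_type": "nf4",
        "bnb_4bit_use_double_quant": True,
    }


def load_adapter_model(adapter_id=ADAPTER_ID, base_model_id=BASE_MODEL_ID):
    """Return (model, tokenizer) with the adapter applied to the 4-bit base model."""
    # Heavy dependencies are imported lazily so the module can be imported
    # on machines without a GPU or without these packages installed.
    import torch
    from peft import PeftModel
    from transformers import (
        AutoModelForCausalLM,
        AutoTokenizer,
        BitsAndBytesConfig,
    )

    bnb_config = BitsAndBytesConfig(
        bnb_4bit_compute_dtype=torch.bfloat16,
        **quantization_settings(),
    )
    base_model = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        quantization_config=bnb_config,
        device_map="auto",
    )
    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    # Attach the trained LoRA weights to the quantized base model.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return model, tokenizer
```

Calling `load_adapter_model()` with the real adapter id should return a model whose `generate` method can be used with inputs produced by the returned tokenizer. The prompt template used during fine-tuning is not stated here, so any prompt format is a guess.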
Evaluation Results (Open LLM Leaderboard)
| | Average | ARC (25-shot) | HellaSwag (10-shot) | MMLU (5-shot) | TruthfulQA (0-shot) |
|---|---|---|---|---|---|
| Scores | 67.3 | 66.38 | 84.51 | 62.75 | 55.57 |
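The reported average appears to be the plain arithmetic mean of the four benchmark scores. A quick check, assuming that is how it was computed (the variable names are illustrative):

```python
# Benchmark scores copied from the table above.
scores = {
    "ARC (25-shot)": 66.38,
    "HellaSwag (10-shot)": 84.51,
    "MMLU (5-shot)": 62.75,
    "TruthfulQA (0-shot)": 55.57,
}

# Unweighted mean across the four benchmarks, rounded to one decimal place.
average = round(sum(scores.values()) / len(scores), 1)
print(average)  # 67.3
```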
Loss Graph