File size: 1,514 Bytes
ea13b99 20e6786 ea13b99 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 |
---
library_name: transformers
license: other
license_name: custom
license_link: LICENSE
model_index:
- name: Llama-speechlmm-1.0-l-SQA
base_model:
- meetween/Llama-speechlmm-1.0-l
datasets:
- Spoken-SQuAD
language:
- en
metrics:
- exact match, F1 score
pipeline_tag: translation
---
## Model Information
This is the version of [meetween/Llama-speechlmm-1.0-l](https://huggingface.co/meetween/Llama-speechlmm-1.0-l) that was
fine-tuned for Speech Question Answering.
**License:** see [LICENSE](LICENSE)
## Model Architecture
Identical to the base model. The model was obtained by training LoRA and the modality adapter on the LLM.
This repository contains the model weights with LoRA merged into the main weights.
## How to Use
Identical to the base model.
## Fine-tuning Data
This model has been fine-tuned on the same Spoken-SQuAD data from the training data of the base
model.
## Evaluation Results
| Model Name | Exact Match (%) | F1 Score (%) |
|------------------------------------------------|------------------|--------------|
| Cascaded: Whisper-Large-V3 and Llama 3.1 8B Instruct | 24.73 | 50.82 |
| Phi-4-multimodal-instruct | 38.62 | 62.44 |
| SpeechLMM_v1.0_L | 65.45 | 77.35 |
| SpeechLMM_v1.0_L_SQA | 65.24 | 77.17 |
## Framework Versions
- Transformers 4.45.0
- Pytorch 2.3.1+cu124.post2
- Datasets 3.2.0
- Tokenizers 0.20.0 |