--- library_name: transformers license: other license_name: custom license_link: LICENSE model_index: - name: Llama-speechlmm-1.0-l-SQA base_model: - meetween/Llama-speechlmm-1.0-l datasets: - Spoken-SQuAD language: - en metrics: - exact match, F1 score pipeline_tag: translation --- ## Model Information This is the version of [meetween/Llama-speechlmm-1.0-l](https://huggingface.co/meetween/Llama-speechlmm-1.0-l) that was fine-tuned for Speech Question Answering. **License:** see [LICENSE](LICENSE) ## Model Architecture Identical to the base model. The model was obtained by training LoRA and the modality adapter on the LLM. This repository contains the model weights with LoRA merged into the main weights. ## How to Use Identical to the base model. ## Fine-tuning Data This model has been fine-tuned on the same Spoken-SQuAD data from the training data of the base model. ## Evaluation Results | Model Name | Exact Match (%) | F1 Score (%) | |------------------------------------------------|------------------|--------------| | Cascaded: Whisper-Large-V3 and Llama 3.1 8B Instruct | 24.73 | 50.82 | | Phi-4-multimodal-instruct | 38.62 | 62.44 | | SpeechLMM_v1.0_L | 65.45 | 77.35 | | SpeechLMM_v1.0_L_SQA | 65.24 | 77.17 | ## Framework Versions - Transformers 4.45.0 - Pytorch 2.3.1+cu124.post2 - Datasets 3.2.0 - Tokenizers 0.20.0