maikezu's picture
Update README.md
20e6786 verified
metadata
library_name: transformers
license: other
license_name: custom
license_link: LICENSE
model_index:
  - name: Llama-speechlmm-1.0-l-SQA
base_model:
  - meetween/Llama-speechlmm-1.0-l
datasets:
  - Spoken-SQuAD
language:
  - en
metrics:
  - exact match, F1 score
pipeline_tag: translation

Model Information

This is the version of meetween/Llama-speechlmm-1.0-l that was fine-tuned for Speech Question Answering.

License: see LICENSE

Model Architecture

Identical to the base model. The model was obtained by training LoRA and the modality adapter on the LLM. This repository contains the model weights with LoRA merged into the main weights.

How to Use

Identical to the base model.

Fine-tuning Data

This model has been fine-tuned on the same Spoken-SQuAD data from the training data of the base model.

Evaluation Results

Model Name Exact Match (%) F1 Score (%)
Cascaded: Whisper-Large-V3 and Llama 3.1 8B Instruct 24.73 50.82
Phi-4-multimodal-instruct 38.62 62.44
SpeechLMM_v1.0_L 65.45 77.35
SpeechLMM_v1.0_L_SQA 65.24 77.17

Framework Versions

  • Transformers 4.45.0
  • Pytorch 2.3.1+cu124.post2
  • Datasets 3.2.0
  • Tokenizers 0.20.0