antgroup
/

HumanSense_Omni_Reasoning

Video-Text-to-Text

Model card Files Files and versions

HumanSense_Omni_Reasoning / README.md

nielsr's picture

nielsr HF Staff

Update pipeline tag and add library name

f87475b verified 13 days ago

|

492 Bytes

	---
	base_model:
	- Qwen/Qwen2.5-Omni-7B
	datasets:
	- antgroup/HumanSense_Benchmark
	language:
	- en
	license: apache-2.0
	metrics:
	- accuracy
	pipeline_tag: video-text-to-text
	library_name: transformers
	---

	<div align="center" style="font-family: charter;">


	<p align="center">
	<img src="pic.png" width="400"/>
	<p>

	<!-- <h1></br>From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMs</h1> -->

	<div>
	<a href="https://scholar.google.com/citations?user=sPQ