Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sbintuitions
/
sarashina2-vision-8b
like
4
Follow
SB Intuitions
156
Image-to-Text
Transformers
Safetensors
Japanese
English
sarashina2_vision
text-generation
multimodal
vision-language
llama
qwen2_vl
custom_code
License:
mit
Model card
Files
Files and versions
Community
1
Train
Use this model
021aa10
sarashina2-vision-8b
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
toshi-456
update modeling_sarashina2_vision.py
021aa10
verified
21 days ago
.gitattributes
1.57 kB
update
about 1 month ago
LICENSE
Safe
1.07 kB
update
about 1 month ago
README.md
5.44 kB
Update README.md
about 1 month ago
chat_template.json
533 Bytes
update
about 1 month ago
config.json
852 Bytes
update
about 1 month ago
configuration_sarashina2_vision.py
2.92 kB
update
about 1 month ago
generation_config.json
Safe
111 Bytes
update
about 1 month ago
model-00001-of-00002.safetensors
9.99 GB
LFS
update
about 1 month ago
model-00002-of-00002.safetensors
6 GB
LFS
update
about 1 month ago
model.safetensors.index.json
53.9 kB
update
about 1 month ago
modeling_sarashina2_vision.py
11.5 kB
update modeling_sarashina2_vision.py
21 days ago
preprocessor_config.json
680 Bytes
update
about 1 month ago
processing_sarashina2_vision.py
17.1 kB
update
about 1 month ago
processor_config.json
150 Bytes
update
about 1 month ago
sample.jpg
2.51 MB
LFS
update
about 1 month ago
special_tokens_map.json
Safe
968 Bytes
update
about 1 month ago
tokenizer.json
Safe
6.72 MB
update
about 1 month ago
tokenizer.model
Safe
1.83 MB
LFS
update
about 1 month ago
tokenizer_config.json
4.46 kB
update
about 1 month ago