Russian emotion analisys
Collection
Набор моделей разной степени плохости для задач определения эмоции
•
2 items
•
Updated
Модель rubert_tiny2_russian_emotion_sentiment
— это дообученная версия легковесной модели cointegrated/rubert-tiny2
для классификации пяти эмоций в русскоязычных сообщениях:
Метрика | Значение |
---|---|
Accuracy | 0.8911 |
F1 macro | 0.8910 |
F1 micro | 0.8911 |
Точность по классам:
pip install transformers torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
# Загружаем модель и токенизатор
MODEL_ID = "Kostya165/rubert_tiny2_russian_emotion_sentiment"
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)
model.eval()
texts = [
"Сегодня отличный день!",
"Меня это всё бесит и раздражает."
]
# Токенизация
enc = tokenizer(texts, padding=True, truncation=True, max_length=128, return_tensors="pt")
with torch.no_grad():
logits = model(**enc).logits
preds = logits.argmax(dim=-1).tolist()
# Преобразуем ID обратно в метки
id2label = model.config.id2label
labels = [id2label[p] for p in preds]
print(labels) # например: ['positive', 'aggression']
cointegrated/rubert-tiny2
Kostya165/ru_emotion_dvach
transformers>=4.30.0
torch>=1.10.0
datasets
evaluate
CC-BY-SA 4.0.
@article{rubert_tiny2_russian_emotion_sentiment,
title = {Russian Emotion Sentiment Classification with RuBERT-tiny2},
author = {Kostya165},
year = {2024},
howpublished = {\url{https://huggingface.co/Kostya165/rubert_tiny2_russian_emotion_sentiment}}
}
Description
The rubert_tiny2_russian_emotion_sentiment
model is a fine‑tuned version of the lightweight cointegrated/rubert-tiny2
for classifying five emotions in Russian text:
Validation Results
Metric | Value |
---|---|
Accuracy | 0.8911 |
F1 macro | 0.8910 |
F1 micro | 0.8911 |
Per‑class accuracy:
Usage
pip install transformers torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
MODEL_ID = "Kostya165/rubert_tiny2_russian_emotion_sentiment"
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)
model.eval()
texts = ["Сегодня отличный день!", "Меня это всё бесит и раздражает."]
enc = tokenizer(texts, padding=True, truncation=True, max_length=128, return_tensors="pt")
with torch.no_grad():
logits = model(**enc).logits
preds = logits.argmax(dim=-1).tolist()
labels = [model.config.id2label[p] for p in preds]
print(labels) # e.g. ['positive', 'aggression']
Training Details
cointegrated/rubert-tiny2
Kostya165/ru_emotion_dvach
(train/validation) Requirements
transformers>=4.30.0
torch>=1.10.0
datasets
evaluate
License
CC-BY-SA 4.0.
Citation
@article{rubert_tiny2_russian_emotion_sentiment,
title = {Russian Emotion Sentiment Classification with RuBERT-tiny2},
author = {Kostya165},
year = {2024},
howpublished = {\url{https://huggingface.co/Kostya165/rubert_tiny2_russian_emotion_sentiment}}
}
Base model
cointegrated/rubert-tiny2