Dimitris Roussis
droussis
AI & ML interests
All things data for LLMs, NMT, evaluation, safety, alignment, and more
Recent Activity
new activity
6 days ago
nvidia/OpenCodeReasoning: Question about number of samples and questions
liked
a Space
6 days ago
openGPT-X/european-llm-leaderboard
upvoted
a paper
19 days ago
Qwen2.5-Omni Technical Report
droussis's activity
Question about number of samples and questions
1
3
#4 opened 7 days ago
by
MeiGao
Thinking token generation
1
3
#2 opened about 2 months ago
by
thtang
What languages were you trained in?
1
2
#7 opened about 1 month ago
by
NickyNicky

Bug on the tokenizer, using the code that you provided for the inference.
6
#2 opened 2 months ago
by
Ptrnk
Seems very promising
5
5
#1 opened 2 months ago
by
gstrat88
Is this the same as Kurage?
2
#2 opened 5 months ago
by
droussis

About context size and difference in quality
3
#1 opened 11 months ago
by
droussis

Future plans (Llama 3?)
1
1
#3 opened 12 months ago
by
velocity

LLama-Factory inference issue
2
14
#2 opened about 1 year ago
by
ianss
Regarding quality assessment
2
#1 opened over 1 year ago
by
droussis

Community request: more languages
1
8
#1 opened over 1 year ago
by
emre

Which part of HC3?
#1 opened over 1 year ago
by
droussis

The model output is totally corrupted
4
4
#5 opened over 1 year ago
by
fernandofernandes

Fix weights by putting the right value in `lm_head.weight`
1
3
#3 opened almost 2 years ago
by
sgugger
