Dimitris Roussis's picture

Dimitris Roussis

droussis

·

https://www.ilsp.gr/en/members/roussis-dimitris/

AI & ML interests

All things data for LLMs, NMT, evaluation, safety, alignment, and more

Organizations

New activity in nvidia/Nemotron-Pretraining-SFT-v1 3 months ago

Use in continual pretraining mix of Apache 2.0 model

#2 opened 3 months ago by

New activity in zai-org/GLM-4.6 3 months ago

Thinking language difference with 4.5?

#10 opened 3 months ago by

New activity in JQL-AI/fw2_edu_scores 3 months ago

Connection with Fineweb-2

#2 opened 3 months ago by

New activity in MegaScience/MegaScience 3 months ago

License

#3 opened 5 months ago by

New activity in JQL-AI/Fineweb_2_500k_filtered 4 months ago

About selection of examples

#2 opened 4 months ago by

New activity in huggingface/InferenceSupport 4 months ago

ilsp/Llama-Krikri-8B-Instruct

#4815 opened 4 months ago by

New activity in Reward-Reasoning/RRM-32B 6 months ago

Regarding the rewards in the knockout tournament

#1 opened 7 months ago by

New activity in open-llm-leaderboard/ilsp__Llama-Krikri-8B-Instruct-details 7 months ago

Add Citation Information

#1 opened 7 months ago by

commented a paper 7 months ago

Krikri: Advancing Open Large Language Models for Greek

Paper • 2505.13772 • Published May 19, 2025 • 6 •

New activity in sethjsa/scipar_en_ru_parallel 8 months ago

Please consider adding citation information in the dataset card.

#1 opened 8 months ago by

New activity in FrancophonIA/SciPar 8 months ago

Please consider adding citation

#1 opened 8 months ago by

New activity in nvidia/OpenCodeReasoning 9 months ago

Question about number of samples and questions

#4 opened 9 months ago by

New activity in Jofthomas/hermes-function-calling-thinking-V1 10 months ago

Thinking token generation

#2 opened 10 months ago by

New activity in OpenLeecher/lmsys_chat_1m_clean 10 months ago

What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?

#5 opened 12 months ago by

New activity in Qwen/QwQ-32B 10 months ago

What languages were you trained in?

#7 opened 10 months ago by

New activity in ilsp/Llama-Krikri-8B-Instruct 11 months ago

Bug on the tokenizer, using the code that you provided for the inference.

#2 opened 11 months ago by

Seems very promising

#1 opened 11 months ago by

New activity in lightblue/rag_multilingual_training_negatives about 1 year ago

Is this the same as Kurage?

#2 opened about 1 year ago by

New activity in bitextor/bicleaner-ai-full-large-en-xx over 1 year ago

About context size and difference in quality

#1 opened over 1 year ago by

New activity in ilsp/Meltemi-7B-v1 over 1 year ago

Future plans (Llama 3?)

#3 opened over 1 year ago by