Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models • Paper • 2506.19697 • Published 6 days ago
What Matters in Transformers? Not All Attention is Needed • Paper • 2406.15786 • Published Jun 22, 2024