Peter Kis

NePe
ยท

AI & ML interests

None yet

Recent Activity

updated a model about 15 hours ago
NePe/Qwen3-30B-A3B-GPTQ
published a model about 15 hours ago
NePe/Qwen3-30B-A3B-GPTQ
View all activity

Organizations

Hugging Face Discord Community's profile picture

NePe's activity

New activity in JunHowie/Qwen3-30B-A3B-GPTQ-Int4 1 day ago

Slow GPTQ inference

4
#2 opened 2 days ago by
NePe
New activity in AlphaGaO/Qwen3-30B-A3B-GPTQ 1 day ago

Slow GPTQ inference

9
#1 opened 2 days ago by
NePe
New activity in moonshotai/Moonlight-16B-A3B-Instruct 9 days ago

PEFT finetuning support

#14 opened 9 days ago by
NePe
New activity in google/gemma-2-27b-it 10 months ago
New activity in rainjay/gemma-2-27b-it-4bit 10 months ago
reacted to santiviquez's post with ๐Ÿ”ฅ 11 months ago
view post
Post
1568
I ran 580 experiments (yes, 580 ๐Ÿคฏ) to check if we can quantify data drift's impact on model performance using only drift metrics.

For these experiments, I built a technique that relies on drift signals to estimate model performance. I compared its results against the current SoTA performance estimation methods and checked which technique performs best.

The plot below summarizes the general results. It measures the quality of performance estimation versus the absolute performance change. (The lower, the better).

Full experiment: https://www.nannyml.com/blog/data-drift-estimate-model-performance

In it, I describe the setup, datasets, models, benchmarking methods, and the code used in the project.
reacted to andrewrreed's post with โค๏ธ 12 months ago
view post
Post
2623
๐Ÿ”ฌ Open LLM Progress Tracker ๐Ÿ”ฌ

Inspired by the awesome work from @mlabonne , I created a Space to monitor the narrowing gap between open and proprietary LLMs as scored by the LMSYS Chatbot Arena ELO ratings ๐Ÿค—

The goal is to have a continuously updated place to easily visualize these rapidly evolving industry trends ๐Ÿš€

๐Ÿ”— Open LLM Progress Tracker: andrewrreed/closed-vs-open-arena-elo
๐Ÿ”— Source of Inspiration: https://www.linkedin.com/posts/maxime-labonne_arena-elo-graph-updated-with-new-models-activity-7187062633735368705-u2jB/
  • 2 replies
ยท