Malikeh Ehghaghi's picture

Malikeh Ehghaghi

Malikeh1375

·

AI & ML interests

NLP, Modular ML, Model Merging, Decentralized Training, Efficient LLMs

Recent Activity

updated a model 1 day ago

Malikeh1375/Qwen2.5-1.5B-Advanced-Mathematics-And-Modeling-Distilled-8Clusters-25K

published a model 1 day ago

Malikeh1375/Qwen2.5-1.5B-Advanced-Mathematics-And-Modeling-Distilled-8Clusters-25K

updated a model 5 days ago

Malikeh1375/Qwen2.5-1.5B-Non-English-Math-Distilled-8Clusters-25K

View all activity

Organizations

upvoted a paper about 1 month ago

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 44

upvoted 2 papers about 2 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 49

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 46

upvoted a collection 3 months ago

supertoken

The initial checkpoints for the token comparison research. • 20 items • Updated May 22 • 1

upvoted a collection 5 months ago

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 69 items • Updated Jul 4 • 10

upvoted 2 papers 10 months ago

Revealing the Barriers of Language Agents in Planning

Paper • 2410.12409 • Published Oct 16, 2024 • 28

EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation

Paper • 2410.09704 • Published Oct 13, 2024 • 13

upvoted a collection 11 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 123

upvoted a paper 11 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 141

upvoted 2 collections 11 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 29 days ago • 631

Korean Datasets I've released so far.

지금까지 업로드한 한국어 데이터셋 콜렉션입니다. • 8 items • Updated May 24, 2024 • 20

upvoted a collection 12 months ago

Arabic Light Benchmarks

10% sample of the original benchmarks for each dataset from lighteval • 7 items • Updated Sep 10, 2024 • 2

upvoted an article 12 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19, 2024

• 78

upvoted a collection about 1 year ago

Arabic ORPO-DPO Datasets

12 items • Updated Aug 17, 2024 • 2

upvoted 3 papers about 1 year ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 167

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 19

upvoted 2 collections about 1 year ago

Top 10% instruction tuning datasets

Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes • 13 items • Updated Jul 3, 2024 • 11

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 38

upvoted a paper about 1 year ago

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8, 2024 • 16