Yosef Worku Alemneh (rasyosef)
AI & ML interests
Pretraining, Supervised Fine-Tuning, Direct Preference Optimization, Retrieval-Augmented Generation (RAG), Function Calling
Recent Activity
- Updated a model (rasyosef/splade-small) about 18 hours ago
- Updated a collection (SPLADE-Tiny-MSMARCO) about 19 hours ago
- Published a model (rasyosef/splade-small) about 19 hours ago
Collections
Llama 3.2 Amharic
Llama 3.2 decoder transformer models trained on Amharic text
- rasyosef/Llama-3.2-400M-Amharic • Text Generation • 0.4B params
- rasyosef/Llama-3.2-400M-Amharic-Instruct-Poems-Stories-Wikipedia • Text Generation • 0.4B params
- rasyosef/Llama-3.2-400M-Amharic-Instruct • Text Generation • 0.4B params
- rasyosef/Llama-3.2-180M-Amharic • Text Generation • 0.2B params
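For reference, a minimal sketch of generating Amharic text with the base model from this collection, using the Hugging Face transformers text-generation pipeline; the prompt is an illustrative Amharic phrase, not an official example.

```python
# Minimal text-generation sketch for rasyosef/Llama-3.2-400M-Amharic
# (assumes the transformers library is installed; the prompt is illustrative).
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="rasyosef/Llama-3.2-400M-Amharic",
)

# "አዲስ አበባ" means "Addis Ababa"; the model continues the text in Amharic.
outputs = generator("አዲስ አበባ", max_new_tokens=50, do_sample=True, temperature=0.7)
print(outputs[0]["generated_text"])
```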
Amharic GPT2
GPT-2 transformer decoder models pretrained on 290 million tokens of Amharic text
Phi 2 Chat Models
Chat models created through supervised fine-tuning and direct preference optimization for instruction following, on top of Microsoft's Phi-2 base LLM
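A hedged sketch of chatting with one of these instruct models via the tokenizer's chat template; the model id below is a placeholder (this page does not list the individual checkpoints), and the same pattern applies to the Phi 1.5 and Minitron chat collections further down.

```python
# Sketch of querying an SFT+DPO instruct model with transformers.
# NOTE: "rasyosef/phi-2-instruct" is a placeholder id, not confirmed by this page.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "rasyosef/phi-2-instruct"  # placeholder; check the collection for the real model ids
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [{"role": "user", "content": "Explain direct preference optimization in one sentence."}]
# apply_chat_template formats the conversation with the model's own chat template.
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
output_ids = model.generate(input_ids, max_new_tokens=100)
# Decode only the newly generated tokens (everything after the prompt).
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```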
Amharic Text Embedding Models
Text Embedding and ColBERT models based on Amharic RoBERTa and BERT for Amharic passage retrieval
- Paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval (arXiv:2505.19356)
- rasyosef/roberta-amharic-text-embedding-base • Sentence Similarity • 0.1B params
- rasyosef/colbert-roberta-amharic-base • Sentence Similarity • 0.1B params
- rasyosef/roberta-amharic-text-embedding-medium • Sentence Similarity
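A minimal retrieval sketch with the dense embedding model listed above, using sentence-transformers (v3+ for the similarity helper). The Amharic query and passages are illustrative, and any query/passage prompt prefixes the model may expect are omitted here.

```python
# Dense Amharic passage-retrieval sketch with sentence-transformers.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("rasyosef/roberta-amharic-text-embedding-base")

query = "የኢትዮጵያ ዋና ከተማ የት ነው?"  # "Where is the capital of Ethiopia?"
passages = [
    "አዲስ አበባ የኢትዮጵያ ዋና ከተማ ናት።",  # "Addis Ababa is the capital of Ethiopia."
    "ዓባይ ወንዝ ከጣና ሐይቅ ይነሳል።",  # "The Abay river rises from Lake Tana."
]

query_emb = model.encode(query)
passage_embs = model.encode(passages)

# Cosine similarity between the query and each passage; the first passage should score higher.
print(model.similarity(query_emb, passage_embs))
```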
Amharic BERT and RoBERTa
BERT and RoBERTa transformer encoder models pretrained on 290 million tokens of Amharic text
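A hedged fill-mask sketch for these pretrained encoders; the model id is a placeholder (this page does not list the individual checkpoints) and the sentence is illustrative.

```python
# Masked-token prediction with a pretrained Amharic encoder.
# NOTE: "rasyosef/roberta-base-amharic" is a placeholder id, not confirmed by this page.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="rasyosef/roberta-base-amharic")  # placeholder model id

# "Addis Ababa is the [MASK] city of Ethiopia." -- "ዋና" (main/capital) should rank highly.
sentence = f"አዲስ አበባ የኢትዮጵያ {fill_mask.tokenizer.mask_token} ከተማ ናት።"
for prediction in fill_mask(sentence, top_k=3):
    print(prediction["token_str"], round(prediction["score"], 3))
```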
Phi 1.5 Chat Models
Chat models created through supervised fine-tuning and direct preference optimization for instruction following, on top of Microsoft's Phi-1.5 base LLM
Minitron Chat Models
Instruction-tuned (chat) versions of Nvidia's Minitron base models created through supervised fine-tuning (SFT)
SPLADE-Tiny-MSMARCO
SPLADE sparse retrieval models based on BERT-Tiny (4M params) and BERT-Mini (11M params), distilled from a cross-encoder on the MS MARCO dataset
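A minimal sparse-retrieval sketch with rasyosef/splade-small (named in the recent-activity feed above), assuming it loads through the SparseEncoder class introduced in sentence-transformers v5; the query and documents are illustrative, not taken from MS MARCO.

```python
# SPLADE sparse-retrieval sketch with sentence-transformers' SparseEncoder (v5+).
from sentence_transformers import SparseEncoder

model = SparseEncoder("rasyosef/splade-small")

query = "how do solar panels generate electricity"
documents = [
    "Photovoltaic cells convert sunlight directly into electricity.",
    "The Great Wall of China was built over many centuries.",
]

query_emb = model.encode_query(query)
doc_embs = model.encode_document(documents)

# Dot-product scores over vocabulary-sized sparse vectors; the first document should score higher.
print(model.similarity(query_emb, doc_embs))
```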