132 34 433

Jeonghwan Park PRO

maywell

https://www.linkedin.com/in/jeonghwan-park-6b97b1245

AI & ML interests

None yet

Recent Activity

published a model about 18 hours ago

maywell/Tri-70B-el-160s

new activity 7 days ago

upstage/Solar-Open-100B:Inquiry regarding base model and training origin

liked a model 8 days ago

naver-hyperclovax/HyperCLOVAX-SEED-Think-32B

View all activity

Organizations

upvoted an article 27 days ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

30 days ago

•

upvoted a paper about 1 month ago

Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel

Paper • 2508.18224 • Published Aug 25, 2025 • 1

upvoted a paper about 2 months ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12, 2025 • 69

upvoted a paper 2 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 83

upvoted an article 3 months ago

Article

Vocabulary is the most important element of Sparse Retrieval

Oct 4, 2025

•

upvoted an article 4 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26, 2025

•

177

upvoted an article 5 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9, 2025

•

upvoted a paper 10 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

upvoted 3 papers about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 38

upvoted an article about 1 year ago

Article

Navigating Korean LLM Research #1: Models

Oct 22, 2024

•

upvoted a paper about 1 year ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14, 2024 • 51

upvoted a collection about 1 year ago

Gemma-APS Release

Collection

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Jul 10, 2025 • 24

upvoted 3 articles over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

272

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Aug 22, 2024

•

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

263

upvoted a paper over 1 year ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 82

upvoted an article over 1 year ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

109

upvoted a paper over 1 year ago

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 33

Jeonghwan Park PRO

AI & ML interests

Recent Activity

Organizations

maywell's activity

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Vocabulary is the most important element of Sparse Retrieval

Training and Finetuning Reranker Models with Sentence Transformers v4

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Navigating Korean LLM Research #1: Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Training and Finetuning Embedding Models with Sentence Transformers v3

Putting RL back in RLHF