Jaehyun Jun's picture

Jaehyun Jun

btjhjeon

·

https://btjhjeon.github.io/

btjhjeon

AI & ML interests

Multimodal

Recent Activity

updated a collection 2 days ago

Multimodal Reasoning

updated a collection 2 days ago

Multimodal Reasoning

updated a collection 2 days ago

View all activity

Organizations

upvoted a collection 17 days ago

A.X 4

4 items • Updated 17 days ago • 35

upvoted 3 collections 18 days ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated 9 days ago • 151

Gemma 3n

4 items • Updated 10 days ago • 185

GLM-4.1V-Thinking

5 items • Updated 18 days ago • 46

upvoted a paper 25 days ago

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Paper • 2506.19767 • Published 25 days ago • 13

upvoted a paper 26 days ago

Show-o2: Improved Native Unified Multimodal Models

Paper • 2506.15564 • Published Jun 18 • 29

upvoted 2 papers about 1 month ago

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Paper • 2506.09040 • Published Jun 10 • 36

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 75

upvoted 4 papers about 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 115

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

Paper • 2505.10887 • Published May 16 • 10

Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI

Paper • 2505.19443 • Published May 26 • 15

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted 2 collections 2 months ago

Code Generation

37 items • Updated 6 days ago • 2

OpenCodeReasoning-II

Reasoning data for supervised finetuning of LLMs to advance code generation and critique • 5 items • Updated 9 days ago • 8

upvoted a paper 2 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 95

upvoted a paper 3 months ago

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

Paper • 2504.18589 • Published Apr 24 • 13

upvoted a collection 3 months ago

HyperCLOVA X SEED

HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 3 items • Updated Apr 24 • 27

upvoted 2 papers 3 months ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21 • 66

MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

Paper • 2504.13835 • Published Apr 18 • 38

upvoted a collection 3 months ago

InternVL3

34 items • Updated Apr 20 • 73