41 213

Jaekoo Kang

jkang

https://github.com/jaekookang

AI & ML interests

Anything fun and interesting

Recent Activity

liked a dataset 1 day ago

meal-bbang/Korean_message

liked a model 1 day ago

blockenters/sms-spam-classifier

liked a model about 1 month ago

skt/A.X-K1

View all activity

Organizations

upvoted 3 papers 11 months ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 15

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Paper • 2408.14886 • Published Aug 27, 2024 • 11

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Paper • 2402.00340 • Published Feb 1, 2024 • 2

upvoted a paper 12 months ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17, 2025 • 13

upvoted a collection 12 months ago

SuperBPE

Collection

SuperBPE tokenizers and models trained with them • 8 items • Updated 11 days ago • 17

upvoted 3 papers over 1 year ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44

Pix2Gif: Motion-Guided Diffusion for GIF Generation

Paper • 2403.04634 • Published Mar 7, 2024 • 16

upvoted 4 papers about 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 627

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6, 2024 • 15

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Paper • 2312.15011 • Published Dec 22, 2023 • 18

Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

Paper • 2401.00935 • Published Jan 1, 2024 • 18

upvoted 8 papers over 2 years ago

Context Tuning for Retrieval Augmented Generation

Paper • 2312.05708 • Published Dec 9, 2023 • 16

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis

Paper • 2311.12454 • Published Nov 21, 2023 • 30

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Paper • 2311.12024 • Published Nov 20, 2023 • 19

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

Paper • 2311.12631 • Published Nov 21, 2023 • 14

UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework

Paper • 2311.10125 • Published Nov 16, 2023 • 6

Jaekoo Kang

AI & ML interests

Recent Activity

Organizations

jkang's activity