KuKu

dragonkue

12 96 197

AI & ML interests

anything.

Recent Activity

liked a model 3 days ago

nlpai-lab/LAMAR-600m

liked a model 8 days ago

nvidia/Nemotron-3-Embed-8B-BF16

liked a dataset 8 days ago

lightonai/nv-embed-supervised-distill-dedup-code

View all activity

Organizations

upvoted an article 9 days ago

Article

Build a Domain-Specific Embedding Model in Under a Day

nvidia

•

Mar 20

• 77

upvoted an article 10 days ago

Article

NVIDIA Nemotron 3 Embed Ranks #1 Overall on RTEB, Advancing Agentic Retrieval

nvidia

•

10 days ago

• 57

upvoted an article about 1 month ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

Jun 18

• 83

upvoted 2 papers about 1 month ago

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11, 2025 • 12

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

Paper • 2501.14818 • Published Jan 20, 2025 • 10

upvoted an article about 1 month ago

Article

Party is over: regularizing ColBERT models to fix efficient ANN methods

lightonai

•

Jun 16

• 23

upvoted 2 papers about 2 months ago

Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking

Paper • 2405.07920 • Published May 13, 2024 • 4

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

Paper • 2603.19223 • Published Mar 19 • 37

upvoted a collection about 2 months ago

F2LLM

Collection

23 items • Updated Mar 20 • 7

upvoted a paper about 2 months ago

Is Position Bias in Dense Retrievers Built In-or Learned from Data?

Paper • 2605.26578 • Published May 26 • 20

upvoted an article 2 months ago

Article

Introducing the Ettin Reranker Family

tomaarsen

•

May 19

• 55

upvoted a paper 3 months ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 82

upvoted 2 articles 3 months ago

Article

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

nvidia

•

Apr 21

• 26

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

Apr 21

• 44

upvoted a collection 4 months ago

pplx-embed

Collection

Diffusion-Pretrained Dense and Contextual Embeddings • 10 items • Updated May 26 • 100

upvoted 2 articles 4 months ago

Article

🥃 Distilling Tiny Embeddings

NeuML

•

Jan 10

• 24

Article

ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark

Metric-AI

•

Mar 19

• 10

upvoted a paper 4 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

upvoted a collection 5 months ago

Qwen3.5-text-only

Collection

Text-only versions of Qwen-3.5 without the vision encoders for a smaller memory and storage footprint. • 4 items • Updated Jun 5 • 15

upvoted a paper 5 months ago

zELO: ELO-inspired Training Method for Rerankers and Embedding Models

Paper • 2509.12541 • Published Sep 16, 2025 • 12

KuKu

AI & ML interests

Recent Activity

Organizations

dragonkue's activity

Build a Domain-Specific Embedding Model in Under a Day

NVIDIA Nemotron 3 Embed Ranks #1 Overall on RTEB, Advancing Agentic Retrieval

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Party is over: regularizing ColBERT models to fix efficient ANN methods

Introducing the Ettin Reranker Family

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

🥃 Distilling Tiny Embeddings

ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark