3 20 119

Dennis

denniscraandijk

DennisCraandijk

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

SuperBPE: Space Travel for Language Models

liked a model 9 days ago

ai4privacy/llama-ai4privacy-multilingual-categorical-anonymiser-openpii

liked a model 16 days ago

google/gemma-3-27b-it

View all activity

Organizations

denniscraandijk's activity

upvoted a paper 5 days ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published 10 days ago • 7

upvoted 4 papers 16 days ago

It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers

Paper • 2502.03793 • Published Feb 6 • 4

upvoted a collection 17 days ago

Babel

Collection

Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 7 items • Updated 20 days ago • 16

upvoted a paper 25 days ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published about 1 month ago • 26

upvoted a collection about 1 month ago

Nomic Embed v2

Collection

Multilingual Embedding Models • 4 items • Updated Feb 15 • 17

upvoted a collection about 2 months ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 14 days ago • 96

upvoted a paper about 2 months ago

SPLADE-v3: New baselines for SPLADE

Paper • 2403.06789 • Published Mar 11, 2024 • 2

upvoted 2 collections 3 months ago

KaLM-embedding

Collection

11 items • Updated 17 days ago • 24

Granite 3.1 Language Models

Collection

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 9 items • Updated Feb 24 • 60

upvoted a collection 4 months ago

Common Corpus

Collection

Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 9

upvoted a collection 5 months ago

POTION

Collection

These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated Feb 3 • 10

upvoted 2 papers 6 months ago

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 22

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 84

upvoted a paper 10 months ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 80

upvoted 3 papers about 1 year ago

GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence

Paper • 2310.05388 • Published Oct 9, 2023 • 4

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30, 2024 • 44

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 612