Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? Paper • 2502.11895 • Published Feb 17, 2025 • 2
What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structure Paper • 2302.12239 • Published Feb 23, 2023 • 1
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding Paper • 2311.09707 • Published Nov 16, 2023
When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization Paper • 2411.05882 • Published Nov 8, 2024 • 1
CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model Paper • 1902.06423 • Published Feb 18, 2019
Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots Space • 13.5k
Finally, a Replacement for BERT: Introducing ModernBERT Article • By bclavie and 14 others • Dec 19, 2024 • 683
KennethEnevoldsen/dfm-sentence-encoder-large Feature Extraction • 0.4B • Updated Nov 27, 2024 • 184 • 2