web's picture

web

dim

·

dmitrymailk

AI & ML interests

dimweb, LM/LLM pronouns

Recent Activity

updated a model 2 minutes ago

dim/2025_05_08_13_21_20_019296_checkpoint-14022

published a model about 1 hour ago

dim/2025_05_08_13_21_20_019296_checkpoint-14022

updated a dataset 10 days ago

dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

View all activity

Organizations

dim's activity

upvoted a paper 21 days ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 48

upvoted a paper 24 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 130

upvoted an article 2 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 161

upvoted a paper 3 months ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published Feb 18 • 72

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 852

upvoted 2 papers 10 months ago

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published Jul 17, 2024 • 80

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 37

upvoted 4 papers 11 months ago

Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task

Paper • 2406.14213 • Published Jun 20, 2024 • 21

nabla^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials

Paper • 2406.14347 • Published Jun 20, 2024 • 102

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Paper • 2406.10601 • Published Jun 15, 2024 • 70

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 51

upvoted a collection over 1 year ago

Instruct datasets in Russian

All datasets have been translated using Google Translate • 14 items • Updated Mar 10 • 8