4 16 16

Temurbek

raxtemur

raxtemur

AI & ML interests

Topological Data Analisys, Handwritten Text Recognition, Latent Space Exploration

Recent Activity

updated a model 8 days ago

raxtemur/SONAR_200_text_decoder

published a model 8 days ago

raxtemur/SONAR_200_text_decoder

updated a model 5 months ago

raxtemur/sonar-llm-1.3b

View all activity

Organizations

None yet

upvoted a paper 6 months ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 47

upvoted 2 papers 7 months ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20, 2025 • 47

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 125

upvoted a paper 8 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 133

upvoted 2 papers 9 months ago

Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex

Paper • 2505.15813 • Published May 21, 2025 • 5

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 92

upvoted 2 papers 10 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21, 2025 • 78

MediAug: Exploring Visual Augmentation in Medical Imaging

Paper • 2504.18983 • Published Apr 26, 2025 • 7

upvoted 3 papers 11 months ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17, 2025 • 95

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20, 2025 • 72

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 119

upvoted a paper 12 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 174

upvoted a paper about 1 year ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5, 2025 • 45

upvoted a collection about 1 year ago

Planning-with-LLM

Collection

11 items • Updated Sep 4, 2024 • 2

upvoted 2 papers over 1 year ago

MedSyn: LLM-based Synthetic Medical Text Generation Framework

Paper • 2408.02056 • Published Aug 4, 2024 • 2

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 157

Temurbek

AI & ML interests

Recent Activity

Organizations

raxtemur's activity