Hemanth-thunder's picture

Hemanth-thunder

Hemanth-thunder

·

https://www.kaggle.com/hemanthkumar21

AI & ML interests

NLP, Machine Translation

Recent Activity

liked a model 1 day ago

unsloth/gemma-3-4b-it-GGUF

liked a model 5 days ago

facebook/nllb-200-distilled-600M

liked a model 6 days ago

deepseek-ai/DeepSeek-Prover-V2-671B

View all activity

Organizations

Hemanth-thunder's activity

upvoted an article 19 days ago

Article

Vision Language Models Explained

Apr 11, 2024

• 330

upvoted a paper 19 days ago

NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes

Paper • 2504.11544 • Published 27 days ago • 42

upvoted 2 papers about 1 month ago

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12, 2024 • 49

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 104

upvoted a paper about 2 months ago

New Trends for Modern Machine Translation with Large Reasoning Models

Paper • 2503.10351 • Published Mar 13 • 23

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 13 days ago • 608

upvoted a collection 4 months ago

DeepSeek-R1

8 items • Updated Jan 21 • 628

upvoted a collection 8 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 602

upvoted a paper 8 months ago

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Paper • 2402.10110 • Published Feb 15, 2024 • 3

upvoted a collection 10 months ago

DCLM

DCLM Models + Datasets • 6 items • Updated Oct 4, 2024 • 25

upvoted a paper 12 months ago

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27, 2024 • 25

upvoted a collection about 1 year ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Apr 3 • 331

upvoted 2 papers over 1 year ago

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Paper • 2309.11674 • Published Sep 20, 2023 • 31

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85

upvoted a collection over 1 year ago

Tamil-Llama Models and Datasets

11 items • Updated Oct 20, 2024 • 13

upvoted a paper over 1 year ago

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 87