Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12, 2024
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions Paper • 2404.13208 • Published Apr 19, 2024
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6, 2024
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity Paper • 2401.17072 • Published Jan 30, 2024
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023
Supervised Knowledge Makes Large Language Models Better In-context Learners Paper • 2312.15918 • Published Dec 26, 2023
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023
How FaR Are Large Language Models From Agents with Theory-of-Mind? Paper • 2310.03051 • Published Oct 4, 2023