Yedidia AGNIMO's picture

24 1

Yedidia AGNIMO

Yedson54

·

AI & ML interests

Reinforcement Learning, Federated Learning

Organizations

Yedson54's activity

upvoted 5 papers 7 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 55

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20, 2024 • 52

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 44

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 14

upvoted 4 papers 8 months ago

Towards Building the Federated GPT: Federated Instruction Tuning

Paper • 2305.05644 • Published May 9, 2023 • 5

A Web-Based Solution for Federated Learning with LLM-Based Automation

Paper • 2408.13010 • Published Aug 23, 2024 • 10

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5, 2024 • 90

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16, 2024 • 12

upvoted 4 papers 9 months ago

Patch-Level Training for Large Language Models

Paper • 2407.12665 • Published Jul 17, 2024 • 17

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation

Paper • 2407.10817 • Published Jul 15, 2024 • 15

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Paper • 2407.10457 • Published Jul 15, 2024 • 25

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

Paper • 2407.09413 • Published Jul 12, 2024 • 11

upvoted 7 papers 10 months ago

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 63

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Paper • 2407.09435 • Published Jul 12, 2024 • 23

H2O-Danube3 Technical Report

Paper • 2407.09276 • Published Jul 12, 2024 • 20

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 37

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7, 2024 • 10

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 87

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 27