Falcon's picture

Falcon

FalconLlamalpaca

·

akashocx17

AI & ML interests

Security

Recent Activity

upvoted an article 1 day ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

liked a model 3 days ago

google/gemma-3n-E4B-it

liked a Space 3 days ago

google/rad_explain

View all activity

Organizations

upvoted an article 1 day ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 174

upvoted a collection 3 days ago

Gemma 3n

4 items • Updated 9 days ago • 152

upvoted an article 4 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 293

upvoted a paper 5 months ago

Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation

Paper • 2502.00306 • Published Feb 1 • 5

upvoted an article 5 months ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 215

upvoted 2 papers 5 months ago

Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 23

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 62

upvoted 2 articles 5 months ago

Article

Introducing smolagents: simple agents that write actions in code.

By

and 2 others •

Dec 31, 2024

• 1.07k

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.26k

upvoted a collection 5 months ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61

upvoted 2 articles 5 months ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

Jan 31

• 42

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 870

upvoted a paper 5 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

upvoted a paper over 1 year ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 98

upvoted 2 collections over 1 year ago

SLIM GGUF

Quantized GGUF 'tool' implementations of SLIM Models • 30 items • Updated Feb 23 • 11

SLIM Models

Structured Language Instruction Models (SLIMs) • 31 items • Updated Feb 10 • 32

upvoted a paper over 1 year ago

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 73

upvoted a paper about 2 years ago

ChatGPT for Robotics: Design Principles and Model Abilities

Paper • 2306.17582 • Published Feb 20, 2023 • 10