Papers
- Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment (arXiv:2405.03594, published May 2024)
- "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization (arXiv:2411.02355, published Nov 2024)

Collections
- Llama-3.2 Quantization — Llama 3.2 models quantized by Neural Magic (9 items, updated Sep 26)
- Llama-3.1 Quantization — Neural Magic quantized Llama-3.1 models (21 items, updated Sep 26)
- INT8 LLMs for vLLM — Accurate INT8 quantized models by Neural Magic, ready for use with vLLM (50 items, updated Sep 26)