Léo Hunout's picture

Léo Hunout

hunoutl

·

AI & ML interests

AI Engineer working on Jean Zay supercomputer in France 🇫🇷

Recent Activity

upvoted an article 13 days ago

They Said It Couldn’t Be Done

upvoted a paper 20 days ago

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

upvoted a paper 20 days ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

View all activity

Organizations

hunoutl's activity

upvoted an article 13 days ago

Article

They Said It Couldn’t Be Done

By

•

21 days ago

• 75

upvoted 2 papers 20 days ago

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Paper • 2412.02259 • Published 23 days ago • 59

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published 22 days ago • 118

upvoted 17 papers about 1 month ago

Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8 • 19

Face Anonymization Made Simple

Paper • 2411.00762 • Published Nov 1 • 7

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28 • 16

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Paper • 2410.23168 • Published Oct 30 • 24

Zipfian Whitening

Paper • 2411.00680 • Published Nov 1 • 9

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31 • 14

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4 • 11

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published Nov 4 • 23

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6 • 16

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6 • 26

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14 • 54

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22 • 29

Neural Metamorphosis

Paper • 2410.11878 • Published Oct 10 • 8

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 89

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21 • 58

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7 • 49

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7 • 63