view article Article Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained β Whatβs Really Changing in Transformers? By Kseniase and 1 other β’ Apr 4 β’ 14
view article Article Open-Source Handwritten Signature Detection Model By samuellimabraz β’ Mar 14 β’ 111
view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others β’ Feb 4 β’ 156
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 252
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 15 items β’ Updated Apr 18 β’ 228
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper β’ 2402.09844 β’ Published Feb 15, 2024 β’ 21