LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46
Speculative Streaming: Fast LLM Inference without Auxiliary Models Paper • 2402.11131 • Published Feb 16, 2024 • 43
Ego4D: Around the World in 3,000 Hours of Egocentric Video Paper • 2110.07058 • Published Oct 13, 2021
eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models Paper • 2309.00964 • Published Sep 2, 2023 • 2
Sequential Voting with Relational Box Fields for Active Object Detection Paper • 2110.11524 • Published Oct 21, 2021
Domain Adaptive Hand Keypoint and Pixel Localization in the Wild Paper • 2203.08344 • Published Mar 16, 2022
FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline Paper • 2312.11537 • Published Dec 15, 2023 • 7
Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation Paper • 2303.04991 • Published Mar 9, 2023