view article Article Train 400x faster Static Embedding Models with Sentence Transformers 22 days ago • 133
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval Paper • 2407.19669 • Published Jul 29, 2024 • 23
VideoPoet: A Large Language Model for Zero-Shot Video Generation Paper • 2312.14125 • Published Dec 21, 2023 • 45
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 45
Efficient World Models with Context-Aware Tokenization Paper • 2406.19320 • Published Jun 27, 2024 • 8
MatchTime: Towards Automatic Soccer Game Commentary Generation Paper • 2406.18530 • Published Jun 26, 2024 • 12
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 97
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 29
LayoutPrompter: Awaken the Design Ability of Large Language Models Paper • 2311.06495 • Published Nov 11, 2023 • 11
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization Paper • 2311.09184 • Published Nov 15, 2023 • 1
BookSum-based Summarizers Collection BookSum-tuned text-to-text summarization models • 7 items • Updated Nov 4, 2024 • 3
SLiC-HF: Sequence Likelihood Calibration with Human Feedback Paper • 2305.10425 • Published May 17, 2023 • 5
Measuring Faithfulness in Chain-of-Thought Reasoning Paper • 2307.13702 • Published Jul 17, 2023 • 28
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Paper • 2307.03601 • Published Jul 7, 2023 • 12