ZeroGen: Efficient Zero-shot Learning via Dataset Generation Paper • 2202.07922 • Published Feb 16, 2022 • 1
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published Oct 23, 2024 • 17
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning Paper • 2410.14157 • Published Oct 18, 2024
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability Paper • 2306.06688 • Published Jun 11, 2023
Generating Data for Symbolic Language with Large Language Models Paper • 2305.13917 • Published May 23, 2023
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Paper • 2402.07754 • Published Feb 12, 2024
L-Eval: Instituting Standardized Evaluation for Long Context Language Models Paper • 2307.11088 • Published Jul 20, 2023 • 5
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models Paper • 2402.13577 • Published Feb 21, 2024 • 10
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27, 2024 • 25
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models Paper • 2210.08933 • Published Oct 17, 2022 • 5
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published Oct 23, 2024 • 17