FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper • 2502.20126 • Published 1 day ago • 9
WolfCore-L3-8B Collection L3 merge, small but efficient. Includes a DeepSeek R1 Distill merge version. Not tested yet. • 2 items • Updated about 12 hours ago • 2
Article HuggingFace, IISc partner to supercharge model building on India's diverse languages 2 days ago • 9
Frame Series - WolFrame(WolfFrame) Collection Next merge series, focused on RP and story writing. • 3 items • Updated 23 days ago • 2
FoxSpirit Collection Next series, with good results and nearly 'human' responses. (~ ̄▽ ̄)~ Unfortunately, tokenization bugs sometimes occur. (o′┏▽┓`o) • 3 items • Updated 19 days ago • 1
OwO-ified Models V1.0 Collection This is a (better) series of experimental models fine-tuned for generating text in the "OwO/UwU" style. Are they smart? No. Are they fun? Mostly :3 • 7 items • Updated 2 days ago • 1
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 9 days ago • 56
Article Distilling from Dialogues: Finding Meaning in LLM Interactions By chansung • 4 days ago • 4
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published 4 days ago • 9