Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
7.4
TFLOPS
4
5
10
Pro Creations
PRO
ProCreations
Follow
sudanenator's profile picture
OldKingMeister's profile picture
Dcas89's profile picture
66 followers
Β·
11 following
https://pro-ai.glitch.me
ProCreations-Offical
AI & ML interests
AGI and small scale-high quality AI
Recent Activity
replied
to
Ruurd
's
post
1 day ago
The past year I have been trying to get diffusion models to work for language generation, without having to retrain a LLM from scratch. And recently, we finally succeeded: We introduce "LAD: LoRA-Adapted Denoiser", a method to convert a LLaMA model into a text diffusion model using LoRA finetuning and structured input corruption. π― Try the demo and read the write-up here! https://ruurdkuiper.github.io/tini-lad/ Unlike autoregressive (word-for-word) models like ChatGPT, diffusion models iteratively refine a noised sequence. However, most current diffusion approaches rely on all-parameter retraining and repeatedly remasking tokens, which is costly and slow during both training and inference! π§ With LAD: - We can finetune an autoregressive model for diffusive generation in just 10 hours on a single GPU. - Test-time compute is fully adjustable: fewer steps means faster outputs while more steps improve output quality. - Due to our unique noising schedule, remasking is not always needed during inference. All tokens are attended to in each iteration! π LAD is built using: β A frozen LLaMA-8B backbone β Structured noising: token swaps, duplications, replacements, span shifts β Modified attention masks for bidirectional decoding π‘ We show that even small, fast-trained models can perform diffusive generation β with competitive benchmark performance, perplexity and more flexible test-time behavior than traditional transformers.
liked
a dataset
1 day ago
ProCreations/Ultra-FineWeb-EDU
posted
an
update
2 days ago
NEW DATASETTTTT https://huggingface.co/datasets/ProCreations/Ultra-FineWeb-EDU This dataset is an extremely filtered version of https://huggingface.co/datasets/openbmb/Ultra-FineWeb for educational content only, inspired by https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu Right now dataset size is small (64k examples) and more examples are being processed as you read this!
View all activity
Organizations
ProCreations
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
1 day ago
ProCreations/Ultra-FineWeb-EDU
Viewer
β’
Updated
2 days ago
β’
64k
β’
32
β’
2
liked
a Space
7 days ago
Running
5
5
Ai Labs
π»
Watch and experiment with realtime AIβs with visuals
liked
3 models
26 days ago
Qwen/Qwen3-0.6B
Text Generation
β’
Updated
18 days ago
β’
850k
β’
β’
339
Qwen/Qwen3-30B-A3B
Text Generation
β’
Updated
18 days ago
β’
293k
β’
β’
629
Qwen/Qwen3-235B-A22B
Text Generation
β’
Updated
18 days ago
β’
184k
β’
β’
931
liked
a dataset
27 days ago
ProCreations/quantum-randomness
Preview
β’
Updated
27 days ago
β’
87
β’
4
liked
a model
about 1 year ago
guinmoon/LLMFarm_Models
Updated
Aug 6, 2024
β’
1.32k
β’
9
liked
3 models
over 1 year ago
Aryanne/Orca-Mini-3B-gguf
Updated
Sep 27, 2023
β’
625
β’
5
dataautogpt3/OpenDalleV1.1
Text-to-Image
β’
Updated
Jan 19, 2024
β’
2.28k
β’
502
apple/DFN5B-CLIP-ViT-H-14-378
Updated
Feb 28
β’
644k
β’
88