I've begun adding valuable blog posts on creating and using synthetic datasets to my curated list.
I am starting with a great post by @MoritzLaurer on using an open LLM to generate data for training a specialized RoBERTa model.
Read the blog post: https://huggingface.co/blog/synthetic-data-save-costs
See the rest of the list: https://github.com/davanstrien/awesome-synthetic-datasets