@etemiz on Hugging Face: "As more synthetic datasets are made, we move slowly away from human alignment."

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

etemiz

posted an update Dec 20, 2024

Post

2343

As more synthetic datasets are made, we move slowly away from human alignment.

deleted

Dec 20, 2024

This comment has been hidden

deleted

Dec 20, 2024

This comment has been hidden

hivaze

Dec 21, 2024

I don't think purely human labeling is that good... In most cases, only 80% of people agree on labeling datasets. Also, most human queries are quite simple, they also depend on different time periods, the skill level of the person, in my opinion, all this together is a big problem for llmarena. In general, I don't see any problem with moving away from the human worldview in the broadest sense, since synthetics allow you to create a large number of unique complex queries and answers...

etemiz

Dec 22, 2024

It is not ok to remove people from the equation however efficient the machines are. We can never be sure that the synthetic matches the original in terms of alignment and those further models and further synthetics can derail the whole thing.

In this post