Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
etemizΒ 
posted an update 6 days ago
Post
2266
As more synthetic datasets are made, we move slowly away from human alignment.
deleted
This comment has been hidden
deleted
This comment has been hidden

I don't think purely human labeling is that good... In most cases, only 80% of people agree on labeling datasets. Also, most human queries are quite simple, they also depend on different time periods, the skill level of the person, in my opinion, all this together is a big problem for llmarena. In general, I don't see any problem with moving away from the human worldview in the broadest sense, since synthetics allow you to create a large number of unique complex queries and answers...

Β·

It is not ok to remove people from the equation however efficient the machines are. We can never be sure that the synthetic matches the original in terms of alignment and those further models and further synthetics can derail the whole thing.