Join the conversation
Join the community of Machine Learners and AI enthusiasts.
Sign UpI don't think purely human labeling is that good... In most cases, only 80% of people agree on labeling datasets. Also, most human queries are quite simple, they also depend on different time periods, the skill level of the person, in my opinion, all this together is a big problem for llmarena. In general, I don't see any problem with moving away from the human worldview in the broadest sense, since synthetics allow you to create a large number of unique complex queries and answers...
It is not ok to remove people from the equation however efficient the machines are. We can never be sure that the synthetic matches the original in terms of alignment and those further models and further synthetics can derail the whole thing.