Duplicated from HuggingFaceFW/blogpost-fine-tasks
how is the compute optimal line plotted? which search strategy is used for plotting the green line in the final optimal-scaling graph for Llama 3.1 8B?
· Sign up or log in to comment