Florian Zimmermeister

flozi00

AI & ML interests

ASR, German LLM

Recent Activity

updated a model about 15 hours ago
pL-Community/GermanEduScorer-Qwen2-1.5b
updated a dataset 3 days ago
flozi00/Fineweb2-EDUscore-German
updated a dataset 3 days ago
flozi00/InstructGer-exp

Organizations

Training Transformers Together, Speech Recognition Community Event Version 2, A\\Ware, primeLine AI Services, ZeroGPU Explorers, primeLine Research Community, Hugging Face Discord Community, open/ acc, Data Is Better Together Contributor

flozi00's activity

reacted to m-ric's post with 👀 13 days ago
π’πœπšπ₯𝐒𝐧𝐠 π₯𝐚𝐰𝐬 𝐚𝐫𝐞 𝐧𝐨𝐭 𝐝𝐞𝐚𝐝 𝐲𝐞𝐭! New blog post suggests Anthropic might have an extremely strong Opus-3.5 already available, but is not releasing it to keep their edge over the competition. 🧐

❓ Since the release of Opus-3.5 has been delayed indefinitely, there have been lots of rumors and articles about LLMs plateauing. According to these rumors, scaling laws, the main driver of LLM capability gains, may have stopped working, which would explain the stalled progress.

These rumors were quickly denied by many people at the leading LLM labs, including OpenAI and Anthropic. But these people would be expected to hype the future of LLMs even if scaling laws really plateaued, so the jury is still out.

πŸ—žοΈ This new article by Semianalysis (generally a good source, specifically on hardware) provides a counter-rumor that I find more convincing:

➡️ Maybe scaling laws still work and Opus-3.5 is ready and as good as planned, but they just don't release it because the synthetic data it helps provide can bring cheaper/smaller models Claude and Haiku up in performance, without the risk of leaking this precious high-quality synthetic data to competitors.

Time will tell! I feel like we'll know more soon.

Read the article: https://semianalysis.com/2024/12/11/scaling-laws-o1-pro-architecture-reasoning-infrastructure-orion-and-claude-3-5-opus-failures/
upvoted an article 17 days ago