PeterKruger
ยท
AI & ML interests
Neural networks (since 1993), LLMs, AI-based financial analysis, LLM Benchmarks
Recent Activity
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org
view article
Introducing Bot Scanner: A "Skyscanner" for LLM answers
view article
AutoBench Run 2 Results are Out! Surprise: Gemini 2.5 Pro is not the Best Affordable Thinking Model
view article
Escape the Benchmark Trap: AutoBench โ the Collective-LLM-as-a-Judge System for Evaluating AI models (ASI-Ready!)