AI Benchmarks Explained: GPQA, SWE-bench, Chatbot Arena and What They Actually Measure - TrendCloud