Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...
The launch of Grok 4.3 represents a calculated bet by xAI that the market wants specialized brilliance and extreme cost ...
There is a common problem for all AI companies for overfitting to benchmarks. XAI Grok 4 has some problems with prompt adherence. XAI could have had overfitting resulted from the reinforcement ...
Elon Musk's AI chatbot, Grok 4.1, suggested researchers pretending to be delusional to drive an iron nail through a mirror ...
Grok 4.1 and GPT-5.2 are two of the best AI models on the market right now. Powering the latest versions of ChatGPT and Grok, they are designed to excel at writing, logic, research and creativity. But ...
What if the future of AI could not only dream up stunning web designs but also code them into reality with unmatched precision? In this overview, Universe of AI explores how Grok 4.2, codenamed ...
Grok 4 is a huge leap from Grok 3, but how good is it compared to other models in the market, such as Gemini 2.5 Pro? We now have answers, thanks to new independent benchmarks. LMArena.ai, which is an ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results