Grok 4 Testing - Search News

xAI is testing Grok 4.20 to take on GPT-5, may launch this month

Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...

xAI launches Grok 4.3 at an aggressively low price and a new, fast, powerful voice cloning suite

The launch of Grok 4.3 represents a calculated bet by xAI that the market wants specialized brilliance and extreme cost ...

NextBigFuture

XAI Grok 4 Scoring Poorly in Some Realworld Tests

There is a common problem for all AI companies for overfitting to benchmarks. XAI Grok 4 has some problems with prompt adherence. XAI could have had overfitting resulted from the reinforcement ...

NewsBytes

Study finds Grok gives dangerous response in delusion test scenario

Elon Musk's AI chatbot, Grok 4.1, suggested researchers pretending to be delusional to drive an iron nail through a mirror ...

Hosted on MSN

I tested ChatGPT-5.2 vs Grok 4.1 with 7 challenging prompts — here's the winner

Grok 4.1 and GPT-5.2 are two of the best AI models on the market right now. Powering the latest versions of ChatGPT and Grok, they are designed to excel at writing, logic, research and creativity. But ...

Geeky Gadgets

Grok 4.2 Quiet Trials Show Sharper UI, Cleaner Code, Plus Playable Games

What if the future of AI could not only dream up stunning web designs but also code them into reality with unmatched precision? In this overview, Universe of AI explores how Grok 4.2, codenamed ...

Bleeping Computer

Grok 4 benchmark results: Tops math, ranks second in coding

Grok 4 is a huge leap from Grok 3, but how good is it compared to other models in the market, such as Gemini 2.5 Pro? We now have answers, thanks to new independent benchmarks. LMArena.ai, which is an ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results