News
Internal docs show xAI paid contractors to "hillclimb" Grok's rank on a coding leaderboard above Anthropic's Claude.
“China’s Kimi K2 is having its mini DeepSeek moment: it is now #14 on OpenRouter today, ahead of Grok 4 and GPT-4.1,” Deedy ...
As reported by TechCrunch, Grok 4 scored 25.4% on Humanity’s Last Exam without “tools,” outperforming Google’s Gemini 2.5 Pro ...
Grok 4 by xAI was released on July 9, and it's surged ahead of competitors like DeepSeek and Claude at LMArena, a leaderboard ...
Grok 4 is a huge leap from Grok 3, but how good is it compared to other models in the market, such as Gemini 2.5 Pro? We now ...
Modern Engineering Marvels on MSN1d
Pentagon’s Grok Gamble: Can Musk’s AI Outpace Rivals and Outrun Controversy?Is the Pentagon’s $200 million bet on Grok a masterstroke for American AI supremacy, or a high-stakes experiment with untested technology and ethics? The Department of Defense’s selection of xAI’s ...
Grok 4 is leading several notable benchmarks, narrowly beating seasoned players like OpenAI and Google. Since its launch, ...
2d
Cryptopolitan on MSNChinese AI model Kimi K2 undercuts rivals with low pricesIn a fresh twist to the growing AI rivalry, Alibaba-backed startup Moonshot has unveiled its latest large language model, ...
CEO Elon Musk’s year has felt more like a high-stakes reality show. The EV giant finally rolled out its Robotaxis, though not ...
Alibaba-backed startup Moonshot released on late Friday night its Kimi K2 model, touting performance that rivals many U.S.
Moonshot’s Kimi K2 AI model beats GPT-4.1 and Claude in coding benchmarks, offers open-source access, and slashes costs ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results