Artificial Analysis overhauls its AI Intelligence Index, replacing saturated benchmarks with real-world tests measuring ...
Beijing-based Ubiquant launches code-focused systems claiming benchmark wins over US peers despite using far fewer parameters ...
On December 22, Z.ai released GLM-4.7, the latest iteration of its GLM large language model family. Designed to handle ...
Z.ai released GLM-4.7 ahead of Christmas, marking the latest iteration of its GLM large language model family. As open-source models move beyond chat-based applications and into production ...
Chinese AI startup’s release is a major update to its open-source model series, aimed at multi-language programming and ...
MiniMax M2 was released in late October this year. The company stated that M2.1 demonstrated significant improvements in ...
This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...
Over the last decade, artificial intelligence (AI) has been largely built around large language models (LLMs). These systems are based on a language and guess words in a chain in the form of tokens.
Most people upgrade their MacBook Pro no more frequently than once every five years, and some leave it considerably longer than that. But Apple always makes it sound like even last year’s model is now ...
We did an informal poll around the Hackaday bunker and decided that, for most of us, our favorite programming language is solder. However, [Stephen Cass] over at IEEE Spectrum released their annual ...
According to Greg Brockman (@gdb), OpenAI's latest reasoning system has achieved a perfect score on the 2025 ICPC programming competition, as confirmed by Mostafa ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results