Baidu's ERNIE-5.0-0110 ranks #8 globally on LMArena, becoming the only Chinese model in the top 10 while outperforming ...
OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It? Your email has been sent A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of ...
AI large language models have been especially weak on math. There are now several papers from Google Deep Mind, Alibaba and other universities where AI large language models are at Math Olympiad ...
If OpenAI's new model can solve grade-school math, it could pave the way for more powerful systems. This story is from The Algorithm, our weekly newsletter on AI. To get stories like this in your ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...
Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results