Profile Picture
  • All
  • Search
  • Images
  • Videos
  • Maps
  • News
  • Copilot
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
  • Top stories
  • Winter Games
  • Sports
  • U.S.
  • Local
  • World
  • Science
  • Technology
  • Entertainment
  • Business
  • More
    Politics
Order byBest matchMost fresh
  • Any time
    • Past hour
    • Past 24 hours
    • Past 7 days
    • Past 30 days

Google's new Gemini Pro model has record benchmark scores

Digest more
Top News
Overview
techbooky.com · 8h
Google Unveils Gemini 3.1 Pro for Advanced Reasoning
Google has announced Gemini 3.1 Pro, an upgraded version of its flagship large language model designed specifically for complex reasoning,

Continue reading

India Today on MSN · 15h
Gemini 3.1 Pro is here, benchmarks says Google is once again leader in AI
 · 13h · on MSN
Google's Gemini 3.1 Pro is here, and it just doubled its reasoning score
CNET · 20h
Google Rolls Out Latest AI Model, Gemini 3.1 Pro
Google took the wraps off its latest AI model , Gemini 3.1 Pro, on Thursday, calling it a "step forward in core reasoning."

Continue reading

 · 22h
Google doubles the reasoning power of its core AI model with Gemini 3.1 Pro
 · 1d
The new Gemini 3.1 Pro AI model “represents a step forward in core reasoning.”
Hosted on MSN
8mon

Nvidia’s Blackwell Conquers Largest LLM Training Benchmark

For those who enjoy rooting for the underdog, the latest MLPerf benchmark results will disappoint: Nvidia’s GPUs have dominated the competition yet again. This includes chart-topping performance on the latest and most demanding benchmark, pretraining the ...
4d

Automat-it Launches LLM Selection Optimizer to Slash Startup LLM Costs by up to 60%

AWS Premier Tier Partner leverages its AI Services Competency and expertise to help founders cut LLM costs using
1d

Taalas Launches Hardcore Chip With ‘Insane’ AI Inference Performance

Taalas has launched an AI accelerator that puts the entire AI model into silicon, delivering 1-2 orders of magnitude greater performance. Seriously.
VentureBeat
1y

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

Hallucinations, or factually inaccurate responses, continue to plague large language models (LLMs). Models falter particularly when they are given more complex tasks and when users are looking for specific and highly detailed responses. It’s a ...
Security
8mon

Simbian launches new security benchmark with AI SOC LLM Leaderboard

Simbian today announced the “AI SOC LLM Leaderboard,” a comprehensive benchmark to measure LLM performance in Security Operations Centers (SOCs). The new benchmark compares LLMs across a diverse range of attacks and SOC tools in a realistic IT ...
2d

Sarvam AI unveils indigenously-built 30B and 105B LLM models

Sarvam AI launches two advanced LLM models, 30B and 105B, outperforming competitors in key benchmarks, focusing on Indian language support.
TechCrunch
1y

This LLM framework takes a first stab at benchmarking Big AI’s compliance with the EU AI Act

While most countries’ lawmakers are still discussing how to put guardrails around artificial intelligence, the European Union is ahead of the pack, having passed a risk-based framework for regulating AI apps earlier this year. The law came into force in ...
VentureBeat
10mon

Beyond generic benchmarks: How Yourbench lets enterprises evaluate AI models against actual data

Every AI model release inevitably includes charts touting how it outperformed its competitors in this benchmark test or that evaluation matrix. However, these benchmarks often test for general capabilities. For organizations that want to use models and ...
  • Privacy
  • Terms