AI Google research

Every tenth answer is a mistake: study questions the accuracy of Google’s AI answers

12.04.26

Google’s AI-powered search summaries demonstrate a high level of accuracy, yet a noticeable share of errors remains. According to the study, around 10% of responses are inaccurate — which, at the scale of Google Search, translates into a massive volume of misleading information.

How AI Overviews work

AI Overviews are a Google feature that generates concise answers to user queries using Gemini AI models. The technology was first introduced in 2024 and has since been widely rolled out across multiple regions, including Ukraine.

The system aggregates data from various sources and produces a short summary, allowing users to get information quickly without visiting multiple links.

Study findings

A joint study by The New York Times and the startup Oumi found that approximately 90% of AI Overviews responses are accurate. However, about one in ten answers contains errors or misleading information.

The evaluation was conducted using the SimpleQA benchmark — a set of 4,000 questions developed by OpenAI. Results showed that accuracy improved after model updates: earlier versions achieved around 85%, while newer iterations exceeded 90%.

Still, even this level of accuracy raises concerns given the scale of Google Search. When extrapolated, it may result in millions of incorrect responses every hour.

Examples of inaccuracies

The report highlights several specific cases. For instance, when asked about the date Bob Marley’s former home became a museum, the system cited sources that either lacked clear dates or contained incorrect information.

In another example, the AI claimed that a particular classical music institution did not exist, despite referencing its official website. Such inconsistencies point to reliability issues in AI-generated responses.

Google’s response

Google criticized the study’s methodology, arguing that the benchmark used may contain inaccuracies and does not reflect real-world search behavior.

According to the company, internal evaluations rely on a more carefully curated dataset, providing a more accurate picture of system performance.

Why evaluating AI is difficult

Assessing generative AI systems remains a complex task. Different benchmarks can produce varying results, and models may generate different answers to the same question.

Additionally, AI Overviews does not rely on a single model — instead, it dynamically selects the most appropriate system for each query. More advanced models tend to be slower and more resource-intensive, so they are not always used.

The main risk: user trust

Despite clear progress, the biggest concern lies in how users воспринимают AI-generated answers. Many tend to trust them without verification, even when errors are possible.

While using internet sources improves accuracy, it also increases the risk of spreading misinformation.

Although Google includes disclaimers that AI responses may be incorrect, in practice many users do not double-check the information they receive.

Don't miss interesting news

Subscribe to our channels and read announcements of high-tech news, tes

We are on Facebook We are on Instagram We are on Telegram

Leave a Reply Cancel reply

Articles & tests

03.11.25
Oppo A6 Pro smartphone review: ambitious

915

Creating new mid-range smartphones is no easy task. Manufacturers have to balance performance, camera capabilities, displays, and the overall cost impact of each component. How the new Oppo A6 Pro balances these factors is discussed in our review.

20.07.26
Logitech Signature Comfort Plus Combo MK880 review: comfort in priority

Logitech Signature Comfort Plus Combo MK880 is a wireless keyboard and mouse set that focuses on comfort during long hours of work, not only due to the ergonomics of the case, but also constructive additions.

20.07.26 | 05.20
Logitech Signature Comfort Plus Combo MK880 review: comfort in priority

06.07.26 | 05.06
Sony WF-1000XM6 Bluetooth headphones review: full power

01.07.26 | 05.19
Logitech MX Keys S Combo wireless keyboard and mouse set review: tactile hi-end

19.05.26 | 06.06
One UI 8.5 Gives Older Samsung Phones a New Lease on Life — Here’s What the Update Brings

11.05.26 | 05.00
Logitech G512 X 75 keyboard review: maximized sleekness

27.04.26 | 06.05
Infinix GT 50 Pro unboxing: a gaming monster with liquid cooling and triggers

14.04.26 | 13.50
Samsung Galaxy A57 vs A37: Is the €100 Upgrade Worth It?

16.03.26 | 05.00
Acer Predator Helios 18 AI (PH18-73) laptop review: god-mode

12.03.26 | 05.05
Logitech G PRO X2 Superstrike Lightspeed mouse review: person solution

09.03.26 | 06.07
Protecting your site from fuzzing: They break you while you sleep

02.03.26 | 05.23
Acer Nitro Lite 16 (NL16-71G) laptop review: versatile and attractive

23.02.26 | 05.48
Oppo Reno 15 5G smartphone review: confident

17.02.26 | 22.00
Logitech G G325 headphones review: reliable and long-lasting

02.02.26 | 05.04
Poco M8 Pro smartphone review: give us more

26.01.26 | 05.05
Home autonomous power sources: inverters, batteries, solar panels

News

24.07.26 | 19.01
The Vatican entered into a dispute with AI detectors over the first encyclical of Leo XIV

After the publication of his first encyclical, Magnifica Humanitas, Pope Leo XIV suddenly found himself at the center of the debate on the use of artificial intelligence.

24.07.26 | 17.02
Light Flip: a $300 minimalist digital detox flip

The startup Light has introduced a new flip phone Light Flip with an OLED screen, created for digital detox.

24.07.26 | 19.01
The Vatican entered into a dispute with AI detectors over the first encyclical of Leo XIV

24.07.26 | 17.02
Light Flip: a $300 minimalist digital detox flip

24.07.26 | 13.03
Honor Robot Phone received a rotating camera and ARRI technologies

24.07.26 | 10.02
Raspberry Pi introduced the 10-inch Touch Display 2 for interactive projects

24.07.26 | 07.04
Lenovo Lecoo AI Mini: a compact mini PC with a processor from the past

23.07.26 | 19.05
British aerotaxi Valo made its first public flight

23.07.26 | 17.02
Volkswagen presented an innovative electric bicycle: car-level safety

23.07.26 | 13.01
A refrigerator for the people in Japan: the Do Hiemon Box cabin to beat the heat

23.07.26 | 12.06
Samsung introduced Galaxy Watch Ultra 2 and Galaxy Watch 9. First impressions

23.07.26 | 11.02
Samsung enters the market of smart glasses: presented a gadget on Android XR

23.07.26 | 10.13
Call of Duty: Modern Warfare 4 will receive two stages of testing already in August

23.07.26 | 07.07
Synthetic Video Detector from NVIDIA will help the media fight against deepfakes

22.07.26 | 19.03
Thought control: the world’s first brain-robot platform presented

22.07.26 | 17.08
5G communication is already working in Kyiv

22.07.26 | 16.05
Samsung Galaxy Fold 8, Fold 8 Ultra and Galaxy Flip 8 foldable smartphones unveiled. First impressions