AI outperformed doctors in Harvard Medical School tests

05.05.26

The results of a recent study show a shift that until recently seemed theoretical: language models are beginning to compete with doctors in primary diagnosis. When time and information are scarce, as in emergency departments, algorithms show accuracy comparable to, or even higher than, that of specialists. This is not a futuristic scenario but concrete data obtained under conditions close to clinical practice.

How the capabilities of AI in medicine were tested

The study, published in the journal Science, was conducted by scientists at Harvard Medical School in collaboration with Beth Israel Deaconess Medical Center. OpenAI models were tested, in particular o1 and 4o.

The experiment was designed to be as close as possible to real practice. The researchers took 76 clinical cases of patients who sought emergency care. Diagnosis proceeded in parallel: physicians on one side, algorithms on the other.

The results were then evaluated by independent specialists who did not know whether a given diagnosis came from a person or a machine. This “blind” format minimized bias.
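
The blinding step described above can be illustrated with a minimal sketch. It assumes each diagnosis arrives as a (source, text) pair; the function name blind_cases, that data layout and the example strings are hypothetical and are not taken from the study.

```python
import random


def blind_cases(diagnoses, seed=0):
    """Hide who produced each diagnosis before it goes to the graders.

    `diagnoses` is a list of (source, text) pairs (hypothetical layout).
    Returns the anonymized items for graders and a private key that maps
    each item id back to its source for unblinding after scoring.
    """
    rng = random.Random(seed)
    order = list(range(len(diagnoses)))
    rng.shuffle(order)  # random order, so position does not reveal the source

    blinded, key = [], {}
    for item_id, idx in enumerate(order):
        source, text = diagnoses[idx]
        blinded.append({"id": item_id, "diagnosis": text})  # no source field
        key[item_id] = source  # kept away from the graders
    return blinded, key


# Hypothetical example input, for illustration only.
items, answer_key = blind_cases(
    [("physician", "pulmonary embolism"), ("model", "pneumonia")]
)
```

Graders would see only the anonymized items; the key is consulted after scoring to attribute each grade to a person or a model.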

The key point: the models worked with raw data from electronic medical records, the same data available to the doctor at the time of the appointment. This rules out the effect of “prepared conditions” and makes the results more representative.

Where the algorithms were more effective

AI showed its most pronounced advantage at the triage stage, the initial assessment of the patient’s condition. This is the most difficult moment: information is minimal, the decision must be made quickly, and the cost of error is high.

The o1 model gave an accurate or nearly correct diagnosis in 67% of cases. For comparison:

one of the doctors scored about 55%
the second scored about 50%

And this was not a one-off stroke of luck: the model consistently performed at least as well as the doctors at the later stages of symptom analysis too.
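
To put the triage percentages in perspective, here is a rough sketch of how much statistical uncertainty a sample of this size carries. It assumes that all 76 cases were scored at the triage stage and treats each rate as an independent proportion; neither assumption is confirmed by the article, and the Wilson score interval is chosen here only for illustration, not because the study used it.

```python
import math


def wilson_interval(p_hat, n, z=1.96):
    """Approximate 95% Wilson score interval for a proportion p_hat out of n."""
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


N_CASES = 76  # assumption: every case in the study was scored at triage

# Rates as reported in the article; the dictionary labels are illustrative.
reported = {"o1": 0.67, "doctor 1": 0.55, "doctor 2": 0.50}

for name, rate in reported.items():
    low, high = wilson_interval(rate, N_CASES)
    print(f"{name}: {rate:.0%} (interval {low:.0%} to {high:.0%})")
```

Under these assumptions each interval spans roughly ten percentage points on either side of the reported rate, which is one reason larger trials are needed before drawing firm conclusions.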

The researchers also note that the algorithms proved particularly strong in pattern recognition: they quickly “assemble” a picture from scattered features, even when the data is incomplete or noisy.

Limitations: why doctors are not going anywhere

Despite the impressive numbers, it is premature to talk about replacing doctors. The authors of the study emphasize that the current results are a demonstration of potential, not a ready-made clinical solution.

There are a number of fundamental limitations:

there is no legal and professional responsibility for AI decisions
algorithms do not take into account the entire context of the patient (social, behavioral, psychological)
large-scale prospective trials in real hospitals are needed

In addition, even with high accuracy, models can make errors in non-standard ways – and such errors are more difficult to predict.

What this changes for medicine

The Harvard Medical School work effectively documents a new role for AI: from a data analysis tool to a participant in clinical reasoning.

In practice, this may lead to the emergence of a hybrid model of medicine, where:

AI handles the initial processing and generates hypotheses
the doctor checks, interprets and makes the final decision

The main effect is augmentation, not competition. Where a person is limited by time and the volume of information, algorithms can provide a timely “second opinion”.

Where the question used to be “will AI replace the doctor”, it is now framed differently: how much faster and more accurate will medicine become once a person and an algorithm work as a single system.
