Microsoft’s medical AI system makes diagnoses 4 times more accurately than doctors
06.07.25
Microsoft has introduced the MAI Diagnostic Orchestrator (MAI-DxO) artificial intelligence system, which, according to the company, is capable of making medical diagnoses four times more accurately than real doctors. At the same time, its work costs less.
The system was tested on 304 complex clinical cases published in the authoritative New England Journal of Medicine. MAI-DxO did not just give a result, but went through the diagnostic process step by step – just like a doctor does. Especially for such tests, Microsoft has developed a new test protocol called Sequential Diagnosis Benchmark.
The peculiarity of MAI-DxO is that it works as a “collective mind” – it combines the answers and analytical capabilities of several advanced AI models at once. The “virtual consilium” includes systems from OpenAI, Google, Anthropic, Meta and even Grok from Elon Musk. Together, they analyze symptoms, suggest hypotheses, refine data, and ultimately issue a final diagnosis.
The results were impressive. The AI achieved an accuracy of 80 percent, while the average among doctors under similar conditions was only 20. In addition, MAI-DxO turned out to be more economical: it chooses more affordable but more effective diagnostic methods and reduces overall costs by about 20 percent.
The head of Microsoft’s AI direction, Mustafa Suleiman, who previously worked at Google, called this development a step towards creating a super-intelligent medical system. According to him, such solutions can radically change the approach to diagnosis, and in the future – to treatment.
So far, Microsoft has not disclosed how exactly it intends to use the technology. Among the possible options are its implementation in the Bing search engine as a self-diagnosis tool or the creation of assistive solutions for doctors. It is already known that MAI-DxO will soon begin testing in real clinics.
However, the technology is controversial. First, doctors were forbidden to use any auxiliary tools during the test, which casts doubt on the correctness of the comparison. Second, AI is not yet able to take into account the emotional state of the patient or the characteristics of a particular medical institution – for example, the availability of equipment. Despite this, most experts agree: if the system shows comparable effectiveness in real practice, it will be a real breakthrough – albeit not the most joyful for doctors.
Don't miss interesting news
Subscribe to our channels and read announcements of high-tech news, tes
Oppo A6 Pro smartphone review: ambitious
Creating new mid-range smartphones is no easy task. Manufacturers have to balance performance, camera capabilities, displays, and the overall cost impact of each component. How the new Oppo A6 Pro balances these factors is discussed in our review.
Oppo Reno 15 5G smartphone review: confident
The Oppo Reno15 smartphone emphasizes design, a high-quality display, versatile cameras, and good battery life. Let’s take a closer look.
Starlink Mobile 5G will provide connection speeds of 150 Mbps even in the Arctic internet SpaceX
SpaceX is preparing to launch its global 5G-enabled Starlink Mobile service. The US Federal Communications Commission (FCC) has approved the launch of up to 15,000 direct-to-cell (D2C) satellites.
Study: Artificial Intelligence uses nuclear weapons in 95% of simulations artificial intelligence war
Researchers at King’s College London conducted a series of military simulations using leading artificial intelligence models. The tests used OpenAI’s GPT-5.2, Anthropic’s Claude Sonnet 4, and Google’s Gemini 3 Flash.


