Alibaba and Microsoft AI best humans in reading test

Artificial intelligence models developed by Microsoft and Alibaba have, for the first time, outperformed humans in a reading comprehension challenge.

The Stanford Question Answering Dataset (SQuAD) consists of a series of questions to which the answers can be found within more than 500 Wikipedia entries.

Alibaba’s deep neural network model scored 82.440 on the ‘exact match’ part of the test, besting the scores achieved by humans (82.304). Microsoft’s similar model achieved a score of 82.650.

The scoreboard is a who’s who of corporates carrying out artificial intelligence research, featuring the likes of Google, IBM Research, Facebook AI Research, Salesforce Research, Tencent and Samsung.

Alibaba and Microsoft have been placed joint first in the ranking, although both companies claim to have reached the better-than-human milestone first.

While Microsoft is listed as having registered its score on 3rd January and Alibaba two days later, Alibaba said those dates were when the companies submitted their models, not when test results were registered.

“It is our great honour to witness the milestone where machines surpass humans in reading comprehension,” said Luo Si, chief scientist for natural language processing at Alibaba’s Institute of Data Science and Technologies (iDST) in a statement. “We are thrilled to see NLP research has achieved significant progress over the year. We look forward to sharing our model-building methodology with the wider community and exporting the technology to our clients in the near future.”

Ming Zhou, assistant managing director of Microsoft Research Asia, said despite the milestone, overall, people are still much better than machines at comprehending the complexity and nuance of language.

“Natural language processing is still an area with lots of challenges that we all need to keep investing in and pushing forward,” he said. “This milestone is just a start.”

The big AI players are investing heavily in reading comprehension and response models.

Alibaba said it had been using the underlying technology during its ‘Global Shopping Festival’ for a number of years to answer customer inquiries.

Microsoft said it was applying earlier versions of the model to its Bing search engine.

“These tools also could let doctors, lawyers and other experts more quickly get through the drudgery of things like reading through large documents for specific medical findings or rarified legal precedent. The technology would augment their work and leave them with more time to apply the knowledge to focus on treating patients or formulating legal opinions,” the company wrote in a blogpost.

It is also working on models that answer probable follow-up questions.

“For example, let’s say you asked a system, ‘What year was the prime minister of Germany born?’ You might want it to also understand you were still talking about the same thing when you asked the follow-up question, ‘What city was she born in?’

“It’s also looking at ways that computers can generate natural answers when that requires information from several sentences. For example, if the computer is asked, ‘Is John Smith a US citizen?’ that information may be based on a paragraph such as, ‘John Smith was born in Hawaii. That state is in the US’” Microsoft explained.

Top AI prompters compete at Global Prompt Engineering Championship

Machines Can See Summit will unveil Public Program during Dubai AI Week

Microsoft advances ‘1 million AI learners’ commitment at Dubai AI Week 2025

Dubai Media Academy launches groundbreaking “Artificial Intelligence Initiative in Arab Media”

Huspy launches GCC’s first AI-powered mortgage chatbot to transform home financing

Sophos powers up cybersecurity in the UAE

Aster Pharmacy unveils largest regional store in Riyadh, pioneering digital healthcare integration

Cisco expands in Saudi Arabia with cloud data centers, AI talent development, and manufacturing plans

Zebra spearheads digital transformation in Saudi Arabia, aligns with Vision 2030

Tech To Make Riyadh Epicentre of MENA Music by 2030

Microsoft AI Tour showcases groundbreaking AI innovations for Oman

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

KROHNE delivers insights to inspire the next generation of engineers in Oman

Oracle supports major project to accelerate Oman digital economy

Ooredoo accelerates cybersecurity in Oman with new deal

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

BDB launches “tijara” platform for SMEs

Bahrain achieves full nationwide 5G coverage

Batelco, SonicWall launch integrated security solutions for SMEs in Bahrain

Bahrain to offer COVID-19 test results on WhatsApp, Facebook Messenger

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Infopercept opens its first Middle East office in Kuwait

Microsoft Compliance Manager now available in Kuwait

Commercial Bank of Kuwait gets mobile payments moving with Thales Digital Solutions

Ooredoo chooses Fortinet to deliver secure SD-WAN managed services in Kuwait

e& enterprise and RAIN Technology to revolutionise Operating Room efficiency in hospitals across MEA

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Looking for the best label solutions in South Africa? Go OKI!

OKI is only going bigger in the South African market!

Huawei honours Women in Tech at Apps UP 2022

ASUS unveils latest ExpertBook P1 models

e& UAE revolutionises telecom tower inspections with AI-powered drones

e& AGM approves 83 fils dividend per share for FY 2024

Google launches its state-of-the-art video model Veo 2 in MENA

Championing cyber resilience: Commvault’s vision for secure digital future

Samsung, e& UAE sign strategic MoU to advance AI-driven innovation, digital experiences at MWC

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Gender Lens investing vital to economic recovery

Virgin Hyperloop unveils location for Hyperloop certification centre

TikTok taps Oracle as secure cloud provider

GDRFA – Dubai launches 8th ICEQ International Conference

Bybit partners with University of Wollongong in Dubai to host Demo Trading Challenge

National IT Academy and Microsoft launch the first Microsoft Datacentre Academy in the Region

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

AWR launches “Mobility and Sustainability through Arts”

Solis poised to transform Dubai’s skyline and deserts into beacons of sustainability

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Huawei launches ground-breaking solar inverter at World Future Energy Summit

Middle East Energy to further boost their sustainability agenda

EDF UK selects Dynatrace to keep the power flowing

Emirates NBD’s collaboration with Kinexys to enhance cross-border payment security

Continental advances AI Integration to boost efficiency, protect client trust

Arab Bank Group achieves record net profit of USD 1 Billion for 2024, 40% cash dividends

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Careem Pay introduces instant transfers to Europe

e& AGM approves 83 fils dividend per share for FY 2024

Abu Dhabi Government accelerates digital strategy with landmark Microsoft, G42 partnership

UNDP and e& strengthen AI collaboration for sustainable development, advancing health and climate solutions

Albania selects Presight for nationwide AI-powered smart city project

EDGE, e& UAE ink collaboration to boost secure communications at IDEX 2025

BD hosts Healthcare Summit in Riyadh in line with Vision 2030

e& enterprise and RAIN Technology to revolutionise Operating Room efficiency in hospitals across MEA

How will Agentic AI ease healthcare’s workforce crisis?

DFF launches fourth edition of ‘Future Opportunities: The Global 50’ report

Emirates Health Services, Dell sign MoU to enhance digital infrastructure in healthcare

Huspy launches GCC’s first AI-powered mortgage chatbot to transform home financing

DLD, VARA collaborate to boost leadership in realty and virtual assets regulation

Open Innovation AI collaborates with Intel to revolutionize AI orchestration with Gaudi

Digitalisation key to accelerating construction development in Middle East, says Trimble

R&M Introduces First Single Pair Ethernet System to Support Middle East Smart Building Trend

ASUS unveils latest ExpertBook P1 models

75% of retailers say AI Agents will be essential to compete

New data: Gen Z embraces AI for social media spending

Yango Group and ROOTS unveil autonomous robots in Dubai

Hisense launches ‘Together Means More This Ramadan’ campaign with exclusive offers across the UAE

Top AI prompters compete at Global Prompt Engineering Championship

Machines Can See Summit will unveil Public Program during Dubai AI Week

Salesforce brings Agentic AI to the Field Service Sector

Microsoft advances ‘1 million AI learners’ commitment at Dubai AI Week 2025

Dubai Media Academy launches groundbreaking “Artificial Intelligence Initiative in Arab Media”