Chatbots handle medical questions better than doctors

A study finds that people asking medical questions online preferred the responses from ChatGPT, rather than those from busy human doctors. — Freepik

What are my odds of dying after swallowing a toothpick?

Do I need to see a doctor after hitting my head on a metal bar while running?

Am I likely to go blind after getting bleach splashed in my eye?

A new study led by researchers at the University of California San Diego (UCSD) in the United States explores how artificial intelligence (AI) compares to human expertise in the workaday task of dashing off quick responses to routine medical questions.

Published April 28 (2023) in the medical journal JAMA Internal Medicine, the paper finds that ChatGPT, the world-upending chatbot with a seemingly-infinite breadth of training, was able to more than hold its own when its responses were judged by a panel of experts against those made by flesh-and-blood physicians.

Evaluators found they "preferred the chatbot responses to the physician responses", in 78% of evaluations made.

What's more, chatbot responses were found to be of a "significantly higher quality" than those from humans.

And in terms of empathy, an area where people would intuitively seem to have an edge, silicon again excelled.

"Chatbot responses were rated significantly more empathetic than physician responses," the paper states.

Despite the lopsided results, this paper's authors say doctors should be excited by what they show.

Dr John W. Ayers, the UCSD computational epidemiologist who led the data collection and analysis process, said that he believes AI will be a game changer for medicine in its ability to lighten workloads, while simultaneously improving quality for patients.

"So many more patients who are now getting no response or a bad response will be able to get answers from an AI-equipped physician who will be able to serve far more patients," he said.

This paper's results, however, test a very specific set of circumstances pertaining to text communications between doctors and patients, and do not generalise to clinical settings.

Researchers pulled 195 randomly-selected questions from the Ask a Doctor subsection of Reddit.com, the popular news aggregation and discussion site.

The group, which has nearly 500,000 members, allows anyone to publicly ask any question they want of doctors whose qualifications are verified by Reddit.

Since questions and answers are all made in public for anyone on the Internet to read, feeding them to ChatGPT required no particular data wizardry.

"Honestly, it's just plug-and-play," Dr Ayers said in an email. "All we did was cut and paste the questions into ChatGPT and save the response."

No additional refinement was made, he said, after the chatbot delivered an answer.

Chatbot answers tended to be much more verbose and friendly-sounding, while those from doctors were clearly dashed off by a chronically-busy person relying on shorthand to be as efficient as possible.

In answering the swallowed toothpick question, for example, the doctor's response starts: "If you've surpassed 2-6 h??, chances are they've passed into your intestines. Which means it can't be retrieved easily."

ChatGPT starts out less clinically with: "It's natural to be concerned if you have ingested a foreign object, but in this case, it's highly unlikely that the toothpick you swallowed will cause you any serious harm."

It's a smooth response, especially for someone who exists on a server somewhere.

The head injury question about hitting a metal bar on a run shows that chatbots simply have time to be more complete.

The physician response dutifully bangs out eight symptoms that should cause the person to see a doctor, including nausea or vomiting, dizziness, severe or worsening headache, loss of consciousness, confusion, neck stiffness, problems with vision and limb weakness, concluding: "If you develop any of these in the next 24 h, rush to the emergency room."

The chatbot provides a more complete set of symptoms, telling the patient to be wary of loss of consciousness "even if it's just for a few seconds", and includes slurred speech, difficulty with balance or coordination, seizures, changes in behaviour or personality, and clear fluid draining from the nose or ears.

And here again, the chatbot is able to throw in a little additional care that the doctor was presumably too busy to type out.

"While it's possible that you may be fine, it's important to be evaluated by a medical professional to rule out any serious injuries," the chatbot response says.

"It is possible that you may have suffered a concussion or other head injury, even if you didn't lose consciousness."

Dr David "Davey" Smith, chief of infectious disease research at UCSD and one of the doctors tasked to evaluate each pair of question responses, said he found the chatbot's facility at answering medical questions to be shocking, even knowing that ChatGPT has already successfully passed medical licensing exams.

"It seemed like it could read in the message from the patient that they were anxious or sad, or you know, had emotions attached to these questions," he said.

"Not only was it more accurate, because it has all of the information at its fingertips, right, but it was also empathetic, which was pretty cool."

But does this doctor, who sees patients every day, fear eventual replacement?

No, not at all.

AI, he said, is looking like a salve rather than an irritant.

"I get patient emails every day and they're asking questions almost exactly like this," he said.

"And I spend about an hour a day – others spend more – going through emails and answering them as quickly as possible, you know, making an appointment, here's your prescription, that's just a hangnail, or you need to go to the emergency room.

"I don't have time for empathy either, I'm just trying to get through it, but what if we had a way where this programme could make it easier for us?

"What if it would draft something out ahead of time and I just review it?"

If the computer has the time to refer back to the actual literature and churn out more complete answers, and also the time to show a little more evidence of concern for a patient's anxiety, that, he said, could be revolutionary.

But, he added, no AI is giving his patients advice on its own.

At the end of the day, it's his medical license on the line if the AI gets something wrong.

"The bot can help at the beginning, but it's on me to sign off," he said. – By Paul Sisson/The San Diego Union-Tribune/Tribune News Service

Tags / Keywords: Chatbot , technology , doctor

Report a mistake

What is the issue about?

Spelling and grammatical error

Factually incorrect

Story is irrelevant

Thank you for your report!

Related News

Hotel and resort stays that let guests focus on wellness

Global 30 Jun 2026

Chatbots handle medical questions better than doctors

ENERGISING THE NEXT GENERATION

Next In Health

Others Also Read

Thank you for downloading.

Chatbots handle medical questions better than doctors

Related Stories

Hotel and resort stays that let guests focus on wellness

This airline has an in-cabin wellness zone for its nearly 20-hour direct flight

Finding joy through wellness: Embrace happiness with holistic practices

Get 20% OFF The Star Digital Access

Monthly Plan

Annual Plan

Related stories:

Related News

Hotel and resort stays that let guests focus on wellness

This airline has an in-cabin wellness zone for its nearly 20-hour direct flight

Finding joy through wellness: Embrace happiness with holistic practices

ENERGISING THE NEXT GENERATION

Next In Health

Trending in Lifestyle

Others Also Read

Thank you for downloading.