homehome Home chatchat Notifications


Microsoft sniffs for cancer clues in your search queries

Microsoft researchers data mined health queries and detected pancreatic cancer symptoms before the user even thought about it.

Tibi Puiu
June 8, 2016 @ 4:50 pm

share Share

Search engines today use extremely sophisticated algorithms to guess what you’ll be searching for next based on your previous queries. This optimization has paid off very well for companies like Google, for instance, which can use this information to serve you better, more relevant results to queries, but also sell better ads.

It’s thrilling to hear, though, that more or less the same technology is used to predict which people have cancer before they even visited a doctor — powerful tech that’s used to save lives, not just make a hefty profit.

Dr. Eric Horvitz is both a medical doctor and a computer scientist, a double background that serves him well as the head of Microsoft’s newly founded Health and Wellness division.

One day, Horvitz got a call from a friend who was feeling sick. After describing his symptoms, Horvitz advised him to seek medical help. Not long after, the man was diagnosed with pancreatic cancer and died only a few months later.

Pancreatic cancer is one of the most unforgiving diseases out there, with only 3 percent of patients surviving five years after the diagnosis.

Today, Horvitz and colleagues at Microsoft published a paper in which they claim search queries can be used to predict if a person has pancreatic cancer with pretty good accuracy, considering they’re only working with anonymous queries.

Using data from Bing, Microsoft’s search engine, the researchers devised a computer model that could tell if the symptoms people query online are linked with pancreatic cancer. The researchers say they could distinguish between ‘serious’ concerned queries and those based on anxiety. They could also sniff cancer before a person was even considering searching for ‘cancer symptoms.’

Pancreatic cancer is particularly hard to sniff because its symptoms don’t seem very severe: itchy skin, weight loss, light-colored stools, patterns of back pain and a slight yellowing of the eyes and skin. This also made it very interesting to use as a data mining target because the symptoms could be easily confused with other diseases.

Eric Horvitz. Photo: Scott Eklund/Red Box Pictures)

Eric Horvitz. Photo: Scott Eklund/Red Box Pictures)

According to the paper published in the Journal of Oncology Practice, the researchers could “identify 5% to 15% of pancreatic cancer cases, while preserving extremely low false-positive rates (0.00001 to 0.0001).”

That means 1 in 100,000 to 10,000 people would be informed that they might have pancreatic cancer, but not actually have it. This would scare some, but it would be worth it considering the lives saved. The authors of the paper say this early diagnosis could up the five-year pancreatic cancer survival rate to 5 to 7 percent.

“We are excited about applying this analytical pipeline to other devastating and hard-to-detect diseases,” Horvitz said.

It’s worth noting that the researchers didn’t check their results with the health records of the people doing the online queries since these were anonymous. As such, the team’s claim that it could raise the survival rate, as well as the false positive rate, should be taken with a grain of salt.

Nevertheless, it’s refreshing to see health search queries being put to good use. We now live in an age where people first go to Google to type their symptoms instead of calling their doctors. At least on in ten internet searches are health related.

With no medical training or experience, it’s easy to get lost down a rabbit hole which can cause anxious, sleepless nights. But maybe soon enough, you’ll get a personal health assistant that can actually interpret your illness and pain, then give you the right shove to visit a (human) professional.

“People are being diagnosed too late,” he said. “We believe that these results frame a new approach to pre-screening or screening, but there’s work to do to go from the feasibility study to real-world fielding.”

share Share

This 5,500-year-old Kish tablet is the oldest written document

Beer, goats, and grains: here's what the oldest document reveals.

A Huge, Lazy Black Hole Is Redefining the Early Universe

Astronomers using the James Webb Space Telescope have discovered a massive, dormant black hole from just 800 million years after the Big Bang.

Did Columbus Bring Syphilis to Europe? Ancient DNA Suggests So

A new study pinpoints the origin of the STD to South America.

The Magnetic North Pole Has Shifted Again. Here’s Why It Matters

The magnetic North pole is now closer to Siberia than it is to Canada, and scientists aren't sure why.

For better or worse, machine learning is shaping biology research

Machine learning tools can increase the pace of biology research and open the door to new research questions, but the benefits don’t come without risks.

This Babylonian Student's 4,000-Year-Old Math Blunder Is Still Relatable Today

More than memorializing a math mistake, stone tablets show just how advanced the Babylonians were in their time.

Sixty Years Ago, We Nearly Wiped Out Bed Bugs. Then, They Started Changing

Driven to the brink of extinction, bed bugs adapted—and now pesticides are almost useless against them.

LG’s $60,000 Transparent TV Is So Luxe It’s Practically Invisible

This TV screen vanishes at the push of a button.

Couple Finds Giant Teeth in Backyard Belonging to 13,000-year-old Mastodon

A New York couple stumble upon an ancient mastodon fossil beneath their lawn.

Worms and Dogs Thrive in Chernobyl’s Radioactive Zone — and Scientists are Intrigued

In the Chernobyl Exclusion Zone, worms show no genetic damage despite living in highly radioactive soil, and free-ranging dogs persist despite contamination.