X

IBM's AI loses debate to a human, but it's got worlds to conquer

The tech was "surprisingly charming and human-sounding," and it's about to head out into the real world.

Stephen Shankland Former Principal Writer
Stephen Shankland worked at CNET from 1998 to 2024 and wrote about processors, digital photography, AI, quantum computing, computer science, materials science, supercomputers, drones, browsers, 3D printing, USB, and new computing technology in general. He has a soft spot in his heart for standards groups and I/O interfaces. His first big scoop was about radioactive cat poop.
Expertise Processors, semiconductors, web browsers, quantum computing, supercomputers, AI, 3D printing, drones, computer science, physics, programming, materials science, USB, UWB, Android, digital photography, science. Credentials
  • Shankland covered the tech industry for more than 25 years and was a science writer for five years before that. He has deep expertise in microprocessors, digital photography, computer hardware and software, internet standards, web technology, and more.
Stephen Shankland
5 min read
20190211-ibm-debater-01

Champion debater Harish Natarajan argues against IBM Debater, represented by a screen with a blue oval, in a competition at the IBM Think conference.

Stephen Shankland/CNET

The subject under debate was whether the government should subsidize preschools. But the real question was whether a machine called IBM Debater could out-argue a top-ranked human debater.

The answer, on Monday night, was no.

Watch this: Hear IBM Debater argue with a human -- and lose

Harish Natarajan, the grand finalist at the 2016 World Debating Championships, swayed more among an audience of hundreds toward his point of view than the AI-powered IBM Debater did toward its. Humans, at least those equipped with degrees from Oxford and Cambridge universities, can still prevail when it comes to the subtleties of knowledge, persuasion and argument.

It wasn't a momentous headline victory like we saw when IBM's Deep Blue computers beat the best human chess player in 1997 or Google's AlphaGo vanquish the world's best human players of the ancient game of Go in 2017. But IBM still showed that artificial intelligence can be useful in situations where there's ambiguity and debate, not just a simple score to judge who won a game.

"What really struck me is the potential value of IBM Debater when [combined] with a human being," Natarajan said after the debate. IBM's AI was able to dig through mountains of information and offer useful context for that knowledge, he said.

It was the second time IBM Debater took on humans in public, though it's taken part in dozens of debates behind Big Blue's walls. In the first IBM Debater competition, the AI defeated one human debater soundly while losing a closer competition with another. This time, though, the human opponent was tougher -- indeed, IBM researchers involved in the years-long project expected their AI would lose.

Computer persuasion

IBM Debater lost, but there's no question it won in a way: Listening to it, you evaluate what it's saying, not just that it's a computer saying something. The machine marshaled its argument, broke that down into a few points and backed them up with data from various studies. It wasn't perfect, but it was on point.

And, weirdly for an AI, it told us how Homo sapiens ought to behave.

"Giving opportunities to the less fortunate should be a moral obligation for any human being," IBM Debater said.

IBM Debater looks somewhat like the alien monolith in 2001: A Space Odyssey, only with animated bouncing blue circles to denote activity. Behind the scenes, Debater uses a group of powerful machines on IBM's cloud-computing infrastructure.

IBM Debater looks somewhat like the alien monolith in 2001: A Space Odyssey, only with animated bouncing blue circles to denote activity. Behind the scenes, Debater uses a group of powerful machines on IBM's cloud-computing infrastructure.

Stephen Shankland/CNET

In the debate, each side had 15 minutes to prepare -- though only IBM Debater has the advantage of being able to draw upon 10 billion sentences' worth of publications from news articles and academic research. Each side took turns making its case, rebutting the other and then presenting a closing argument.

The debate is scored based on how many people change their minds. Before the debate, 79 percent agreed with the position in favor of preschool subsidies, the stance IBM Debater argued for. But afterward, the audience support dropped to 62 percent.

In an age in which Apple's Siri , Amazon's Alexa and the Google Assistant listen to our questions and answer in human-sounding voices, it's easy to forget how remarkable it is that we can converse with computers. IBM Debater goes a step beyond, speaking for minutes.

"She was surprisingly charming and human-sounding," said John Donvan, host of the debate moderator of Intelligence Squared Debates, which runs debates and broadcasts them through a radio show.

Watch this: IBM's new AI can debate you

Don't expect to run something like Project Debater on your laptop anytime soon. It ran mainly on a powerful server with 28 processing cores and a whopping 768GB of memory -- roughly 50 times that of a high-end laptop. It was supported by a quartet of servers, each with 64GB of memory and 2-terabyte hard drives packed with text.

Preschool subsidies

IBM Debater argued in favor of the view that we should subsidize preschools, and Natarajan argued against it.

In Debater's view, preschools "carry benefits for society as a whole. It is our duty to support them." Good preschools mean kids -- especially poor kids -- do better in life.

Natarajan countered that preschool subsidies are "little more than a politically motivated giveaway to members of the middle class ... and not to the individuals who are most underprivileged." He also poked holes in Debater's assumptions, for example that a subsidy will meaningfully improve education for the poor.

Debater showed improvements over its 2018 debate. One new trick up its sleeve was the ability to offer a parallel argument -- in this case that subsidizing health care can be beneficial. Another was improved rebuttal skills. After Natarajan argued that some kids might not benefit from immersion into the potentially competitive world of preschool at age 3 or 4, IBM grasped that view and took issue with it: "My opponent argued that preschools are harmful," it said.

"We were working very hard since June to improve the system," said Noam Slonim, the Project Debater principal investigator at IBM Research. Debater's source material -- academic publications and news articles -- also have been expanded with another year's worth of data to the end of 2018.

Humans discussed IBM's AI debate technology at a contest touting the tech. From left to right: Noam Slonim, Project Debater's principal investigator at IBM Research; IBM's Project Debater screen; Ranit Aharonov, IBM's manager of Project Debater; and Harish Natarajan, the grand finalist at the 2016 World Debating Championships.

Humans discussed IBM's AI debate technology at a contest touting the tech. From left to right: Noam Slonim, Project Debater's principal investigator at IBM Research; IBM's Project Debater screen; Ranit Aharonov, IBM's manager of Project Debater; and Harish Natarajan, the grand finalist at the 2016 World Debating Championships.

Stephen Shankland/CNET

Most challenging contest so far

The competition was the most challenging yet for IBM's AI.

Natarajan "is at a different level compared to the debaters we faced so far," said Ranit Aharonov, IBM's manager of Project Debater. "He's the most decorated debater in the history of university debate competitions with the world record in the number of victories."

The event, at IBM's Think conference in San Francisco, is IBM Debater's last big debate. "Debater is nice, and it's good to showcase, but we should be focusing on how to take that technology and make something that's commercially viable," Aharonov said. "We are at the stage where we'll finalize the first use case we'll work on."

That could be something like helping a company understand the views of its employees or customers, or helping the news media or governments engage people in discussion about contested issues, she said.

That's because the technology behind Project Debater is all about the messiness and nuance of the real world we humans live in, not the black-and-white realm of games.

"We are going out of the comfort zone of AI into territory which is more gray," Slonim said.

Facebook acquires AI startup: GrokStyle is here to help you shop.

CNET Magazine: Check out a sample of the stories in CNET's newsstand edition.