[Infowarrior] - IBM Program to Take On ‘Jeopardy!’
Richard Forno
rforno at infowarrior.org
Mon Apr 27 13:53:20 UTC 2009
April 27, 2009
Computer Program to Take On ‘Jeopardy!’
By JOHN MARKOFF
http://www.nytimes.com/2009/04/27/technology/27jeopardy.html?_r=1&hp=&pagewanted=print
YORKTOWN HEIGHTS, N.Y. — This highly successful television quiz show
is the latest challenge for artificial intelligence.
What is “Jeopardy”?
That is correct.
I.B.M. plans to announce Monday that it is in the final stages of
completing a computer program to compete against human “Jeopardy!”
contestants. If the program beats the humans, the field of artificial
intelligence will have made a leap forward.
I.B.M. scientists previously devised a chess-playing program to run on
a supercomputer called Deep Blue. That program beat the world champion
Garry Kasparov in a controversial 1997 match (Mr. Kasparov called the
match unfair and secured a draw in a later one against another version
of the program).
But chess is a game of limits, with pieces that have clearly defined
powers. “Jeopardy!” requires a program with the suppleness to weigh an
almost infinite range of relationships and to make subtle comparisons
and interpretations. The software must interact with humans on their
own terms, and fast.
Indeed, the creators of the system — which the company refers to as
Watson, after the I.B.M. founder, Thomas J. Watson Sr. — said they
were not yet confident their system would be able to compete
successfully on the show, on which human champions typically provide
correct responses 85 percent of the time.
“The big goal is to get computers to be able to converse in human
terms,” said the team leader, David A. Ferrucci, an I.B.M. artificial
intelligence researcher. “And we’re not there yet.”
The team is aiming not at a true thinking machine but at a new class
of software that can “understand” human questions and respond to them
correctly. Such a program would have enormous economic implications.
Despite more than four decades of experimentation in artificial
intelligence, scientists have made only modest progress until now
toward building machines that can understand language and interact
with humans.
The proposed contest is an effort by I.B.M. to prove that its
researchers can make significant technical progress by picking “grand
challenges” like its early chess foray. The new bid is based on three
years of work by a team that has grown to 20 experts in fields like
natural language processing, machine learning and information retrieval.
Under the rules of the match that the company has negotiated with the
“Jeopardy!” producers, the computer will not have to emulate all human
qualities. It will receive questions as electronic text. The human
contestants will both see the text of each question and hear it spoken
by the show’s host, Alex Trebek.
The computer will respond with a synthesized voice to answer questions
and to choose follow-up categories. I.B.M. researchers said they
planned to move a Blue Gene supercomputer to Los Angeles for the
contest. To approximate the dimensions of the challenge faced by the
human contestants, the computer will not be connected to the Internet,
but will make its answers based on text that it has “read,” or
processed and indexed, before the show.
There is some skepticism among researchers in the field about the
effort. “To me it seems more like a demonstration than a grand
challenge,” said Peter Norvig, a computer scientist who is director of
research at Google. “This will explore lots of different capabilities,
but it won’t change the way the field works.”
The I.B.M. researchers and “Jeopardy!” producers said they were
considering what form their cybercontestant would take and what gender
it would assume. One possibility would be to use an animated avatar
that would appear on a computer display.
“We’ve only begun to talk about it,” said Harry Friedman, the
executive producer of “Jeopardy!” “We all agree that it shouldn’t look
like Robby the Robot.”
Mr. Friedman added that they were also thinking about whom the human
contestants should be and were considering inviting Ken Jennings, the
“Jeopardy!” contestant who won 74 consecutive times and collected
$2.52 million in 2004.
I.B.M. will not reveal precisely how large the system’s internal
database would be. The actual amount of information could be a
significant fraction of the Web now indexed by Google, but artificial
intelligence researchers said that having access to more information
would not be the most significant key to improving the system’s
performance.
Eric Nyberg, a computer scientist at Carnegie Mellon University, is
collaborating with I.B.M. on research to devise computing systems
capable of answering questions that are not limited to specific
topics. The real difficulty, Dr. Nyberg said, is not searching a
database but getting the computer to understand what it should be
searching for.
The system must be able to deal with analogies, puns, double entendres
and relationships like size and location, all at lightning speed.
In a demonstration match here at the I.B.M. laboratory against two
researchers recently, Watson appeared to be both aggressive and
competent, but also made the occasional puzzling blunder.
For example, given the statement, “Bordered by Syria and Israel, this
small country is only 135 miles long and 35 miles wide,” Watson beat
its human competitors by quickly answering, “What is Lebanon?”
Moments later, however, the program stumbled when it decided it had
high confidence that a “sheet” was a fruit.
The way to deal with such problems, Dr. Ferrucci said, is to improve
the program’s ability to understand the way “Jeopardy!” clues are
offered. The complexity of the challenge is underscored by the
subtlety involved in capturing the exact meaning of a spoken sentence.
For example, the sentence “I never said she stole my money” can have
seven different meanings depending on which word is stressed.
“We love those sentences,” Dr. Nyberg said. “Those are the ones we
talk about when we’re sitting around having beers after work.”
More information about the Infowarrior
mailing list