the number of adjectives in a sentence.)
Using these and other criteria, sentiment analysis algorithms can then begin to create computational models of
human opinion.
Complicating matters even further
are questions of context (who’s speaking, and to whom?) and linguistic nuances like slang and ambiguity. A “bad
motorcycle” might actually be a good
one; whereas a “bad movie” is probably
just plain bad. Sentiment analysis algorithms sometimes have to go beyond
literal interpretations of a text to discern an author’s original intent. Given
the wide varieties of idiomatic writing
on the Web, this is no small task. As
Grimes notes, “You don’t see ‘Genistein
inhibits protein histidine kinase…Not!’
in a scientific paper.”
mining collective opinions
“With opinions, so much depends on
the point of view of the user,” says David Pierce, chief technology officer of
Jodange, whose sentiment analysis
software grew out of a research project
by Claire Cardie at Cornell University
and Jan Wiebe at the University of Pittsburgh. Drawing on a body of theory in
linguistics, philosophy, and computational linguistics, their team developed
an algorithm that tries to determine the
context of any particular statement by
isolating three key data points: the topic, the opinion holder, and the opinion
itself. First, the algorithm employs an
entity extraction routine that locates keywords to identify particular topics and
opinion holders. Next, it layers that data
onto a linguistic analysis of the opinion
being expressed. The resulting unit of
data is a triple consisting of opinion,
opinion holder, and topic. These triples
are then stored in a relational database,
where they can be cross-referenced
across multiple documents to create
what Jodange vice president of product
management and marketing Pia Chong
calls a “walled garden of opinion.”
By connecting opinions from multiple sources about a particular topic,
the application can provide users with a
bird’s-eye view of a particular topic presented in a variety of different formats:
straightforward lists, heat maps that
show the concentration of opinions on
particular topics, an opinion index that
calculates positive or negative trends,
or a so-called Doppler view that shows a
graphical summary of opinion data. The
company is currently working on a new
predictive model that could use opinion
data to predict future developments,
such as the impact of written opinion on
trends in a company’s stock price.
A number of other companies are
now developing their own variations of
sentiment analysis software. Companies like Attensity, Clarabridge, Lexalyt-ics Limited, SPSS, and TEMIS are developing their own proprietary versions
of sentiment analysis software. All of
these products employ some combination of keyword extraction and linguistic analysis to provide their customers
with a particular understanding of collective opinion. Some of these products
are targeted toward business applications, others toward consumer-facing
Web applications.
For consumers, the most obvious
applications for sentiment analysis involve enhancing search engines with
more opinion data. “Sentiment analysis
software could enable a much smoother
user experience” for consumer research,
says Pang. Microsoft’s Product Search,
which is part of Live Search, and Yelp’s
review highlights, which include phrases automatically extracted from user
reviews, already rely on basic sentiment
analysis to enhance their search results.
Such interactions could eventually find
their way into the general Web search
experience. Pang suggests that such
interactions could be fine-tuned for users at different stages of the research
process, allowing them to narrow down
from reviews of a product category to
comparisons between products, then
finally to in-depth product reviews.
Beyond the realm of consumer prod-
for many businesses,
online customer
opinions have
become a type of
virtual currency that
can make or break
their products.
ucts, Pang also sees opportunities for
sentiment analysis to shape the way people consume news. “When the media is
having a field day, [users] might want to
get a digest of different perspectives on
the breaking news with analysis.” Similar applications might eventually lead
to new types of interfaces where readers
could track the movement of opinion
about particular stories over time.
That kind of opinion-trending insight is particularly valuable to users
working in business or government.
These potential users might include
business intelligence professionals,
market researchers, or public relations
specialists. Today, sentiment analysis
vendors are already marketing their
products to companies in the form of
hosted services that provide opinion
dashboards and other management
tools. At this stage, sentiment analysis
software is too new to have penetrated
most IT firewalls. Eventually, however,
companies may start exploring how to
integrate sentiment analysis data with
their core management systems. “
Unified analysis is coming,” says Grimes,
“but it’s not here yet.”
Attensity is taking a step in that direction by marketing a suite of tools designed to help companies integrate sentiment analysis data with their internal
business operations. In addition to providing sentiment analysis data, Attensity provides mechanisms for funneling that data into operational “queues”
like marketing campaigns or call center
scripts. “For example, if a valuable customer is upset they can route them to a
special marketing campaign that compensates them through points or other
things of value,” explains Michelle de
Haaff, Attensity’s vice president of marketing and products.
As sentiment analysis finds its way
into the business mainstream, vendors
will likely continue to develop similar
services that bring sentiment analysis
into the IT mainstream. Once that integration starts to happen, companies
will be able to feed opinion data into
core business processes that can help
them strengthen their customer relationships—and, ultimately, boost profits: a decidedly unsentimental goal.
Alex Wright is a writer and information architect who lives
and works in new york city.
© 2009 acm 0001-0782/09/0400 $5.00
APriL 2009 | voL. 52 | no. 4 | communicAtionS of the Acm
15