the number of adjectives in a sentence.) Using these and other criteria, sentiment analysis algorithms can then begin to create computational models of human opinion.
Complicating matters even further are questions of context (who’s speaking, and to whom?) and linguistic nuances like slang and ambiguity. A “bad motorcycle” might actually be a good one; whereas a “bad movie” is probably just plain bad. Sentiment analysis algorithms sometimes have to go beyond literal interpretations of a text to discern an author’s original intent. Given the wide varieties of idiomatic writing on the Web, this is no small task. As Grimes notes, “You don’t see ‘Genistein inhibits protein histidine kinase…Not!’ in a scientific paper.”
“With opinions, so much depends on the point of view of the user,” says David Pierce, chief technology officer of Jodange, whose sentiment analysis software grew out of a research project by Claire Cardie at Cornell University and Jan Wiebe at the University of Pittsburgh. Drawing on a body of theory in linguistics, philosophy, and computational linguistics, their team developed an algorithm that tries to determine the context of any particular statement by isolating three key data points: the topic, the opinion holder, and the opinion itself. First, the algorithm employs an entity extraction routine that locates keywords to identify particular topics and opinion holders. Next, it layers that data onto a linguistic analysis of the opinion being expressed. The resulting unit of data is a triple consisting of opinion, opinion holder, and topic. These triples are then stored in a relational database, where they can be cross-referenced across multiple documents to create what Jodange vice president of product management and marketing Pia Chong calls a “walled garden of opinion.”
By connecting opinions from multiple sources about a particular topic, the application can provide users with a bird’s-eye view of a particular topic presented in a variety of different formats: straightforward lists, heat maps that show the concentration of opinions on particular topics, an opinion index that calculates positive or negative trends, or a so-called Doppler view that shows a
graphical summary of opinion data. The company is currently working on a new predictive model that could use opinion data to predict future developments, such as the impact of written opinion on trends in a company’s stock price.
A number of other companies are now developing their own variations of sentiment analysis software. Companies like Attensity, Clarabridge, Lexalyt-ics Limited, SPSS, and TEMIS are developing their own proprietary versions of sentiment analysis software. All of these products employ some combination of keyword extraction and linguistic analysis to provide their customers with a particular understanding of collective opinion. Some of these products are targeted toward business applications, others toward consumer-facing Web applications.
For consumers, the most obvious applications for sentiment analysis involve enhancing search engines with more opinion data. “Sentiment analysis software could enable a much smoother user experience” for consumer research, says Pang. Microsoft’s Product Search, which is part of Live Search, and Yelp’s review highlights, which include phrases automatically extracted from user reviews, already rely on basic sentiment analysis to enhance their search results. Such interactions could eventually find their way into the general Web search experience. Pang suggests that such interactions could be fine-tuned for users at different stages of the research process, allowing them to narrow down from reviews of a product category to comparisons between products, then finally to in-depth product reviews.
Beyond the realm of consumer prod-
ucts, Pang also sees opportunities for sentiment analysis to shape the way people consume news. “When the media is having a field day, [users] might want to get a digest of different perspectives on the breaking news with analysis.” Similar applications might eventually lead to new types of interfaces where readers could track the movement of opinion about particular stories over time.
That kind of opinion-trending insight is particularly valuable to users working in business or government. These potential users might include business intelligence professionals, market researchers, or public relations specialists. Today, sentiment analysis vendors are already marketing their products to companies in the form of hosted services that provide opinion dashboards and other management tools. At this stage, sentiment analysis software is too new to have penetrated most IT firewalls. Eventually, however, companies may start exploring how to integrate sentiment analysis data with their core management systems. “ Unified analysis is coming,” says Grimes, “but it’s not here yet.”
Attensity is taking a step in that direction by marketing a suite of tools designed to help companies integrate sentiment analysis data with their internal business operations. In addition to providing sentiment analysis data, Attensity provides mechanisms for funneling that data into operational “queues” like marketing campaigns or call center scripts. “For example, if a valuable customer is upset they can route them to a special marketing campaign that compensates them through points or other things of value,” explains Michelle de Haaff, Attensity’s vice president of marketing and products.
As sentiment analysis finds its way into the business mainstream, vendors will likely continue to develop similar services that bring sentiment analysis into the IT mainstream. Once that integration starts to happen, companies will be able to feed opinion data into core business processes that can help them strengthen their customer relationships—and, ultimately, boost profits: a decidedly unsentimental goal.
Alex Wright is a writer and information architect who lives and works in new york city.
© 2009 acm 0001-0782/09/0400 $5.00
APriL 2009 | voL. 52 | no. 4 | communicAtionS of the Acm
15
References:
Archives