Personal Democracy Plus Our premium content network. LEARN MORE You are not logged in. LOG IN NOW >

Twitter Political Index Launches, But Is It Actually Measuring "Voter Sentiment?"

BY Micah L. Sifry | Wednesday, August 1 2012

Screen image taken from Twitter blog post

Today, Twitter announced the launch of the "Twitter Political Index" in partnership with the social data analysis firm Topsy and pollsters The Mellman Group and North Star Opinion Research, and the twittering class swooned.

"Ignore the 'Twindex' at your peril," stomped Chris Cilizza of The Washington Post's "The Fix" blog. "This is a big deal: Twitter just launched a very powerful new political sentiment tool," tweeted Ben Smith, the editor of BuzzFeed.

"Twitter Will Gauge Voter Sentiment in New Venture" was the headline at National Journal--never mind the fact that this is neither a measure of voters or of sentiment.

Rather, what the Twindex number for Obama or Romney purports to be is a ranking of where attitudes expressed about those two men stand compared to all the other "sentiment" currently being expressed on a daily basis on Twitter, with some smoothing done over a three-day cycle to deal with the daily fluctuations of topics on the platform. Says Adam Sharp, Twitter's lead for government, news and social innovation, "If a candidate's rating is 30, that would mean that 30% of tweets on all other subjects are more negative than the ones about the candidate, and 70% are more positive."

Look out, everyone, the web just invented another leaderboard. And now the conversation about politics in America is going to get even dumber, because we have a new magic number to chatter about. Obama is at 34, down 4 from yesterday. Romney is at 25, up 2. Sell Obama! Buy Romney!

Over on BuzzFeed, staff reporter Matt Buchanan breathlessly gave the details on how Twindex would supposedly enable anyone to directly "check the pulse of millions of actual people, simultaneously and directly, second by second."

The rough version of how it works: Topsy pores through every single tweet in real time, determines which ones are about Obama or Romney, and then assigns a sentiment score to each tweet based on its content. That is, whether it's positive or negative toward Obama or Romney, and just how positive or negative it is. Add all the data up together and you have something like a real-time approval score for Obama and Romney, determined by what tens of millions of people are saying, which Twitter is going to release daily at election.twitter.com.
….The Twitter Political Index leverages the growing science of sentiment analysis, in which computers — machines! — try to assess what the meaning or feeling of a piece of writing really is. Which isn't easy — particularly for "short form content like tweets, which often lack context," Topsy's chief scientist Rishab Ghosh tells me. But Topsy's algorithms agree with a randomly selected human 90 percent of the time on what a tweet means, validated over 30,000 tests. (It's hard to get to 100 percent because humans often disagree on what a tweet means. Also, machines are still terrible at picking out or assessing sarcasm or irony. So, "Good job, Obama!" with a link to an article about crappy job numbers might fly past the algorithm as a positive tweet. )

Well, that's quite an admission, isn't it? If Topsy's algorithms were right 90 percent of the time, that means they were wrong 10 percent of the time. Or, shall I say, they and the human scorer(s) of a tweet were in disagreement 10 percent of the time, and no one could agree what a tweet meant.

But what if both machine and human scorers are in agreement, but also mistaken about someone's sentiment? Stop and think. How often have you misunderstood something that you read in an email or tweet that was written by someone you know, let alone a stranger? It could be that Topsy's algorithms AND human readers are both validating mistaken impressions. It's for these kinds of reasons that serious academics studying the challenge of doing sentiment analysis insist that researchers publish their methodology in detail, something Topsy has yet to do. A request to Topsy for comment was unanswered as of the end of business today.

I put this question to Sharp, and he said Topsy's testing of its algorithms against the choices of multiple human scorers gave him confidence that most ambiguities would be ironed out of the index's ratings. He added, "At the end of the day, there is no such thing as perfection on this front. Nor do we hold out that the Twindex is perfection. It is an imperfect model."

What about the claim that Twindex is reflecting American political sentiment? Only about 30 percent of Twitter's user base is American, Sharp says, but "obviously, U.S. users are much more vocal about the campaign than non-U.S. users."

And how do we know that a tweet mentioning Obama or Romney was written by a voter? "We are essentially taking the hypothesis that someone who is actively tweeting about the election is more likely to be a voter than not," he answered. The fact that the Obama's Twindex number seems to move in relatively close correlation to his approval numbers in the Gallup Poll, Sharp says, also suggests that the metric is reflecting something useful about public attitudes.

Marc Smith, founder of the Social Media Research Foundation, is less sanguine. "I think that this is more horse race stuff with limited methodological validity," he told me in an email this afternoon.

The idea that the metric claims to represent the percentage of tweets that are more or less negative than tweets about that person is hard to comprehend. I propose that simple alternatives would have more validity and comprehensibility. Raw counts of tweets with each candidate name is a simple and easy to understand metric. A list of the ten terms most often associated with the candidate's names would also be informative (and better express the "sentiment" than a percentage score).

I continue to find "sentiment" metrics hard to believe and understand and compare to other's metrics.  I think it is good to see this kind of data in use and I appreciate the interest in social media as a novel source of insights. But I do think this data is over sold.

Smith also warned that the Twindex could also be gamed, even though the number of tweets being analyzed on a daily basis could number in the millions.

"As it stands, this metric does not recognize the fact that twitter is factionalized.  If people who tweet #tcot post negative things about Obama is that news? Sub-groups with a partisan view can now ramp up negative content creation in an effort to move this metric. If we normalized these metrics based on which group the tweet came from we could see when people who had not been negative become negative. Just measuring the volume of negative seems to miss the idea that determined interest groups can generate more negative on demand. This is different from seeing a shift among positive or neutral people to a negative stance."

To be fair, Sharp is careful to emphasize that this a work in progress. "This is a young, three hours old experiment, an imperfect one that we'll keep trying to make stronger, based on a public data set that we'll invite others to take a stab at."

"None of us are claiming to have cracked the code."

News Briefs

RSS Feed friday >

Chilean Anti-Corruption Resource: A Crowdsourced Database of Social and Political Connections

In countries where a small minority of social circles have a majority of the political and economic power, personal relationships can affect major decision-making, a serious concern of anti-corruption activists. A new web platform stores personal profiles of key players in Chilean business and politics, complete with biographies and personal and professional connections through family, education, social circles, employers and coworkers, to make tracking social relationships and conflict-of-interest easier. Called Poderopedia (from the Spanish word for power), the project sounds kind of like LinkedIn, but the creation and management of profiles is being crowdsourced out to journalists, activists and concerned citizens.

GO

Middle Eastern Telecom Accused of Working With Saudi Arabia to Spy on Citizens

Mobily, an arm of the state-owned Middle Eastern telecom giant Etihad Etisalat, has been accused of working with Saudi Arabia to develop software that would allow the government to bypass protections for social media users. The exposé comes from Moxie Marlinspike (neé Matthew Rosenfield), an expert in a certain type of malicious Internet attack called MITM (man-in-the-middle), whereby attackers intercept and secretly alter private messages exchanged via email and other social media platforms. GO

Saudi Religious Leader Warns Twitter Users of Consequences in the Afterlife

In late March, Saudi Arabia's top religious cleric said Twitter was for clowns and corrupters. Earlier this week, he said anyone using social media, in particular Twitter, “has lost this world and the afterlife.” His comments might be laughable, if they did not come at a time when the Saudi government is looking into monitoring or blocking social media sites and eliminating user anonymity.

GO

thursday >

What The Other Silicon Valley Immigration Group Is Doing This Month

A bipartisan coalition of political advocacy, business and tech groups are moving ahead to launch a social media blitz next week designed to persuade members of the Senate to vote in favor of immigration reform legislation supported in Silicon Valley. "We're going to create a virtual digital storm," said Jeremy Robbins in a Wednesday ... GO

The New Yorker Hopes "Strongbox" Is a Wiretap-Proof Sieve for Leaks

The New Yorker yesterday became the first outlet to implement DeadDrop, a new system for sources to submit information to journalists online in a more secure and anonymous way than, for example, email. GO

Female Organizer of Pakistan's First Hackathon Stresses Collaboration Over Competition

After Pakistan banned Valentine's Day this year, Sabeen Mahmud started an online protest in which people uploaded photos to mock the government ban. In the weeks following she received death threats and menacing phone calls, and early on she had to stay home from work. That did nothing, however, to keep her from further organizing. Last month, the café she started in Karachi hosted Pakistan's first ever hackathon, which tackled problems including sanitation, crime, disaster management, and education. She even invited a government representative to observe the initial conversations, tackling sensitive areas like government inefficiency and elections.

GO

wednesday >

White House Innovation Fellows Project Spins Off Into A Business

Clay Johnson and Adam Becker joined the Presidential Innovation Fellows program to help the White House fix the way government does business. Now they're turning that mission into a business themselves. GO

Fighting Fires With Data, New York City Launches New Safety Inspection System

Mayor Michael Bloomberg announced today that New York City has implemented city-wide a new risk based inspection system focused on fire safety that is driven by analytics from multiple city agencies. GO

Chinese Netizens Use Digital Initiative to Gain Media Attention for Unsolved Poisoning Case

Last month a medical science student at a Shanghai university died from poisoning, allegedly murdered by his roommate. The specifics of the crime echoed a case from the mid-1990s, in which a 19-year-old student was poisoned with thallium. That case has once again been thrown into the media spotlight, but after 18 years the media has changed and the spotlight means a trending hashtag on Sina Weibo or an online petition to the U.S. President.

GO

PDF France 2013: “Au Code, Citoyens!”

This year PDF France will take place in Paris on June 13, with the theme "Au Code, Citoyens!" ("To Code, Citizens!") The speakers' lineup includes some of the continent's leaders in the digital revolution. GO

tuesday >

Website Imitation is Flattery in New York City Council Race

A New York City Council candidate who had made his name as a technology consultant and spearheaded an open government initiative several years ago found parts of his website copied by another City Council candidate in a different borough, as Politicker first reported. GO

Mike Honda Locks Up Establishment Support, But Challenger Has Ear of the Silicon Valley Elite

Some of Silicon Valley's most influential business people will hold a fundraiser in San Francisco this Thursday for Ro Khanna, the 36-year-old lawyer who's challenging 71-year-old California Democrat Mike Honda for his 17th Congressional District seat. The names at the top of the invite: Ron Conway and Sean Parker. They're apparently forming a committee to help Khanna build his campaign. The other bold-face names who are listed as part of the 'committee in formation' include Salesforce.com's Founder and CEO Marc Benioff, Benchmark Capital General Partners' Matt Cohler and Peter Fenton, tech entrepreneur Shawn Fanning, Yahoo CEO Marissa Mayer, her big data venture investor husband Zach Bogue, and Conway's SV Angel colleague, Founder and Managing Partner David Lee. GO

Tools to Keep Independent Media Online in Hostile Environments

Websites and media outlets in developing countries or countries with corrupt or repressive regimes struggle daily to fend off hacker attacks, some from their own government — like the Malaysian news portal Sarawak Report, which techPresident reported was taken down in April by sustained denial-of-service attacks. The negative attention controversial reporting draws can scare local advertisers away as well, making it difficult for a media company to support itself. Media Frontiers offers two services to websites dealing with either of those problems.

GO

monday >

Ahead of September Elections, German Pirate Party Picks Its Platform

The German Pirate Party held its election year convention over the weekend and approved its party platform, following lengthy debate over the role that online decision-making should have within the party, as German news sources reported and the party outlined on its own web platforms. GO

Peruvians Petition their President to Stick Up for their Digital Rights

Peru’s civil society advocacy groups have started an online petition outlining their ‘non-negotiable’ demands for digital rights and freedom of speech. The campaign was prompted by the controversial Trans-Pacific Partnership (TPP) agreement. Lima, Peru, will soon host the 17th round of secretive TPP trade talks, which will take place from May 15 – 24.

GO

Gun Control Advocates Take Aim At LivingSocial for Promoting Guns and Alcohol

A coalition of advocacy groups is launching a new campaign this week against the promotion of American gun culture. The campaign focuses on the daily deals site Living Social, which hasn't stopped promoting social events Hunter S. Thompson would have loved (they promote shooting off guns and letting off steam and drinking.) GO

More