Personal Democracy Plus Our premium content network. LEARN MORE You are not logged in. LOG IN NOW >

Voter File 2.0: Catalist, Democratic Tool

BY Micah L. Sifry | Friday, May 9 2008

I'm in a breakout session at the New Democratic Network's daylong conference on "New Tools, New Audiences," listening to Vijay Ravindran, the CTO of Catalist, talk about web 2.0 and its development of an "Enhanced Voter File." As usual, these are my rushed notes, and at best a good paraphrase of what was said, not direct quotation.

The traditional voter file, which is collected by state bodies, is just name, contact info and party registration, and past voter behavior.

The enhanced voter file, something that Democrats, Republicans and sometimes other organizations build and maintain, contains commercial data, census data, historical information about your behavior, and specialized data (like lifestyle choices). (Vijay notes, later in the Q&A, that this kind of data is often a weak indicator of people's actual political preferences, and hypothesizes that someday under an OpenID framework, campaigns or organizations like his might be more interested in highly accurate information that individuals volunteer about themselves.)

Enhanced voter files are used for canvassing, and also for modeling campaigns.

Catalist is building on the lessons of 2004 (where Democrats had a database meltdown) and working to build a 50 state national database:

Catalist's voter file has the names of 180 million registered voters, plus 75 million unregistered people (for use by voter registration groups), enhanced with commercial data, specialty data (like who owns hunting licenses), integrated it with the Democrat's VAN application, and with a tool for subscribers to mine the data.

Catalist's goal is to be a permanent piece of progressive infrastructure. Vijay talks about Tim O'Reilly's "What is Web 2.0" paper as his "baseline in driving Catalist." So he goes thru some of O'Reilly's key points about the development of web 2.0.

1. The web as a platform: Examples are Amazon, Facebook, Google. For Catalist, this means: if our data is inaccessible, it doesn't exist. (This is another reference to the Democratic data disaster of 2004.) A database administrator is a poor excuse for an interface where people can self-administer. From the very beginning, having a back end web interface was essential. We also created a web services API for progressive organizations with technical staff.

2. Harnessing collective intelligence. Examples are Wikipedia, Amazon, Flickr. For Catalist, this means storing, organizing and utilizing in perpetuity the collective personal data of its customers. It means removing the technical limitations around cooperation, building value-added meta-data that no one else can, and relying on its customers to make its data more correct. (This is sort of the open source model for bug fixing, but Catalist isn't an open product. But inside its ecosystem, it sounds like it's applying the same logic.)

3. Data is the next "Intel inside." For Catalist, this means instant access to information about nearly everyone over 18 in the US in a single format, easy upload of proprietary data for integrated data mining, giving back unique identifiers for each data point, and the creation of a proprietary matching system for the data. What this means you can combine field canvass lists, fundraising, membership, polling info and online engagement, to get a 360 degree view of people and figure out more ways to engage them (or ask them for money, he notes, if you haven't already).

4. End of the software release cycle. Examples are eBay, Netflix, who make their fixes in real time online. The same can be true for politics. Data can be updated in critical election months; no more stale data. And new features and bug fixes need to be deployed rapidly. (No more 2004 horror shows for the Democratic side, in essence. He draws a parallel to Christmas season at Amazon.)

5. Lightweight Programming Models. Examples are Amazon Web Services, YouTube's embed feature. Catalist's approach is to not try to do everything. Their web service is designed to allow other's creativity to take advantage, like MoveOn's "Vote Poke" application. Their formats are usable by microtargetters, and other groups can syndicate their data (like America Votes)

6. Software Above the Level of the Single Device. They're making application configurations for field, analytics, fundraising, strategy, pollsters. etc. Vijay mentions the need to make these more Blackberry friendly, given how many political staffers have them.

7. Rich user experiences. Catalist's Q tool has a professional UI design, maps, drag and drop crosstabls, inline updated counts and access control for organizations.

Where is this going? With more data and more collaboration, and web services, more innovative applications will get built both by Catalist and others. He sees the semantic web coming, intelligent crowd sourcing, integrated web mining...and ultimately more progressive power.

Question time. I ask about their business model. Subscriptions to Catalist are $25K to $400K per year. Several hundred organizations are clients. About 40 people on staff. 15 terabytes in size database. He analogizes it to an electric company, where no one org would ever have the wherewithal to build one, but it is essential infrastructure.

I also ask: "How do they insure that they're only selling their services to progressives?" He says they haven't hit the interesting question yet of "what if Joe Lieberman wants our services?" It would be up to the board. AARP would be considered progressive, he thinks.

In terms of how information is shared internally: A lot of nonprofits that use Catalist's data release their own results back for others to use, such as Womens Voices, Womens Vote, one of their clients. By and large, donor data, membership info tends to be kept private by clients.

A very impressive presentation. Ravindran, who left Amazon to lead Catalist's technology team, has clearly brought the wisdom of Silicon Valley to the political infrastructure business. He's definitely someone to watch.

News Briefs

RSS Feed yesterday >

"Power Politics in the Age of Google"

TechPresident's editorial director, Micah Sifry, will be speaking this afternoon on a panel at Harvard University called "Power Politics in the Age of Google," alongside Susan Crawford, Nicco Mele, Elaine Kamarck and Alexis Ohanian. The panel will be moderated by Harvard Shorenstein Center Director Alex Jones, and will be live-streamed here. GO

House Republicans Get a Jump on the Budget

Via Politico's Mike Allen, the House Republicans are out with a video — this one attributed to Majority Whip Kevin McCarthy — getting the drop on President Barack Obama's next federal budget, expected Monday. GO

Mittbucks.com Lets Voters Compare Their Paychecks With Romney's

What would it take for Mitt Romney to be able to relate to the average American's daily economic life? He'd have to pay $1,208.09 for a gallon of gas, according to Mittbucks.com, a web site recently created by Adam Rosenscruggs and his wife Danielle in Washington, D.C. The eye-popping figure results from an annual income that I plugged in ... GO

What Twitter Won't Tell You About the Election

A new study released on Tuesday by the Pew Research Center for the People & the Press on Tuesday offers the opportunity to get real about what the political conversation on Twitter and Facebook can — or can't — tell you about the progression of the 2012 political campaign. Pew has found that even among users of Twitter and Facebook, a paltry percentage of people use social networks to get news about politics: Only 24 percent of Twitter users in the sample and 25 percent of Facebook users said they "sometimes" got campaign news through that network, while a full 40 percent of Twitter users in the sample and 46 percent of other social media users reported "never" getting campaign news through either Twitter or Facebook. GO

Navigating New York's "Road Map for the Digital City," One Year In

In May 2011, New York City Mayor Michael Bloomberg revealed a "Road Map for the Digital City," a plan to use technology to make city government more and participatory, and to leverage the city's tech sector for economic and civic gains.

New York City Chief Digital Officer Rachel Sterne will join our editorial director, Micah Sifry, on a conference call this Friday afternoon to discuss the progress on that road map so far. The call is free and open to anyone to join. You can sign up here.

GO

tuesday >

Pete Hoekstra's Campaign Website's "Offensive" Source Code Changed After Outcry

As if "chop suey fonts" and obvious graphic allusions to the stereotype of the Chinese as the Yellow Peril weren't controversial enough, the group that created an incendiary microsite for former Rep. Pete Hoekstra's campaign has managed to further fan the flames with what it's calling a mistake in its code. GO

Fidel Castro Loves the Internet

“The Internet is a revolutionary instrument that permits the receiving and transmission of ideas, in both directions, that is something we should know how to use,” Fidel Castro told a crowd of supporters on Feb. 4, according to the state-owned Cuban newspaper Granma International. Castro, who made his first public appearance since April 2011, launched his two-volume memoir, “Guerilla of Time,” and took the opportunity to discuss issues of importance to him. Earlier this week, Miranda Neubauer reported that one of these topics was the need for the Internet. Castro has been a proponent of the Internet as a tool for the exchange of ideas since 2003, but the average Cuban citizen faces great difficulty getting online. GO

Claire McCaskill Hires Blue State Digital's Alex Kellner As Digital Director

Missouri's senior Democratic Senator Claire McCaskill has hired Blue State Digital's Alex Kellner as its digital director. GO

More