Personal Democracy Plus Our premium content network. LEARN MORE You are not logged in. LOG IN NOW >

The Library of Congress is Going to Archive Twitter. Why?

BY Nancy Scola | Wednesday, April 14 2010

The Library of Congress just tweeted this: "Library to acquire ENTIRE Twitter archive -- ALL public tweets, ever, since March 2006! Details to follow."

Why would the Library of Congress want to become the repository for millions of tweets sent back and forth around the world over the last four years? Well, the LOC has embraced a mission of becoming the repository for the record of American history, and that's a history that has been unfurling increasingly online in the past several years. Still, it does raise questions. (In fact, enough so that when I first saw this tweet, I assumed that it was mistaken. But there it is, on @librarycongress.) Is this going to be a continuously updated tweet archive? Will it be searchable by researchers? Will it track retweets and responses? Why not archive Facebook?

And then there's the fact that this seems to tweak the social contract between Twitter and users a bit. Tweets have at least the feel of being ephemeral, even if we all know that that data is capable of being archived.

With this move, the Library of Congress is taking the vacuum approach. More data is better, and the Library puts itself in a better position to be a resource the more information it contains. But you can look at the legal profession, and how it has struggled somewhat to adapt to electronic discovery, and start to think that something like this isn't without risks. Beyond that, are you at all freaked out that the occasionally ridiculous things you tweet are going to be housed for all time in the nation's library?

But that's all uninformed speculation. This news is all of ten minutes old. Indeed, stay tuned.

UPDATE: The Library of Congress has posted on their blog the promised details on their plan to archive Twitter. Alas, at the moment the site is kaput from the traffic. Guess people are interested.

AND ANOTHER UPDATE: The LOC's blog post is back up, with promises of more detail to follow. The Library's Director of Communications Matt Raymond writes:

We will also be putting out a press release later with even more details and quotes. Expect to see an emphasis on the scholarly and research implications of the acquisition. I’m no Ph.D., but it boggles my mind to think what we might be able to learn about ourselves and the world around us from this wealth of data. And I’m certain we’ll learn things that none of us now can even possibly conceive.

And:

So if you think the Library of Congress is “just books,” think of this: The Library has been collecting materials from the web since it began harvesting congressional and presidential campaign websites in 2000. Today we hold more than 167 terabytes of web-based information, including legal blogs, websites of candidates for national office, and websites of Members of Congress. We also operate the National Digital Information Infrastructure and Preservation Program www.digitalpreservation.gov, which is pursuing a national strategy to collect, preserve and make available significant digital content, especially information that is created in digital form only, for current and future generations.

Twitter's Biz Stone spells out some of the nuances of the arrangement:

It is our pleasure to donate access to the entire archive of public Tweets to the Library of Congress for preservation and research. It's very exciting that tweets are becoming part of history. It should be noted that there are some specifics regarding this arrangement. Only after a six-month delay can the Tweets will be used for internal library use, for non-commercial research, public display by the library itself, and preservation.

News Briefs

RSS Feed today >

Cory Booker Hires Democratic Organizing Veteran Addisu Demissie To Manage Senate Run

Newark Mayor Cory Booker has hired a veteran of the Democratic organizing world Addisu Demissie to manage his run to succeed the late New Jersey Democratic Senator Frank Lautenberg of New Jersey. GO

ShareProgress Debuts Social Sharing Optimization Tools

ShareProgress, a left-leaning tech startup in downtown San Francisco, launched its social sharing optimization platform Tuesday after several months of testing with the progressive advocacy group CREDO Action. GO

New Organizing Institute to Move from Collecting Election Data to Organizing Election Officials

The New Organizing Institute, a progressive nonprofit that trains campaigners and is no led by former Obama for America data director Ethan Roeder, is launching a new initiative next week aiming to "fix that" for local elections. NOI will announce a national network where local election administration officials can congregate to share solutions to common issues. It's a transition for a team at NOI that had previously been managing the Voting Information Project, which collects data on polling places, election districts and voter registration deadlines and prepares it for third parties in machine-readable format. In the 2012 election cycle, backed by the Pew Charitable Trusts and partnered with Google, VIP made information available in all 50 states. GO

Russian SOPA Passed First Reading

A first draft of a law nicknamed “Russian SOPA” was approved by the Russian parliament last Friday, June 14. Like the original Stop Online Piracy Act, the bill will establish penalties and procedures for online copyright violations.

GO

monday >

Czech Prime Minister Resigns Following Corruption and Surveillance Scandal

The prime minister of the Czech Republic resigned yesterday, irreparably damaged by a corruption scandal and the possibility of impropriety in his personal life. According to the Czech constitution, his entire government will also have to relinquish office.

GO

friday >

Mayors of New York City and San Francisco Announce "Digital Cities" Summit

The Mayors of New York City and San Francisco announced Friday that they're co-hosting meetings in the Fall and early next year to examine the "best practices" that lead to tech-enabled economic growth. The meetings are follow-ups to the initial Bloomberg Technology Summit held last year in New York City. This year's summit in New York ... GO

New York State Joins GitHub to Get Feedback on Open Data Policy

New York is the first state to publish an initial draft of its open data guidelines on GitHub to seek feedback from the public, Governor Andrew Cuomo announced in a press release Thursday. GO

Brazilians Protest Forced Evictions on YouTube and in Mock World Cup

Tomorrow Brazilians who have been forced out of their housing in advance of the 2014 World Cup will stage their own “People's Cup” in Rio de Janeiro to draw awareness to forced evictions.

GO

A “Fix-Rate” for Corruption: Integrity Action Wins the Google Global Impact Award

“From wanachi (“citizen”) to up there,” Emmanuel Dzombo explains with an upward sweep of his hand, is how Integrity Action has begun to reverse the bureaucratic top-down approach that has often blocked development work in Kenya. Dzombo is a local leader in Chengoni, Kenya, a country that ranks towards the very bottom of Transparency International’s Corruption Perceptions Index – at 139. The organization believes it could do more, and Google.org seems to agree. The Google Impact Challenge will provide the charity with £500,000 that will allow it to develop a mobile application for tracking and collecting data from citizens. GO

Crowdsourced "Danger Maps" Track Air, Soil and Water Pollution in China

Chinese citizens are exposing sources of pollution and other environmental problems by contributing to the partially crowdsourced website 'Danger Maps'. So far, the Chinese government is letting them get away with it.

GO

thursday >

U.S. Privacy and Civil Liberties Oversight Board To Meet Next Wednesday

A long dormant independent agency that was at least nominally supposed to exercise a modicum of oversight over the booming intelligence-industrial complex is scrambling to meet up next Wednesday, but the public will still be none the wiser about what it plans to do, since it is a closed door meeting. The only indication that the toothless ... GO

Despite Software Problems, Civic Hackers are Pedaling Bike Share Data

Reporters are shoaling around the news that New York City's new bike sharing system, Citi Bike, is benighted with problems stemming from its high-tech software. But that's not putting the brakes on plans to explore what programmers might do with data generated by the system by hosting a Citi Bike Civic Hack Night later this month. GO

Grassroots Republicans Are Not Waiting for the RNC To Revamp Their Digital Strategy

Several members of the Republican Party rank and file aren't waiting around for the GOP to reinvent itself on the technological front. They're organizing events themselves to explore what a tech-enabled GOP might look like for the 2014 cycle. GO

wednesday >

New Russian Law Makes Publication of Information on Gay Rights Illegal

On June 11 the Russian parliament passed a bill against “homosexual propaganda” that effectively outlaws gay rights rallies and bans informational or pro-gay rights material from publication in the media or on the Internet. Violators of the law will risk heavy fines and censorship and, in the case of a media outlet, risk being shut down. It had near unanimous support, passing in a 436-to-0 vote, with only one abstention.

GO

Macedonia Draft Law to Regulate and Restrict the "Last Arena for Freedom of Speech"

The draft of a media regulation law in Macedonia has journalists and press freedom watchdogs up in arms. The proposed Law on Media and Audiovisual Media Services was written by the government behind closed doors and without input from the media or NGOs. It has been interpreted as a decisive move on the part of the government to limit speech online in a country where press freedoms are already limited. Until now, Internet-based news sites were not regulated like print media.

GO

Trying to Prosecute Online Piracy in Canada? Good Luck!

A private firm that is monitoring Canadians who download pirated content online has found itself at the center of a legal battle. GO

More