Personal Democracy Plus Our premium content network. LEARN MORE You are not logged in. LOG IN NOW >

Every Bill Coming Before the House Should Soon Be Available Online in Machine-Readable Format

BY Miranda Neubauer | Tuesday, January 17 2012

As of late last week, the House of Representatives began publishing some key legislative documents in machine-readable format at http://docs.house.gov, fulfilling a promise that had been announced last year. Going forward, the site will host a machine-readable version of every bill coming before the House, and currently hosts another structured set of data on all the bills coming before the House in a given week.

The availability of upcoming bills directly from the House as a structured data feed means that a developer could point a web application to that feed and grab official information about the goings-on of the House of Representatives that week, making it easier to understand the House and to follow along. It also means that going forward, every bill coming before the House will be made available in machine-readable XML format. Programmers and people who know from such things may now discuss the use of XML versus other formats for packaging structured data. It's unclear what interface the House will offer for developers to reach backwards through time to pluck individual bills from this repository as it grows. But the House leadership is beginning to deliver on its pledge to make it ever easier for developers and technologists to build tools that explain the workings of Congress, and to do so relying on the House as a primary source.

The Committee on House Administration unanimously adopted the "Standards for the Electronic Posting of House and Committee Documents & Data" in December, which among other guidelines notes that "committees are encouraged to post documents in XML when possible and should expect XML formats to become mandatory in the future."

The site is hosted by the House Clerk, with data coming from the House Majority Leader and the House Committee on Rules in addition to the House of Representatives.

Matt Lira, digital director in Majority Leader Eric Cantor (R-Va.)'s office, called the availability of machine-readable legislation a step as significant as allowing cameras on the House floor. He calls it a structural change to the functioning of the House.

"It's not just opening a door, it's installing a doorstop so that the door can never be closed in that way again," he said. With the role of the House Clerk, he said, it's the House as an institution making its information more available, independent of the personalities involved.

One key goal of the site is to make it easier to see what's in legislation before it is voted on, he said. As an example, he noted that in 2009, stimulus legislation was posted in a non-machine readable format two hours before a vote. Now, he said, legislative texts would be searchable by keyword and available for developers. "Government is pretty lousy at interfaces, but pretty good at data," he said, and added that the site could become a useful for source for media outlets or activist groups.

While currently mainly posting bills to be considered by the House, new standards also direct the inclusion of Committee documents in the future.

"Committee video of hearings and markups will be stored by the House to meet requirements for archiving, access, searchability, and authenticity," the standards also note.

Daniel Schuman from the Sunlight Foundation wrote that "the ongoing process of releasing documents online, in real-time, and in machine-readable manner is a tremendous sea change from the slow and ponderous paper publications that are often late, fairly difficult to use, and unfriendly to computers."

But J.H. Snider, president of iSolon.org and a network fellow at Harvard University’s Edmond J. Safra Center for Ethics, wrote in an e-mail that he still saw room for improvement.

"My basic complaint is that the data is machine-readable only internal to the U.S. House of Representatives, not across the federal government," he wrote. "Until the data can be automatically linked to related databases, the value of machine-readable information is significantly diminished. For example, I’d like the bill text to be automatically linked to the statutes they modify."

That might require some forward movement from places like the Government Printing Office, which maintains a current digital system for accessing federal regulations that relies heavily on PDFs.

Snider called the new site at least a small step in the right direction. He also suggested that in addition to sharing and RSS tools, an e-mail subscribe option should also be available.

Lira said that the offices involved in the site would look at analytics, and at feedback from the transparency community, to shape the site going forward.

In a statement, Rules Committee Chairman David Dreier (R-Calif.) praised the launch of the new website.

"When Republicans took the majority last year, we promised to change the way the House of Representatives conducts the people’s business," he said. "That’s why we adopted rules that for the first time promote the use of electronic files rather than paper printed at taxpayer expense, cutting costs and increasing public access."

This post has been updated.

News Briefs

RSS Feed yesterday >

The Problem with Crowdsourced Legislation

Writing for The Atlantic, Alexander Furnas, a master's candidate at the Oxford Internet Institute, critiques the platform for collaborative legislative markup built at Rep. Darrell Issa's (R-Calif.) and Sen. Ron Wyden's (D-Ore.) behest and launched with their legislative alternative to the Stop Online Piracy Act. The platform, he writes, is "flawed."

GO

Things Online Organizers Say

What do you get when you put hundreds of left-leaning, meme-obsessed activists in the same place at the same time?

One is Rootscamp, a weekend gathering of the progressive organizer tribe in Washington, D.C., that wrapped up Sunday. Hundreds of activists convened for an unconference to talk about new tools and tactics for organizing online. The other correct answer is an, um, stuff people say video targeted to their peers and with a series of guest cameos by leading online organizers, including Rebuild the Dream's Natalie Foster, MoveOn's Daniel Mintz and Julia Rosen, Reddit cofounder Aaron Swartz, and others.

GO

European Commission to Refer ACTA to Europe's Highest Court

The European Commission plans to refer the Anti-Counterfeiting Trade Agreement (ACTA) to the European Court of Justice "to assess whether ACTA is incompatible - in any way - with the EU's fundamental rights and freedoms, such as freedom of expression and information or data protection and the right to property in case of intellectual property," according to a statement released by one of the commissioners earlier today.

GO

Thursday 2/23 PDPlus Call: How Grassroots Conservatives Are Tapping the Power of Open Networks

Conservatives are using online social media in innovative new ways, catching up to or surpassing their counterparts on the other side of the aisle. This Thursday on the Personal Democracy Plus call, I'm looking forward to talking with Martin Avila, whose firm Terra Eclipse worked on Ron Paul's 2008 website, and more recently has partnered with Freedom Works to launch Freedom Connector, a social network that has grown to more than 160,000 active members in just one year. GO

Fact-Checking Group Launches Web Video Campaign To Discourage Flood of Deceptive SuperPAC Ads

A fact-checking web site run by the University of Pennsylvania on Tuesday launched an ambitious new attempt to stem the expected flood of deceptive television advertising placed by third-party political groups on broadcast networks by providing the public with a new tool with which to contact station managers who would be accepting those ... GO

friday >

U.S. Senate Could Save Hundreds of Thousands of Dollars If It Files Campaign Finance Reports Electronically, Says The FEC

One little-noted item in President Obama's budget proposal this week was a recommendation to require U.S. senators to file their campaign finance reports with the Federal Election Commission electronically. The FEC estimates that the switch from paper to bits would save it $430,000 annually. GO

Teddy Goff and Joe Rospars On How Obama's Campaign Is Trying to Get Back to the "We"

Getting back to the "we" of Barack Obama's 2008 campaign — the now-legendary level of energy and individual commitment from grassroots volunteers that Obama was able to harness en route to an improbable victory in the Democratic primary and then in the general election for the presidency of the United States — is in many ways the "central challenge" of his 2012 re-election effort, Obama for America Digital Director Teddy Goff said Friday.Speaking with Obama's chief digital strategist, Joe Rospars, and techPresident publisher Andrew Rasiej at a Social Media Week event in a conference room at Thomson-Reuters with a panoramic view of New York City, Goff described the myriad ways Obama's re-election effort is looking to harness digital tools to connect with voters, whether they be supporters from 2008 or newcomers to politics.

GO

february 16th >

Team Obama's Questlove Endorsement

In a video, Questlove, the drummer and joint frontman of the The Roots, the in-house band for Late Night with Jimmy Fallon, endorsed Barack Obama's reelection as part of the campaign's African Americans for Obama effort. "When I started supporting Barack Obama in 2008 he promised to bring real change and hope to our country and community as a whole," he says in the video. "This is not a quick fix. It's not like you can take a wand, 'BING,' and just make magic overnight. He needs eight years to finish the mission and we need to have his back." GO

More