Skip to main content

Huzzah! Library of Congress’ useless Twitter archive is almost complete… but you can’t read it yet

The U.S. Library of Congress’ slow complication of Twitter’s public tweet archive is, according to the organization, moving along well; The first objectives laid out for the archiving project has finally been achieved. There’s only one question, however: How do you actually share all of those tweets now that you have them?

In a blog post on the Library of Congress’ site, Gayle Osterberg, the Library’s Director of Communications, wrote that once the April 2010 agreement between the Library and Twitter to archive the public tweets from the service’s origins in 2006 had been signed, work began in earnest on how best to achieve that aim. “The Library’s first objectives were to acquire and preserve the 2006-10 archive,” Osterberg wrote, “to establish a secure, sustainable process for receiving and preserving a daily, ongoing stream of tweets through the present day; and to create a structure for organizing the entire archive by date.”

Recommended Videos

As of this month, she goes on to explain, those initial goals have been met. “We now have an archive of approximately 170 billion tweets and growing,” she stated in the blog post, adding that “the volume of tweets the Library receives each day has grown from 140 million beginning in February 2011 to nearly half a billion tweets each day as of October 2012.” With that part of the project dealt with, it’s time to turn to a perhaps more problematic task – one Osterberg politely describes as “addressing the significant technology challenges in making the archive accessible to researchers in a comprehensive, useful way.” In other words, how to actually make it an archive that serves any real purpose, as opposed to a permanent record that is – for all intents and purposes – unavailable to anyone outside of the Library itself.

In a five-page report updating progress on the project, the Library notes that it has already received more than 400 requests for access to the archive, but it hasn’t as yet approved any. The reason is that right now, even just searching the fixed 2006-2010 archive Twitter shared before offering “live” updates to the ongoing record can take up to one day – something that the Library describes as “an inadequate situation in which to begin offering access to researchers.”

“It is clear that technology to allow for scholarship access to large data sets is not nearly as advanced as the technology for creating and distributing that data,” the report continues, pointing out that “even the private sector has not yet implemented cost-effective commercial solutions because of the complexity and resource requirements of such a task.” As a workable solution is sought, the Library promises that it will “develop a basic level of access that can be implemented” for the archive. For example, it aims to consult with outside experts to try and build something permanent that can handle the interest in the archive.

Hopefully no-one’s waiting with baited breath to check out what pop culture ephemera we were all talking about seven years ago, because we’ve obviously got a long way to go.

Topics
Graeme McMillan
Former Digital Trends Contributor
A transplant from the west coast of Scotland to the west coast of America, Graeme is a freelance writer with a taste for pop…
Bluesky finally adds a feature many had been waiting for
A blue sky with clouds.

Bluesky has been making a lot of progress in recent months by simplifying the process to sign up while at the same time rolling out a steady stream of new features.

As part of those continuing efforts, the social media app has just announced that users can now send direct messages (DMs).

Read more
Incogni: Recover your privacy and remove personal information from the internet
Incogni remove your personal data from brokers and more

Everything you do while online is tracked digitally. Often connected to your email address or an issued IP, trackers can easily identify financial details, sensitive information like your social security number, demographics, contact details, like a phone number or address, and much more. In many ways, this information is tied to a digital profile and then collated, recorded, and shared via data brokers. There are many ways this information can be scooped up and just as many ways, this information can be shared and connected back to you and your family. The unfortunate reality is that, for most of us, we no longer have any true privacy.

The problem is exacerbated even more if you regularly use social media, share content or images online, or engage in discussions on places like Reddit or community boards. It's also scary to think about because even though we know this information is being collected, we don't necessarily know how much is available, who has it, or even what that digital profile looks like.

Read more
Reddit just achieved something for the first time in its 20-year history
The Reddit logo.

Reddit’s on a roll. The social media platform has just turned a profit for the first time in its 20-year history, and now boasts a record 97.2 million daily active users, marking a year-over-year increase of 47%. A few times during the quarter, the figure topped 100 million, which Reddit CEO and co-founder Steve Huffman said in a letter to shareholders had been a “long-standing milestone” for the site.

The company, which went public in March, announced the news in its third-quarter earnings results on Tuesday.

Read more