Skip to main content

Scientists encode the novel ‘Wonderful Wizard of Oz’ in DNA

A few years ago, Harvard scientists successfully managed to encode a low-resolution GIF of a horse galloping into the DNA of an e.coli bacteria. Now, researchers have shown off the next level of DNA encoding: By storing the entire L. Frank Baum novel “The Wonderful Wizard of Oz” (the basis for the classic 1939 Hollywood movie of almost the same name) in the form of DNA information.

Recommended Videos

“We start with the digital version of the text,” Stephen Jones, a research scientist who collaborated on the project, told Digital Trends. “We send that information to our program, which spits out a bunch of DNA sequences, made of A,C,G and Ts. Each sequence is used to make actual pieces of DNA. Those pieces could be stored in some pretty rough conditions for thousands to even millions of years, much like we’ve seen with sequenced dinosaur DNA.”

Please enable Javascript to view this content

Coding and decoding the text

Should someone, as Jones said, get the “burning desire” to read the novel in Esperanto, the constructed international auxiliary language it was translated into, they would take these DNA pieces and read back their sequence using a DNA sequencer. The sequence would then go through the algorithm developed by the team, which would translate it back into a digital version readable on computer. “So basically, a computer’s zeros and ones get turned into DNA’s As, Cs, Gs and Ts for storage, then the process is reversed when you’re ready to read,” Jones said.

Carrying out digital-to-DNA conversion has been possible for a long time. But the excitement of this work is the way that the conversion takes place. Digital and DNA storage have different issues, with digital storage being sensitive to electricity, temperatures, water, and more. DNA is more robust in these areas, but is prone to parts being accidentally deleted or added to during the encoding process.

“Academics and big companies like Google and Microsoft have been trying to figure a way around this for a long time,” Jones explained. “Usually, people just read enough copies of the DNA information that if one gets messed up, they can depend on another. You can imagine that process is very inefficient.”

An algorithm to overcome the problems

To overcome this, the team’s encoding algorithm has some neat qualities. To begin with, the information in each DNA sequence helps correct errors in every other DNA sequence’s information so that they build upon each other. The method also accounts for those deletions or additions, is flexible enough that it can be made stronger when a piece of information is really important (a character name in “Wizard of Oz,” for example) and weaker when the information doesn’t matter so much (a random word in the novel), and will specifically avoid DNA sequences known to be problematic like a string of A’s in a row. Finally, the method encrypts the information as it’s converted to DNA sequence, adding a layer of protection and privacy that could be useful with data more sensitive than a 120-year-old public domain novel.

“A top [real-world] use would be for long-term storage when you must keep the information, but use it infrequently,” Jones said, giving the example of historical banking data for years past. “Tech companies would see value for dormant accounts that no one’s using, but they don’t want to delete. There could [additionally] be a huge cost savings during storage. Storing DNA takes almost no energy — especially compared to keeping data servers plugged in and happy.”

This is a problem that at least one DNA storage company is working on, although it’s likely several years away from being viable. Nonetheless, work like this is a reminder that science is getting closer all the time.

A paper describing the work was recently published in the journal PNAS.

Luke Dormehl
Former Digital Trends Contributor
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
AMD’s upcoming RX 8800 could be a lot better than expected
AMD RX 7800 XT and RX 7700 XT graphics cards.

Waiting for the arrival of AMD's next-gen best graphics cards has been a lesson in managing expectations. Early leaks about RDNA 4 pointed toward a beastly flagship GPU, but for months now, it's been clear that AMD is not going in that direction. Instead, the company is aiming for the mainstream market, which suggests GPUs along the likes of the RTX 4070. However, the next-gen flagship might be a lot more powerful than we thought -- and we now know which GPUs are coming, as well as when to expect them to arrive.

Let's start with the "when." Jack Huynh, AMD's senior vice president and general manager of computing and graphics, shared on X that the company is set to hold a press event during CES 2025.

Read more
You need an RTX 4090 to play Indiana Jones at max settings; AMD isn’t listed
Indiana drawing a circle in red.

The upcoming Indiana Jones and the Great Circle is releasing on December 9, and Bethesda has just shared the hardware requirements for the game. What are we dealing with? Well, to say that you'll need one of the best graphics cards would be an understatement. If you want to play the new Indiana Jones at maximum settings, you'll need an RTX 4090 -- AMD cards aren't even listed as an option.

The latest Indiana Jones game is a real step up in terms of hardware requirements across the board. For starters, you need to have a hardware ray tracing GPU as a minimum requirement, and that will lock out all the people who are still running an older AMD card or an Nvidia GTX GPU, such as the GTX 1060 or GTX 1660 Super. However, this is just the tip of the iceberg.

Read more
Two features from the new Kindle Scribe are coming to the older model
The back of the Amazon Kindle Scribe.

The Kindle Scribe, Amazon's e-reader with handwriting input, was updated this October along with three more Kindle models. Although the 2024 Scribe visually resembles the first-generation model from two years ago, the former offers better annotation features and AI summaries. With a recent update, Amazon is eliminating the gap between the two generations and bringing these features to the older model.

The older Kindle Scribe model recently received an update that added two new features, Good e-Reader reported. The additions include Active Canvas, which allows you to scribble notes anywhere on a book from your Kindle library, just like you would on a physical book. When you do so, your jottings appear in a resizable box, anchored to that part of the text, and stay there even when you resize the font. Previously, notes would appear on the top of a particular page, which could make things confusing if you had multiple comments referring to different sections of the page.

Read more