HYGIEIAtech.

DNA Data Storage and Cryptography

Background:

DNA data storage is the process of encoding and decoding binary data into and from the nucleotide bases within strands of DNA. Using DNA molecules as a mechanism for storing information is an active area of research and has become a commercial service offered by biotechnology companies. Presently, this technology is restricted to a laboratory setting. The aim of this project was to investigate the feasibility of a portable sequencing platform capable of both encoding and retrieving information held in DNA sequences and simple enough for untrained operators to deploy in the field.

Process:

James was initially approached by a client who requested that he attempt to decode a cryptographic image embedded within a pool of lyophilised oligonucleotide sequences that had been produced by a commercial DNA storage company. Upon successful decoding of the encrypted data, the customer then requested a feasibility study to determine if the laboratory operations required for DNA data storage and retrieval could be turned into a portable system.

This entailed modifying a standard sequencing workflow, reducing the number of steps in the protocol and optimising processes to make it more suited to a field scenario. The study ultimately led to the design of a man-portable (briefcase-sized) and fully automated sequencing platform, capable of conducting a full sequencing operation from sample input to sequence output. The platform used pre-loaded, onboard reagents and a custom dispensing approach for accurately handling the small volumes of liquid involved in sequencing workflows. Critically, the user required no specialist domain knowledge in molecular biology to conduct a sequencing operation. This project was completed in collaboration with data scientists and cryptographers from Imperial College London and a specialist contract engineering partner.

Outcomes:

  1. Accurate and reproducible decoding of data encrypted within DNA sequences that were prepared and supplied by an expert third-party DNA storage company.

  2. Proven technical feasibility of a fully automated, portable sequencing platform for DNA data storage and cryptography, and subsequent prototype development.