
This research focus aims to optimise the readout layer of DNA data storage systems by improving sequencing-based data retrieval. To achieve this, DiDAX will explore novel methods for DNA concatenation to extend read length and reduce redundancy. Protocols for high-fidelity long-read sequencing will be developed and adapted specifically for DNA storage applications, with an emphasis on lowering error rates and improving yield.
In parallel, the project will design and implement software tools for decoding, data validation, and error correction, tailored to the unique characteristics of synthetic DNA libraries. These tools will support real-time analysis and enable efficient retrieval of stored information even from complex or fragmented DNA samples.
By integrating experimental and computational innovations, this focus area will enhance the scalability and reliability of DNA sequencing in data storage pipelines, contributing to DiDAX’s broader mission of building sustainable, high-density, and cost-effective digital storage technologies.