Towards Migration-Free "Just-in-Case" Data Archival for Future Cloud Data Lakes Using Synthetic DNA
- Others:
- Eurecom [Sophia Antipolis]
- Institut de pharmacologie moléculaire et cellulaire (IPMC) ; Université Nice Sophia Antipolis (1965 - 2019) (UNS) ; COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)
- Centre National de la Recherche Scientifique (CNRS)
- Université Côte d'Azur (UCA)
- Imperial College London
- ACM
- European Project: 863320,OligoArchive
- European Project: 101092877,SYCLOPS
- European Project: 101070141,GLACIATION
- European Project: 101058035,MoSS
Description
Given the growing adoption of AI, cloud data lakes face the need to support cost-effective "just-in-case" data archival over long time periods to meet regulatory compliance requirements. Unfortunately, current media technologies suffer from fundamental issues that will soon make cost-effective data archival infeasible, if they have not already. In this paper, we present a vision for redesigning the archival tier of cloud data lakes based on a novel, obsolescence-free storage medium: synthetic DNA. In doing so, we make two contributions: (i) we highlight the challenges in using DNA for data archival and list several open research problems, and (ii) we outline OligoArchive-DSM (OA-DSM), an end-to-end DNA storage pipeline that we are developing to demonstrate the feasibility of our vision.
Abstract
International audience
Additional details
- URL
- https://hal.science/hal-04146635
- URN
- urn:oai:HAL:hal-04146635v1
- Origin repository
- UNICA