Genomic transcription regulatory element location analysis via poisson weighted lasso
- Others:
- Duke University [Durham]
- Laboratoire Jean Alexandre Dieudonné (JAD) ; Université Nice Sophia Antipolis (1965 - 2019) (UNS) ; COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)
- CEntre de REcherches en MAthématiques de la DEcision (CEREMADE) ; Université Paris Dauphine-PSL ; Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)
- Mathématiques et Informatique Appliquées (MIA-Paris) ; Institut National de la Recherche Agronomique (INRA)-AgroParisTech
- University of Wisconsin-Madison
Description
The distances between DNA Transcription Regulatory Elements (TRE) provide important clues to their dependencies and function within the gene regulation process. However, the locations of those TREs as well as their cross distances between occurrences are stochastic, in part due to the inherent limitations of Next Generation Sequencing methods used to localize them, in part due to biology itself. This paper describes a novel approach to analyzing these locations and their cross distances even at long range via a Poisson random convolution. The resulting deconvolution problem is ill-posed, and sparsity regularization is used to offset this challenge. Unlike previous work on sparse Poisson inverse problems, this paper adopts a weighted LASSO estimator with data-dependent weights calculated using concentration inequalities that account for the Poisson noise. This method exhibits better squared error performance than the classical (unweighted) LASSO both in theoretical performance bounds and in simulation studies, and can easily be computed using off-the-shelf LASSO solvers.
Abstract
International audience
Additional details
- URL
- https://hal-agroparistech.archives-ouvertes.fr/hal-01585531
- URN
- urn:oai:HAL:hal-01585531v1
- Origin repository
- UNICA