Published September 23, 2024 | Version v1
Publication

Resampling and averaging coordinates on data

Description

We introduce algorithms for robustly computing intrinsic coordinates on point clouds. Our approach relies on generating many candidate coordinates by subsampling the data and varying hyperparameters of the embedding algorithm (e.g., manifold learning). We then identify a subset of representative embeddings by clustering the collection of candidate coordinates and using shape descriptors from topological data analysis. The final output is the embedding obtained as an average of the representative embeddings using generalized Procrustes analysis. We validate our algorithm on both synthetic data and experimental measurements from genomics, demonstrating robustness to noise and outliers.

Additional details

Identifiers

URL
https://inria.hal.science/hal-04706757
URN
urn:oai:HAL:hal-04706757v1

Origin repository

Origin repository
UNICA