Published September 11, 2023
| Version v1
Publication
LIPSFUS: A neuromorphic dataset for audio-visual sensory fusion of lip reading
Description
This paper presents a sensory fusion neuromorphic
dataset collected with precise temporal synchronization using a
set of Address-Event-Representation sensors and tools. The target
application is the lip reading of several keywords for different
machine learning applications, such as digits, robotic commands,
and auxiliary rich phonetic short words. The dataset is enlarged
with a spiking version of an audio-visual lip reading dataset collected with frame-based cameras. LIPSFUS is publicly available
and it has been validated with a deep learning architecture for
audio and visual classification. It is intended for sensory fusion
architectures based on both artificial and spiking neural network
algorithms.
Additional details
Identifiers
- URL
- https://idus.us.es/handle//11441/148852
- URN
- urn:oai:idus.us.es:11441/148852
Origin repository
- Origin repository
- USE