Similarity search is a key operation in multimedia retrieval systems and recommender systems, and it will play an important role also for future machine learning and augmented reality applications. When these systems need to serve large objects with tight delay constraints, edge servers close to the enduser can operate as similarity caches to...
-
2022 (v1)Journal articleUploaded on: February 22, 2023
-
August 31, 2023 (v1)Journal article
An increasing number of applications rely on complex inference tasks that are based on machine learning (ML). Currently, there are two options to run such tasks: either they are served directly by the end device (e.g., smartphones, IoT equipment, smart vehicles), or offloaded to a remote cloud. Both options may be unsatisfactory for many...
Uploaded on: January 13, 2024 -
October 31, 2024 (v1)Publication
As Internet of Things (IoT) technology advances, end devices like sensors and smartphones are progressively equipped with AI models tailored to their local memory and computational constraints. Local inference reduces communication costs and latency; however, these smaller models typically underperform compared to more sophisticated models...
Uploaded on: November 1, 2024 -
2022 (v1)Journal article
A similarity cache can reply to a query for an object with similar objects stored locally. In some applications of similarity caches, queries and objects are naturally represented as points in a continuous space. This is for example the case of 360 • videos where user's head orientation-expressed in spherical coordinates-determines what part of...
Uploaded on: February 22, 2023