
Pose-guided token selection for the recognition of activities of daily living
In this paper we propose an improved token selection method that integrates semantic information from the ADL recognition task with that of human motion.
In this paper we propose an improved token selection method that integrates semantic information from the ADL recognition task with that of human motion.
In this paper we evaluate the efficiency of the most popular mobile vision transformer models in terms of latency and accuracy on ImageNet-1k.