Paper
16 July 2021 You don't drink a cupboard: improving egocentric action recognition with co-occurrence of verbs and nouns
Author Affiliations +
Proceedings Volume 11794, Fifteenth International Conference on Quality Control by Artificial Vision; 117940X (2021) https://doi.org/10.1117/12.2591298
Event: Fifteenth International Conference on Quality Control by Artificial Vision, 2021, Tokushima, Japan
Abstract
We propose a refinement module to improve action recognition by considering the semantic relevance between verbs and nouns. Existing methods recognize actions as a combination of verb and noun. However, they occasionally produce the semantically implausible combination, such as “drink a cupboard” or “open a carrot”. To tackle this problem, we propose a method that incorporates a word embedding model into an action recognition network. The word embedding model is trained to obtain co-occurrence between verbs and nouns and used to refine the initial class probabilities estimated by the network. Experimental results show that our method improves the estimation accuracy of verbs and nouns on the EPIC-KITCHENS Dataset.
© (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hiroki Kojima, Naoshi Kaneko, Seiya Ito, and Kazuhiko Sumi "You don't drink a cupboard: improving egocentric action recognition with co-occurrence of verbs and nouns", Proc. SPIE 11794, Fifteenth International Conference on Quality Control by Artificial Vision, 117940X (16 July 2021); https://doi.org/10.1117/12.2591298
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Information visualization

RGB color model

Cameras

Convolutional neural networks

Detection and tracking algorithms

Neural networks

Optical flow

RELATED CONTENT


Back to Top