Paper
22 October 2024 GaborNeXt: a low-level visual area encoding model based on convolution kernel optimization-improvement of Gabornet
Weichen Zhao, Linyuan Wang, Shuxiao Ma, Libin Hou, Chi Zhang, Bin Yan
Author Affiliations +
Proceedings Volume 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024); 1327414 (2024) https://doi.org/10.1117/12.3039671
Event: Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 2024, Haikou, HI, China
Abstract
The Gabor wavelet is widely used to simulate the receptive fields of simple cells in the low–level visual cortex (such as V1, V2, and V3) of the human visual system. Based on this, end-to-end encoding models have achieved advanced encoding results in the low–level visual cortex. However, most current end-to-end encoding models are lightweight models with relatively simple structures and few parameters. This limitation may cause the models to perform poorly in processing detailed features of different frequencies and directions in complex Gabor feature maps. In this paper, a novel visual coding model GaborNeXt based on Gabor features is proposed. The model utilizes ConvNeXt convolutional layers to group independent convolutional kernels for convolutional operations and concatenates the outputs of each group to enhance nonlinear expressive power. We conducted experiments on the NSD (Natural Scenes Dataset) and the results demonstrate that our model outperforms the baseline models in encoding accuracy across several the low-level visual cortex. Additionally, we compared the effects of various Gabor convolutional layer kernel sizes on model performance through ablation experiments and found that using larger convolutional kernels in the Gabor convolutional layer has a positive impact on the model's performance.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Weichen Zhao, Linyuan Wang, Shuxiao Ma, Libin Hou, Chi Zhang, and Bin Yan "GaborNeXt: a low-level visual area encoding model based on convolution kernel optimization-improvement of Gabornet", Proc. SPIE 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 1327414 (22 October 2024); https://doi.org/10.1117/12.3039671
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visual process modeling

Visualization

Performance modeling

Voxels

Data modeling

Convolution

Functional magnetic resonance imaging

Back to Top