This work presents a multitemporal class-driven hierarchical Residual Neural Network (ResNet) designed for modelling the classification of Time Series (TS) of multispectral images at different semantical class levels. The architecture consists of a modification of the ResNet where we introduce additional branches to perform the classification at the different hierarchy levels and leverage on hierarchy-penalty maps to discourage incoherent hierarchical transitions within the classification. In this way, we improve the discrimination capabilities of classes at different levels of semantic details and train a modular architecture that can be used as a backbone network for introducing new specific classes and additional tasks considering limited training samples available. We exploit the class-hierarchy labels to train efficiently the different layers of the architecture, allowing the first layers to train faster on the first levels of the hierarchy modeling general classes (i.e., the macro-classes) and the intermediate classes, while using the last ones to discriminate more specific classes (i.e., the micro-classes). In this way, the targets are constrained in following the hierarchy defined, improving the classification of classes at the most detailed level. The proposed modular network has intrinsic adaptation capability that can be obtained through fine tuning. The experimental results, obtained on two tiles of the Amazonian Forest on 12 monthly composites of Sentinel-2 images acquired during 2019, demonstrate the effectiveness of the hierarchical approach in both generalizing over different hierarchical levels and learning discriminant features for an accurate classification at the micro-class level on a new target area, with a better representation of the minoritarian classes.
This work presents a system for multi-year crop type mapping based on the multi-temporal Long Short-Term Memory (LSTM) Deep Learning (DL) model and Sentinel 2 image Time Series (TS). The method assumes the availability of a pre-trained LSTM model for a given year and aims to update the corresponding crop type map fora different year considering a small amount of recent reference data. To this end, the proposed approach combines Self-Paced Learning (SPL) and fine-tuning (FT) techniques. While the SPL technique gradually incorporates samples from crop types that can be classified with high-confidence by the pre-trained model, the FT strategy adapts the network to those classes having low-confidence accuracy. This condition allows us to reduce the labeled samples required to achieve accurate classification results. The experimental results obtained on three tiles of the Austrian country on TSs of Sentinel 2 data acquired in 2019 and 2020 (considering a model pre-trained on images of 2018) demonstrate the capability of the LSTM to adapt to TS of images with different temporal and radiometric characteristic with respect to the one used to pre-train the model, with a relatively small number of training samples. As expected, by directly applying the model without performing any adaptation, we obtain a mean F-score (F1%) of 64% and 62% compared to 76% and 70% achieved by the proposed technique with only 1500 samples for 2019 and 2020, respectively.
The regular monitoring of agricultural areas is extremely important for mitigating food insecurity risks and for planning government interventions. In the literature, several deep learning algorithms have been recently proposed to perform land cover/ land use classification by using multispectral optical images. However, most of the considered deep learning models, such as the standard Convolutional Neural Networks (CNN), rely on mono-temporal images, focusing on spectral and textural features while discarding the temporal component, which is crucial for the accurate crop type mapping. In this work, we exploit a Long Short Term Memory (LSTM) deep learning classification architecture to characterize agricultural area dynamics by using the multitemporal multispectral information provided by satellite multispectral sensor Sentinel 2. Instead of considering a pre-trained network and applying to it a fine-tuning, the proposed architecture is trained from scratch in order to be tailored to the specific properties of the long time series of Sentinel 2 multispectral images. To face the lack of labeled training database, existing crop type maps available at the country level are used to generate a large set of weak reference data. First, the proposed method automatically extracts a large training dataset from existing crop type maps, by detecting those samples having the highest probability of being correctly classified. Then, the weak labeled samples extracted are used to train the deep LSTM architecture on a time series of Sentinel 2 images acquired over an entire year. The preliminary results obtained demonstrate the effectiveness of the proposed approach, which is promising at large scale.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.