Publicação

Influence of Sample Size in Land Cover Classification Accuracy Using Random Forest and Sentinel-2 Data in Portugal

Detalhes bibliográficos
Resumo:	Classification accuracy of remote sensing images with supervised learning depends on the quality and characteristics of training samples. Size is a key aspect of a sample and its impact on classification depends on several factors, including the classifier employed, dimension on the feature space and land cover characteristics. Random Forest classifier is considered to be of low sensitivity to variations in sample size. However, further investigation is required when feature spaces are large and training is performed with spectral subclasses of the land cover classes to be mapped. This paper proposes to assess the impact of sample size in the classification accuracy of Random Forest using multitemporal Sentinel-2 data and a detailed set of training subclasses to produce a map with general land cover classes. The results revealed similar classification accuracies after major reductions in sample size.
Autores principais:	Moraes, Daniel
Outros Autores:	Benevides, Pedro; Costa, Hugo; Moreira, Francisco D.; Caetano, Mario
Assunto:	Random Forest sample size Sentinel-2 training sample Computer Science Applications General Earth and Planetary Sciences SDG 15 - Life on Land
Ano:	2021
País:	Portugal
Tipo de documento:	documento de conferência
Tipo de acesso:	acesso aberto
Instituição associada:	Universidade Nova de Lisboa
Idioma:	inglês
Origem:	Repositório Institucional da UNL

Descrição
Resumo:	Classification accuracy of remote sensing images with supervised learning depends on the quality and characteristics of training samples. Size is a key aspect of a sample and its impact on classification depends on several factors, including the classifier employed, dimension on the feature space and land cover characteristics. Random Forest classifier is considered to be of low sensitivity to variations in sample size. However, further investigation is required when feature spaces are large and training is performed with spectral subclasses of the land cover classes to be mapped. This paper proposes to assess the impact of sample size in the classification accuracy of Random Forest using multitemporal Sentinel-2 data and a detailed set of training subclasses to produce a map with general land cover classes. The results revealed similar classification accuracies after major reductions in sample size.