MiniFrance

Citation Author(s):
Javiera
Castillo Navarro
ONERA
Bertrand
Le Saux
ESA/ESRIN
Alexandre
Boulch
valeo.ai
Nicolas
Audebert
CNAM
Sébastien
Lefèvre
IRISA/Université de Bretagne-Sud
Submitted by:
Javiera Castill...
Last updated:
Fri, 07/24/2020 - 10:02
DOI:
10.21227/b9pt-8x03
Data Format:
License:
5
1 rating - Please login to submit your rating.

Abstract 

The Dataset

We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation: the MiniFrance suite.

MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images,; it is varied, covering 16 conurbations in France, with various climates, different landscapes, and urban as well as countryside scenes; and it is challenging, considering land use classes with high-level semantics. Nevertheless, the most distinctive quality of MiniFrance is being the only dataset in the field especially designed for semi-supervised learning: it contains labeled and unlabeled images in its training partition, which reproduces a life-like scenario.

 

The Team

 

Instructions: 

##################################################

The MiniFrance Suite

##################################################

Authors:

Javiera Castillo Navarro, javiera.castillo_navarro@onera.fr

Bertrand Le Saux, bls@ieee.org

Alexandre Boulch, alexandre.boulch@valeo.com

Nicolas Audebert, nicolas.audebert@cnam.fr

Sébastien Lefèvre, sebastien.lefevre@irisa.fr

##################################################

About:

This dataset contains very high resolution RGB aerial images over 16 cities and their surroundings from different regions in France, obtained from IGN's BD ORTHO database (images from 2012 to 2014). Pixel-level land use and land cover annotations are provided, generated by rasterizing Urban Atlas 2012.

##################################################

This dataset is partitioned in three parts, defined by conurbations:

1. Labeled training data: data over Nice and Nantes/Saint Nazaire.

2. Unlabeled training data: data over Le Mans, Brest, Lorient, Caen, Calais/Dunkerque and Saint-Brieuc.

3. Test data: data over Marseille/Martigues, Rennes, Angers, Quimper, Vannes, Clermont-Ferrand, Cherbourg, Lille.

Due to the large-scale nature of the dataset, it is divided in several files to download:

- Images for the labeled training partition: contains RGB aerial images for french departments in the labeled training partition.

- Images for the unlabeled training partition (parts 1, 2 and 3): contain RGB aerial images for french departments in the unlabeled training partition.

- Images for the test partition (parts 1, 2, 3 and 4): contain RGB aerial images for french departments in the partition reserved for evaluation.

- Labels for the labeled partition

- Lists of files by conurbation and partition: contain .txt files that list all images included by city.

Land use maps are available for all images in the labeled training partition of the dataset. We consider here Urban Atlas classes at the second hierarchical level. Available classes are:

- 0: No information

- 1: Urban fabric

- 2: Industrial, commercial, public, military, private and transport units

- 3: Mine, dump and contruction sites

- 4: Artificial non-agricultural vegetated areas

- 5: Arable land (annual crops)

- 6: Permanent crops

- 7: Pastures

- 8: Complex and mixed cultivation patterns

- 9: Orchards at the fringe of urban classes

- 10: Forests

- 11: Herbaceous vegetation associations

- 12: Open spaces with little or no vegetation

- 13: Wetlands

- 14: Water

- 15: Clouds and shadows

##################################################

Citation: If you use this dataset for your work, please use the following citation:

@article{castillo2020minifrance,
title={{Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study}},
author={Castillo-Navarro, Javiera and Audebert, Nicolas and Boulch, Alexandre and {Le Saux}, Bertrand and Lef{\`e}vre, S{\'e}bastien},
journal={Under review.},
year={2020}
}

##################################################

Copyright:

The images in this dataset are released under IGN's "licence ouverte". More information can be found at http://www.ign.fr/institut/activites/lign-lopen-data

The maps used to generate the labels in this dataset come from the Copernicus program, and as such are subject to the terms described here: https://land.copernicus.eu/local/urban-atlas/urban-atlas-2012?tab=metadata