DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie,
Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang.

Image Source and Usage License

The images of in DOTA-v1.0 dataset are manily collected from the Google Earth, some are taken by satellite JL-1, the others are taken by satellite GF-2 of the China Centre for Resources Satellite Data and Application.

Use of the images from Google Earth must respect the corresponding terms of use: "Google Earth" terms of use.

All images and their associated annotations in DOTA can be used for academic purposes only, but any commercial use is prohibited.

Object Category

The object categories in DOTA-v1.0 include: plane, ship, storage tank, baseball diamond, tennis court, basketball court, ground track field, harbor, bridge, large vehicle, small vehicle, helicopter, roundabout, soccer ball field and swimming pool.

Annotation format

In the dataset, each instance's location is annotated by a quadrilateral bounding boxes, which can be denoted as "x1, y1, x2, y2, x3, y3, x4, y4" where (xi, yi) denotes the positions of the oriented bounding boxes' vertices in the image. The vertices are arranged in a clockwise order. The following is the Visualization of adopted annotation method. The yellow point represents the starting point. which refers to: (a) top left corner of a plane, (b) top left corner of a large vehicle diamond, (c) the center of sector-shaped baseball.

Except the annotation of location, category label is assigned for each instance, which comes from one of the above 15 selected categories, and meanwhile a difficult label is provided which indicates whether the instance is difficult to be detected(1 for difficult, 0 for not difficult). Annotations for an image are saved in a text file with the same file name. At the first line, 'imagesource'(from GoogleEarth, GF-2 or JL-1) is given. At the second line, ’gsd’(ground sample distance, the physical size of one image pixel, in meters) is given. Note if the 'gsd' is missing, it is annotated to be 'null'. From third line to last line in annotation text file, annotation for each instance is given. The annotation format is:

					
'imagesource':imagesource
'gsd':gsd
x1, y1, x2, y2, x3, y3, x4, y4, category, difficult
x1, y1, x2, y2, x3, y3, x4, y4, category, difficult
...						
					
				
Cinque Terre
(a)
Cinque Terre
(b)
Cinque Terre
(c)

Development kit

The Development kit provide the following function

  • Load and visulize the data.
  • Evaluate the result.
  • Split and merge the data.

Trained Models

The following is the models trained in DOTA.

Data Download

You can download DOTA-v1.0 from either Baidu Drive or Google Drive, according to your network connections.