Data records and outputs

To optimise UAV data records and outputs, several crucial steps are yet to be taken.

Selecting a compatible format, such as GeoTIFF*, ensures seamless integration with analysis tools. Prioritising shareable formats facilitates collaborative sharing of findings. Adhering to standardised formats, following industry norms like those set by the Open Geospatial Consortium, promotes consistency and compatibility. Accurate error measurement and assessment of data accuracy are essential, with methodologies detailed in comprehensive review papers.

Finally, implementing robust data storage practices, like redundant systems and cloud backups, safeguards against potential loss, ensure the accessibility and reliability of UAV-derived data. *GeoTIFF is a public domain metadata standard which allows georeferencing information to be embedded within a TIFF file.

It is also important to collect appropriate metadata on the imagery collected at the time of capture where possible. Datasets automatically have their own metadata that will be recorded, and this can be accessed later via suitably scripted code. However, for weed detection, some of this data is not captured automatically and it is very useful for later processing and for keeping consistent records to collate this data at the outset. Such data should include where relevant:

  • Biological information, such as date, plant name, growth stage, soil and surface characteristics and GPS coordinates.
  • Data capture details, such as sensor make, lens, and lens focal length, sensor height, angle, ground speed. Other data may include data capture conditions, such as weather, seasons and light availability
  • Dataset generation details, such as label type, description and values, the number of labels and crops generated (including crop settings) and the number of images and instances of target plants in the images per class.

This data for each dataset could be collated in three small tables, similar to the following:

Biological Data

Property DatatypeCommentValue

Date of Capture

 

date

date of data capture campaign

 

Plant Name

 

string

name of plant

 

Growth Stage

Minimum

string

Biological growth stage indicator

 
 

Maximum

string

  

Soil Colour

 

string

background soil colour

 

Surface

Cover

string

background coverage type

 
 

Coverage

percentage

percentage indicating how much is covered by vegetation vs. background

 

Location

Lat

lattitude

Approximate lattitude in degrees (using consistent datum for all data e.g GDA 2020)

 
 

Long

longitude

Approximate longitutude in degrees (using consistent datum for all data e.g GDA 2020)

 

Capture Details

Property DatatypeCommentValue

Camera

Make

string

free text camera make

 
 

Lens

string

free text lens make

 
 

Lens Focallength

float

in milimeters

 

Data Capture

Height

float

in milimeters off the ground

 
 

Angle

degrees

elevation angle in degrees. Directly down = 90, forward = 0

 
 

Ground speed

float

in m/s. 0 for stationary

 

Camera FoV

 

float

Field of View for camera across diagonal of image,in degrees. 0 for image crops

 

Light type

 

enum

artificial or natural

 

Conditions

Season

enum

summer, winter, spring or autumn

 
 

Weather

string

description of weather conditions

 
 

Wind

string

description of wind conditions

 
 

Lighting

string

description of light conditions

 

Dataset generation details

Property DatatypeCommentValue

Label

Type

enum

one of: classification, object detection, segmentation

 
 

Description

string

description of what a label contains

 
 

Values

string

list of label names or numbers corresponding to class names

 

Generation

labels

enum

one of: automatic, semi-automatic, manual

 
 

crops

enum

yes / no

 
 

settings

string

crop settings, if used

 

Image Count

 

int

number of total images in dataset

 

Instance Count

 

int

number of instances in dataset, per class