iobjectspy.ml.vision package¶

Module contents¶

class iobjectspy.ml.vision.DataPreparation¶

Bases: object

Image data preparation process entry

static create_training_data(input_data, input_label, label_class_field, output_path, output_name, training_data_format, tile_format='jpg', tile_size_x=1024, tile_size_y=1024, tile_offset_x=512, tile_offset_y=512, tile_start_index=0, save_nolabel_tiles=False, input_compare_data=None, **kwargs)¶

Training data generation

Generate the training sample image tiles in specified size from the input imagery with labeled vector data.

The output includes pictures, annotations, and meta-information. The pictures and annotations name one-to-one correspondence.

Parameters:

input_data (str) – input image data, support image file
input_label (str or DatasetVector) – input vector label data, support vector dataset
label_class_field (str or None) – the field name represents the label categpries. If ‘None’ is specified, all labels are of the same category.
output_path (str) – output training data storage path
training_data_format (str) – output training data format, support four different formates: ‘VOC’, ‘MULTI_C’, ‘BINARY_C’, ‘SCENE_C’.
tile_format (str) – image tile format, support ‘tif’, ‘jpg’, ‘png’, and ‘origin formates’
tile_size_x (int) – tile size in x direction
tile_size_y (int) – tile size in y direction
tile_offset_x (int) – tile offset in the x direction
tile_offset_y (int) – tile offset in y direction
tile_start_index (int) – the initial index value for naming the tiles. The default is 0 and set to -1 when using this function to process multiple images.
save_nolabel_tiles (bool) – whether to save tiles without labels

Returns:

None

VOC format:: ./VOC

./VOC/Annotations/000000001.xml label tiles

./VOC/Images/000000001.jpg image tiles

./VOC/ImageSets/Main/train.txt, val.txt, test.txt, trainval.txt training dataset tile name, validation dataset tile name, test dataset tile name, training dataset and validation dataset tile name

./VOC/VOC.sda training data configuration file
MULTI_C format:: ./MULTI_C

./MULTI_C/Images/00000000.tif image tiles

./MULTI_C/Masks/00000000.png label tiles

./MULTI_C/MULTI_C.sda training data configuration file
BINARY_C format:: ./BINARY_C

./BINARY_C/Images/00000000.tif image tiles

./BINARY_C/Masks/00000000.png label tiles

./BINARY_C/BINARY_C.sda training data configuration file
SCENE_C format:: ./SCENE_C

./SCENE_C/0/00000000.tif image tiles

./SCENE_C/1/00000000.png image tiles

./SCENE_C/2/00000000.tif image tiles

….

./SCENE_C/scene_classification.csv mapping the relationship between saved image file path and the categories.

./SCENE_C/SCENE_C.sda Training data configuration file

class iobjectspy.ml.vision.ImageryEvaluation¶

Bases: object

static binary_classification(inference_data, ground_truth_data, inference_class_value_field=None, ground_truth_class_value_field=None, metric_type=None, out_data='', out_data_name='metric')¶

影像二元分类模型评估接口，可基于输入的真实标签数据和预测标签数据计算结果，支持影像和影像数据计算，矢量和矢量数据计算。

Parameters:

inference_data (str or DatasetVector) – 必选参数。推理结果数据集，输入的矢量面数据集来自于模型推理object_detect_infer
ground_truth_data (str or DatasetVector) – 必选参数。真实标签数据集，输入的矢量面数据集来自于真实的标签数据集
inference_class_value_field (str or None) – 可选参数。推理结果数据包含类别字段名。如果指定的字段为None，则默认去找’value’字段，若字段不存在，则所有记录都被认定为是同一个类
ground_truth_class_value_field (str or None) – 可选参数。真实数据类别字段名。如果指定的字段为None，则默认去找’value’字段，若字段不存在，则所有记录都被认定为是同一个类
metric_type (str or None) – 可选参数。待计算的指标名称。默认为None，为None时输出该功能全部指标。支持的metric_type为：PA,IoU,F1,Kappa
out_data (str or Datasource or DatasourceConnectionInfo) – 可选参数。输出文件（或数据源）路径
out_data_name (str) – 可选参数。输出文件（或数据集）名称

Returns: