Segmentation Masks and Metadata

Celantur container can generate two different segmentation masks and metadata per processed image:

  • Binary Segmentation

  • Instance Segmentation

It's activated with the --save-mask {all, instance, binary} parameter. The segmentation is saved as a PNG file.

Anonymization, binary segmentation and instance segmentation applied to an image with Celantur software.

Binary Segmentation

The binary segmentation mask consist of two colors:

  • Background is black

  • Anonymized segments are white

The file will be saved as image-name_bin_mask.png.

Instance Segmentation

The instance segmentation mask consists of multiple colors. The RGB color values are used to differentiate to individual instances/objects.

  • The R (red) channel encodes the object type:

    • License plate: 64

    • Person: 128

    • Face: 192

    • Vehicle: 255

  • The G (green) channel encodes individual instances/objects.

  • The B (blue) channel is 0.

E.g. [192, 85, 0] is a face.

The file will be saved as image-name_ins_maks.png.

Scale Down Mask Files

By adding the optional --mask-scale {0-100} (CLI) or /v1/file/1/instance-mask?mask-scale={0-100} (Container API) parameter, mask files will be scaled down by the specified ratio.

Metadata

Image metadata

Metadata about detected instances/objects are stored in the corresponding image-name.json file.

Image metadata example

Video metadata

Metadata is generated for individual frames and provided as a range of mulitple frames.

  • Batch/Stream mode: The information is stored in filename-[startframe]-[endframe].json in the output directory. The maximum number of frames covered by a single file is 500.

  • REST API mode: The infromation can be retrieved via the Download video metadata (JSON) endpoint.

Video metadata example

Metadata attribute reference

Attributes of a image or video frame

Attribute
Description

id

The id of the image (file name) or video frame (sequential number)

size

Size of the image or frame in [width, height]

duration

Duration of processing (inference and anonymization) an image or video frame. Does not include IO, e.g. read/write from hard drive.

filename

Name of the file

folder

Name of the folder (relative to root input folder)

Detected instances/objects provided as a list under the detections attribute

Attribute
Description

id

The id of the detection, a sequential number starting from 0.

parent_image

The name of the image or the id of the video frame the detected instance/object was found on

offset

The offset of the detection's bounding box from the upper left corner of the image (x/y coordinates in pixels)

bbox

The coordinates of the detection's bounding box (x1, y1, x2, y2)

score

The detection's confidence score. States how confident the model is about the detection being a specific label (see type_label) between 0.0 and 1.0.

is_anonymized

Specifies whether the detection was anonymized (or detected with method = detect).

type_label

The detection's label assigned by the model. E.g. face, license plates, etc.

type

Numerical representation of the type_label.

color

The detections color (RGB) in the instance segmentation mask.

duration

Processing duration for a video frame (only for videos).

Last updated