Multilabel classification models predict which subset of a predefined set of
possible labels best matches the input image.
The special case in which there is only 1 possible label is called "binary classification".
Datasets follow this structure:
endpoint_url/bucket
├── prefix/images/
├── prefix/annotations.yaml
└── prefix/metadata.yaml
Dataset images are placed directly inside images/ (subdirectories are ignored).
The metadata file looks something like
this:
task: multilabel classification
annotations: annotations.yaml
labels: [lab1, lab2, lab3]
The annotations field specifies the name of
the file containing the ground truth annotations.
Here's an example of
annotations file:
000.jpg: [lab2, lab3]
001.jpg: [lab1]
002.jpg: [] # no label
# ...