This challenge is divided into two categories. Details on each category are provided below

Category 1: Surgical tool classification and localization

This category will require the teams to train weakly supervised models. The model should localize (with bounding boxes) and classify the tools present within each frame of the video clips in the test set by training on noisy tool presence labels provided in the training set.

Category 2: Surgical task recognition

This category will require the teams to train fully supervised models. The model should classify the surgical step being performed within each frame of the test video clips by training on the surgical step labels provided in the training set.