Object Segmentation Masks for the ImageNet Video Dataset - 2015

  • Here we provide the binary object segmentation masks that were used to train our motion stream (Section 3.2 of the paper).

  • The data comprises a total of 84,929 video frames, together with the corresponding optical flow and the object segmentations obtained using our appearance stream model.

  • A series of filtering stages, which make use of the bounding boxes provided with the original dataset, ensures that the segmentations are of high quality.
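
As a rough illustration of how a bounding box can be used to vet a binary mask, the sketch below keeps a mask only if the tight box around its foreground pixels overlaps the annotated box well enough. This is not the filtering procedure from the paper: the function names, the intersection-over-union criterion, and the 0.5 threshold are all assumptions made for the example. Masks are taken to be lists of rows of 0/1 values, and boxes are (xmin, ymin, xmax, ymax) in inclusive pixel coordinates.

```python
def mask_bbox(mask):
    """Return the tight (xmin, ymin, xmax, ymax) box around nonzero pixels, or None."""
    rows = [y for y, row in enumerate(mask) if any(row)]
    if not rows:
        return None  # empty mask: no foreground pixels at all
    cols = [x for row in mask for x, value in enumerate(row) if value]
    return (min(cols), rows[0], max(cols), rows[-1])


def box_area(box):
    """Area of an inclusive pixel box."""
    return (box[2] - box[0] + 1) * (box[3] - box[1] + 1)


def box_iou(a, b):
    """Intersection over union of two inclusive pixel boxes."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0]) + 1
    inter_h = min(a[3], b[3]) - max(a[1], b[1]) + 1
    inter = max(inter_w, 0) * max(inter_h, 0)
    union = box_area(a) + box_area(b) - inter
    return inter / union


def keep_mask(mask, annotated_box, min_iou=0.5):
    """Hypothetical filter: keep the mask if its tight box matches the annotation.

    min_iou=0.5 is an illustrative threshold, not a value from the paper.
    """
    box = mask_bbox(mask)
    return box is not None and box_iou(box, annotated_box) >= min_iou


# Example: a 4x4 mask with a 2x2 foreground blob in the centre.
example_mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
tight_fit = keep_mask(example_mask, (1, 1, 2, 2))   # boxes coincide
loose_fit = keep_mask(example_mask, (0, 0, 3, 3))   # annotation is much larger
```

A box-level check like this is cheap and needs only the annotations shipped with the original dataset, but it cannot tell a well-shaped mask from a filled rectangle, so a real pipeline would likely combine it with other criteria.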

