Description

10 videos as inputs, and segmented image sequences as ground-truth

Related datasets