Description

The Violent Scenes Detection (VSD) benchmark is a collection of ground-truth files based on the extraction of violent events in movies, together with high level audio and video concepts provided by Technicolor. It is intended to be used for assessing the quality of methods for the detection of violent scenes and/or the recognition of some high level, violent related, concepts in movies.The ground truth was created from a collection of 18 movies of different genres (from extremely violent movies to non violent movies).In addition to segments containing physical violence with this definition: physical violence or accident resulting in human injury or pain, annotations also include the following high-level concepts: presence of blood, fights, presence of fire, presence of guns, presence of cold arms, car chases and gory scenes, for the visual modality, presence of gunshots, explosions and screams for the audio modality.Violent segments and high level video conceptswere annotated at frame level at 25fps. Each segment or concept is therefore defined by its starting and ending frame numbers. Only segments which correspond to the targeted events were annotated, i.e. will be present in the ground-truth files.High level audio conceptsare defined by their starting and ending times in seconds. Contrary to what was done for the video part of the annotation, all segments of the movie can be found in the ground-truth files, i.e. those which correspond to the targeted events, and segments with no event.All segments and concepts audio and video may also have additional tags, describing the events, depending on their types.Related publications:C.H. Demarty, C. Penet, G. Gravier and M.Soleymani. A benchmarking campaign for the multimodal detection of violent scenes in movies.In Proceedings of the 12thinternational conference on Computer Vision Volume Part III (ECCV12),Andrea Fusiello, Vittorio Murino, and Rita Cucchiara (Eds), Col. Part III. Springer Verlag, Berlin. (pdf)C.H. Demarty, C. Penet, G. Gravier and M. Soleymani. The MediaEval 2012 Affect Task: Violent Scenes Detection.In Working Notes Proceedings of the MediaEval 2012 Workshop, Italy (2012). (pdf)

Related Papers

  • C.H. Demarty, C. Penet, G. Gravier and M. Soleymani. The MediaEval 2012 Affect Task: Violent Scenes Detection. In Working Notes Proceedings of the MediaEval 2012 Workshop, Italy (2012). (pdf) [link]
  • C.H. Demarty, C. Penet, G. Gravier and M.Soleymani. A benchmarking campaign for the multimodal detection of violent scenes in movies. In Proceedings of the 12th international conference on Computer Vision – Volume Part III (ECCV’12), Andrea Fusiello, Vittorio Murino, and Rita Cucchiara (Eds), Col. Part III. Springer Verlag, Berlin. (pdf) [link]