Welcome to CSML

Video Background Subtraction

  • Home
  • Video Background Subtraction
project image
project image
  • Static

    0

  • Hard shadow

    50

  • Outside region of interes

    85

  • Unknown motion (usually around moving objects, due to semi-transparency and motion blur)

    170

  • Motion

    255

Introduction

Creating a foreground mask, which is a binary picture of the pixels that belong to moving elements in the scene, can be done with still cameras by using a technique known as background subtraction (BS), which is a common and commonly used method. In this study, we will show a video subtraction pipeline that makes use of both computer vision and deep learning techniques.

Dataset

The dataset, CDNET, can be downloaded from here. This dataset contains 11 video categories with 4–6 video sequences in each category. However, we only used eight video sequences. They are: busStation, canoe, fountain02, highway, office, park, peopleInShade, sidewalk. Each individual video file (.zip or.7z) can be downloaded separately. Alternatively, all video files within one category can be downloaded as a single .zip or .7z file. Each video file, when uncompressed, becomes a directory that contains the following:

  1. a sub-directory named “input” containing a separate JPEG file for each frame of the input video

  2. a sub-directory named “groundtruth” containing a separate BMP file for each frame of the groundtruth

  3. “an empty folder named “results” for binary results (1 binary image per frame per video you have processed)

  4. files named “ROI.bmp” and “ROI.jpg” showing the spatial region of interest

  5. a file named “temporalROI.txt” containing two frame numbers. Only the frames in this range will be used to calculate your score

The groundtruth images contain five labels, namely

  • 0 : Static

  • 50 : Hard shadow

  • 85 : Outside region of interest

  • 170 : Unknown motion (usually around moving objects, due to semi-transparency and motion blur)

  • 255 : Motion