BU Logo
ECE Logo
ISS Laboratory Logo
BU Linux logo
Video

Next-Generation Methods for Image Sequence
Analysis, Processing, and Transmission

This research thrust concentrates around the idea that by processing multiple frames of an image sequence jointly, one can achieve not only better results, e.g., more accurate segmentation or improved compression ratio, but also extract qualitatively new information, such as occlusion and newly-exposed area information. Following are the current projects in this thrust:

Joint space-time segmentation and analysis of video sequences
Team: M. Ristivojevic, J. Konrad, collaborators from the University of Nice, France
Funding: National Science Foundation (CISE-CCR-SPS, Intern. Collab. USA-Frrance)
Traditional video processing methods use two image frames at a time to analyze such dynamics as motion, occlusions, etc. We explore new framework that is based on joint treatment of many image frames (e.g., 20-30). A form of joint space-time processing, this framework is essentially three-dimensional (3-D) since its domain is the x-y-t space of image sequences. It is expected to result in more reliable video segmentation, detection of occlusion effects and identification of various dynamic events. To date, we have developed a video segmentation method within this framework that is based on an active-surface model and level-set solution. Applied to both synthetic and natural image sequences this method results in “object tunnels” in the x-y-t space, that we have used successfully to identify certain occlusion events and measure time instants of object occlusions, disappearance, entry, etc. See the above link for more details and results.

Video sequence compression based on spatio-temporal transformations
Team: N. Bozinovic, J. Konrad, collaborators from the University of Nice, France
Funding: National Science Foundation (CISE-CCR-SPS, Intern. Collab. USA-France)
It is widely believed in the research community today that the next significant video coding gains will come from a joint compression of multiple video frames, such as offered by 3D wavelet coding. We have been studying the behavior of video data subject to various 3-D (x-y-t) transformations. We have developed a multiple-frame characterization of linear motion in the DCT domain. An extension of the fundamental result in Fourier domain, this outcome allows us to better understand spectral composition of 3D-DCT-transformed video data. Based on this, we have developed 3D DCT coefficient scanning patterns that are more efficient than scans used to date, and may lead to efficient 3D transform-based video coding techniques.

[VIP] [Members] [Projects] [Publications] [Equipment]