CRCV PhD Disseration Announcement

Tuesday, November 4, 2014 10 a.m. to 11 a.m.
The Center for Research in Computer
Vision (CRCV)
is
pleased to announce the final oral examination for the degree of Doctor of
Philosophy of one of its students.
Mr. Guang Shu "VISUAL
ANALYSIS OF EXTREMELY DENSE CROWDED SCENES

Thursday, November 4,
2014 · 10:00 a.m. · HEC 101

Abstract: Video Surveillance is becoming increasingly important for public security and home safety. Computer vision algorithms for human detection, tracking, segmentation, action recognition etc play key roles in modern video surveillance systems.

This dissertation deals with the means of improving significantly the current state of human detection, tracking and segmentation based on learning scene-specific information in a video. Firstly, we propose a novel method for human detection by iteratively refining the output of a generic detector. In our approach, a DPM human detector is employed to collect detection examples; a support vector machine (SVM) classifier is trained using superpixel-based Bag-of-Words (BoW) feature and is used to validate the detections. The BoW feature encodes useful color features of the scene, hence it can handle the problem of appearance variation.

Given robust human detection, we propose a robust multiple-human tracking framework using a deformable part-based model (DPM). Our approach learns part-based person-specific SVM classifiers which capture articulations of moving human bodies with dynamically changing backgrounds. With the part-based model, we handle occlusions in both detection and tracking stages. In the detection stage, we select the subset of parts which maximizes the probability of detection, which significantly improves detection performance in crowded scenes. In the tracking stage, we distribute the score of the learned-person classifier among its corresponding parts, which allows us to track partial occlusions, and prevent the performance of the classifiers from being degraded.

Next, in order to obtain precise boundaries of humans, we propose a novel method for multiple human segmentation in video by incorporating human detection and part-based detection potential into a multi-frame optimization framework. In the first stage, we separate superpixels corresponding to a human and background by using the part-based detection potentials and minimizing an energy function using Conditional Random Field (CRF). In the second stage, the spatio-temporal constraints of the video is leveraged to build tracklet-based Gaussian Mixture Models (GMM) , and the boundaries are smoothed by multi-frame graph optimization.

Finally, we have developed an efficient real time tracking system, NONA, for high-definition surveillance video. We implement the system using a multi-threaded architecture (Intel Threading Building Blocks (TBB)), which executes video ingestion, tracking and video output in parallel. We employ a Fast Fourier Transform based normalized cross correlation as the core tracking algorithm for efficiency. We also incorporate Adaptive Template Scaling to handle the scale change, and Local Frame Differencing to resolve challenging issues such as occlusion and cluttered backgrounds. Read More

Location:

L3Harris Engineering Center: 101

Contact:


Calendar:

Events at UCF

Category:

Academic

Tags:

n/a