中文

System and method for image and video segmentation by anisotropic kernel mean shift

2021-10-28

United States Patent: 7,397,948. Inventors: Cohen; Michael, Thiesson; Bo, Xu; Ying-Qing, Wang; Jue

United States Patent: 7,397,948

Inventors: Cohen; Michael, Thiesson; Bo, Xu; Ying-Qing, Wang; Jue

发明人:Cohen; Michael、Thiesson; Bo、徐迎庆、王珏


Abstract: Mean shift is a nonparametric estimator of density which has been applied to image and video segmentation. Traditional mean shift based segmentation uses a radially symmetric kernel to estimate local density, which is not optimal in view of the often structured nature of image and more particularly video data. The system and method of the invention employs an anisotropic kernel mean shift in which the shape, scale, and orientation of the kernels adapt to the local structure of the image or video. The anisotropic kernel is decomposed to provide handles for modifying the segmentation based on simple heuristics. Experimental results show that the anisotropic kernel mean shift outperforms the original mean shift on image and video segmentation in the following aspects: 1) it gets better results on general images and video in a smoothness sense; 2) the segmented results are more consistent with human visual saliency; and 3) the system and method is robust to initial parameters.