Incremental learning of 3D-DCT compact representations for robust visual tracking

dc.contributor.author: Li, Xi
dc.contributor.author: Dick, Anthony
dc.contributor.author: Shen, Chunhua
dc.contributor.author: Van Den Hengel, Anton
dc.contributor.author: Wang, Hanzi
dc.contributor.author: 李鑫
dc.contributor.author: 王菡子
dc.date.accessioned: 2015-07-22T07:12:07Z
dc.date.available: 2015-07-22T07:12:07Z
dc.date.issued: 2013
dc.description.abstract: Visual tracking usually requires an object appearance model that is robust to changing illumination, pose, and other factors encountered in video. Many recent trackers utilize appearance samples in previous frames to form the bases upon which the object appearance model is built. This approach has the following limitations: 1) the bases are data driven, so they can be easily corrupted, and 2) it is difficult to robustly update the bases in challenging situations. In this paper, we construct an appearance model using the 3D discrete cosine transform (3D-DCT). The 3D-DCT is based on a set of cosine basis functions which are determined by the dimensions of the 3D signal and thus independent of the input video data. In addition, the 3D-DCT can generate a compact energy spectrum whose high-frequency coefficients are sparse if the appearance samples are similar. By discarding these high-frequency coefficients, we simultaneously obtain a compact 3D-DCT-based object representation and a signal reconstruction-based similarity measure (reflecting the information loss from signal reconstruction). To efficiently update the object representation, we propose an incremental 3D-DCT algorithm which decomposes the 3D-DCT into successive operations of the 2D discrete cosine transform (2D-DCT) and 1D discrete cosine transform (1D-DCT) on the input video data. As a result, the incremental 3D-DCT algorithm only needs to compute the 2D-DCT for newly added frames as well as the 1D-DCT along the third dimension, which significantly reduces the computational complexity. Based on this incremental 3D-DCT algorithm, we design a discriminative criterion to evaluate the likelihood of a test sample belonging to the foreground object. We then embed the discriminative criterion into a particle filtering framework for object state inference over time. Experimental results demonstrate the effectiveness and robustness of the proposed tracker. © 1979-2012 IEEE.
dc.identifier.citation: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(4): 863-881
dc.identifier.issn: 0162-8828
dc.identifier.other: 20131016089543
dc.identifier.uri: https://dspace.xmu.edu.cn/handle/2288/90429
dc.language.iso: en_US
dc.publisher: IEEE Computer Society
dc.source.uri: http://dx.doi.org/10.1109/TPAMI.2012.166
dc.subject: Discrete cosine transforms
dc.subject: Signal analysis
dc.subject: Signal reconstruction
dc.subject: Template matching
dc.subject: Tracking (position)
dc.subject: Video recording
dc.title: Incremental learning of 3D-DCT compact representations for robust visual tracking
dc.type: Article
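
The abstract above describes a separable 3D-DCT appearance model: a 2D-DCT over each appearance sample, a 1D-DCT along the temporal axis, truncation of high-frequency coefficients to obtain the compact representation, a reconstruction-loss-based likelihood, and an incremental update that only recomputes the 2D-DCT of newly added frames plus the 1D-DCT along the third dimension. The sketch below (Python with NumPy/SciPy) illustrates those steps under simplifying assumptions; it is not the authors' implementation, and the patch shape, the retained-coefficient counts (keep) and the likelihood bandwidth (sigma) are illustrative choices.

import numpy as np
from scipy.fft import dct, idct, dctn, idctn

def dct3(samples):
    # Separable 3D-DCT of an (H, W, T) stack of normalized appearance
    # patches: 2D-DCT on each frame, then a 1D-DCT along the time axis.
    per_frame = dctn(samples, axes=(0, 1), norm="ortho")
    return dct(per_frame, axis=2, norm="ortho")

def idct3(coeffs):
    # Inverse transform: undo the temporal 1D-DCT, then the per-frame 2D-DCT.
    per_frame = idct(coeffs, axis=2, norm="ortho")
    return idctn(per_frame, axes=(0, 1), norm="ortho")

def truncate(coeffs, keep=(8, 8, 4)):
    # Keep only the low-frequency corner of the coefficient cube; the
    # discarded high-frequency coefficients are near-zero when the stored
    # appearance samples are similar, so little information is lost.
    mask = np.zeros_like(coeffs)
    mask[:keep[0], :keep[1], :keep[2]] = 1.0
    return coeffs * mask

def likelihood(samples, candidate, keep=(8, 8, 4), sigma=0.1):
    # Reconstruction-based similarity: append the candidate patch as the
    # newest frame, reconstruct it from the retained coefficients, and map
    # the reconstruction error to a likelihood usable by a particle filter.
    stack = np.concatenate([samples, candidate[:, :, None]], axis=2)
    recon = idct3(truncate(dct3(stack), keep))
    err = np.linalg.norm(recon[:, :, -1] - candidate)
    return np.exp(-(err ** 2) / (2.0 * sigma ** 2))

class Incremental3DDCT:
    # Incremental update: cache each stored frame's 2D-DCT so that adding a
    # frame costs one 2D-DCT plus one 1D-DCT along the temporal axis,
    # instead of recomputing the full 3D-DCT of the whole stack.
    def __init__(self):
        self.frame_coeffs = []  # per-frame (H, W) 2D-DCT coefficient arrays

    def add_frame(self, frame):
        self.frame_coeffs.append(dctn(frame, axes=(0, 1), norm="ortho"))

    def coefficients(self):
        stacked = np.stack(self.frame_coeffs, axis=2)
        return dct(stacked, axis=2, norm="ortho")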

Files

Original bundle

Now showing 1 - 1 of 1
Name: Incremental learning of 3....html
Size: 624 B
Format: Hypertext Markup Language