（论文阅读22/100）Learning a Deep Compact Image Representation for Visual Tracking

文献阅读笔记
简介	题目	Learning a Deep Compact Image Representation for Visual Tracking
	作者	N Wang, DY Yeung
	原文链接	Learning a Deep Compact Image Representation for Visual Tracking (neurips.cc)
	关键词	Object tracking、DLT、SDAE
	研究问题	tracking the trajectory of a moving object in a video with possibly very complex background 在尽可能复杂背景的视频中跟踪目标的移动轨迹
	研究方法	DLT：deep learning tracker 试图结合生成性跟踪器和判别性跟踪器的思想，开发一个鲁棒的判别性跟踪器。更强调无监督的特征学习问题； Specifically, by using auxiliary natural images, we train a stacked denoising autoencoder （SDAE） offline to learn generic image features that are more robust against variations. 通过使用辅助的自然图像，我们离线训练一个堆叠的去噪自编码器，以学习对变化更稳定的通用图像特征。 This is then followed by knowledge transfer from offline training to the online tracking process. 随后是知识转移从线下培训到线上跟踪的过程。 Online tracking involves a classification neural network which is constructed from the encoder part of the trained autoencoder as a feature extractor and an additional classification layer. 在线跟踪涉及一个分类神经网络，该网络由训练好的自编码器的编码器部分作为特征提取器和一个额外的分类层构建而成。 Both the feature extractor and the classifier can be further tuned to adapt to appearance changes of the moving object. 特征提取器和分类器都可以进行进一步的调整以适应移动对象的外观变化。
	研究结论	相比其他跟踪器，在保持低计算成本和实时性的情况下更准确
	创新不足	it would be an interesting direction to investigate a shift-variant CNN. 目前的跟踪器中的分类层只是一个线性分类器。将其扩展到更强大的分类器中，就像在其他判别跟踪器中一样，可能会为进一步的性能提升提供更大的空间。
	额外知识	生成式和判别式方法：生成式：假设被跟踪的对象可以用某种生成式过程来描述，因此跟踪相当于在可能无限多的候选对象中找到最可能的候选对象。关键是开发更加稳定的图像表示。灵感来源于robust estimation and sparse coding，such as the alternating direction method of multipliers (ADMM) and accelerated gradient methods。Some popular generative trackers include incremental visual tracking (IVT) , which represents the tracked object based on principal component analysis (PCA), and the l1 tracker (L1T)。判别式：，判别式方法将跟踪视为一个二分类问题，学习将被跟踪对象与其背景明确区分开来。Some representative trackers in this category are the online AdaBoost (OAB) tracker [6], multiple instance learning (MIL) tracker [3], and structured output tracker (Struck). 对比：由于使用了更丰富的图像表示，生成式跟踪器通常在较不复杂的环境下产生更准确的结果，而判别式跟踪器由于明确地将背景考虑在内，因此对强遮挡和变化具有更强的鲁棒性。 The particle filter framework：粒子滤波框架一种基于序列观测值估计动力系统潜在状态变量的序贯蒙特卡洛重要性采样方法。它通过一组粒子来近似后验状态分布，而不仅仅是像模式这样的单点。对于视觉跟踪而言，这一特性使得跟踪器更容易从错误的跟踪结果中恢复 Kalman filter卡尔曼滤波：比较先进的跟踪器：MTT, CT , VTD , MIL, a latest variant of L1T, TLD, and IVT.