Definition

Temporal 3D ConvNet (T3D) applied DenseNet structure on 3D Convolutional Network model. The 3D Temporal Transition Layer, similar to the inception module of GoogLeNet), stacked after DenseBlock to capture different temporal lengths

Also, the model utilized the pre-trained 2D ConvNet as a teacher to make the 3D ConvNet learn mod-level feature representation by image-video correspondence task. During the training, the model parameters of the 2d ConvNet is frozen.