Definition

Expand 3D CNN (X3D) model tried to find the optimal parameters for 3D Residual Networks using AutoML method. It tunes six parameters:

X-Fast ( $γ_{τ}$ ): the input frame rate (temporal resolution)
X-Temporal ( $γ_{t}$ ): the number of frames in the input
X-Spatial ( $γ_{s}$ ): the spacial resolution
X-Depth ( $γ_{d}$ ): the depth of the network
X-Width ( $γ_{w}$ ): the number of channels for all layers
X-Bottleneck ( $γ_{b}$ ): the inner channel width of the center convolutional filter in each residual block

My Knowledge Base