Definition

Expand 3D CNN (X3D) model tried to find the optimal parameters for 3D Residual Networks using AutoML method. It tunes six parameters:

  • X-Fast (): the input frame rate (temporal resolution)
  • X-Temporal (): the number of frames in the input
  • X-Spatial (): the spacial resolution
  • X-Depth (): the depth of the network
  • X-Width (): the number of channels for all layers
  • X-Bottleneck (): the inner channel width of the center convolutional filter in each residual block