A convolutional neural network (cnn) is a neural network where one or more of the layers employs a convolution as the function applied to the output of the previous layer. The task i want to do is autonomous driving using sequences of images. I am training a convolutional neural network for object detection
Demure Nudes
Apart from the learning rate, what are the other hyperparameters that i should tune
And in what order of importance
A convolutional neural network (cnn) that does not have fully connected layers is called a fully convolutional network (fcn) See this answer for more info Pooling), upsampling (deconvolution), and copy and crop operations. The concept of cnn itself is that you want to learn features from the spatial domain of the image which is xy dimension
So, you cannot change dimensions like you mentioned. Typically for a cnn architecture, in a single filter as described by your number_of_filters parameter, there is one 2d kernel per input channel There are input_channels * number_of_filters sets of weights, each of which describe a convolution kernel So the diagrams showing one set of weights per input channel for each filter are correct.
But if you have separate cnn to extract features, you can extract features for last 5 frames and then pass these features to rnn
And then you do cnn part for 6th frame and you pass the features from 2,3,4,5,6 frames to rnn which is better