The research paper is available on the link:
https://arxiv.org/pdf/1606.02147.pdf
Not able to understand the initial block of the Enet architecture.
Statement given in research paper on page 3:
ENet initial block. MaxPooling is performed with non-overlapping 2 × 2 windows,
and the convolution has 13 filters, which sums up to 16 feature maps after concatenation.
So the question is, How are we getting the 16 filters after concatenation?