I'm working on character recognition using dlib on linux. When I train my network on 1000 characters the network converges until it reaches 100% of accuracy but when I try training it with 10,000 or 100,000 characters it is not converging anymore to 100% accuracy. My learning rate is still decreasing but the amount of wrong prediction is not changing. The network I'm using is the following
using net_type = loss_multiclass_log<
dlib::fc<nbClassConst,
dlib::relu<dlib::fc<120,
dlib::relu<dlib::fc<400,
dlib::max_pool<2, 2, 2, 2, dlib::relu<con<16, 5, 5, 1, 1,
dlib::max_pool<2, 2, 2, 2, dlib::relu<con<6, 5, 5, 1, 1,
dlib::input<matrix<unsigned char>>
>>>>>>>>>>>>;
It is based on the dnn_introduction2_ex.cpp example from dlib. I don't know if I have to play with some parameters of the network or if I this network is just not adapted to what I want to do. I would appreciate any suggestion or help.