I have encrypted text dataset and i want to classify it using neural network algorithm. I know that there is a pattern in the encrypted data. example of my input data :
diss%^ghghE(t dffd$#KL*vb xod@#:n>did ....
My questions is should i treat encrypted data as if its normal text and create vocabulary and transform my data into sequence of indices ? should i clean my data first from all the special characters ?
What i tried is i cleaned all data from special characters, then created a vocabulary and transform my data into sequences however i am getting a very low accuracy. but my model works well when my data is in natural language.
Any help is appreciated.