I am working on classification problem for Covid19, Pneumonia and healthy lung.
I have 3000 images for each class.
- Can I apply vision transformers for this image classification instead of normal CNN?
- Or is there any prerequisite for applying this? I am new to transformers.
I have tried all CNN and they have achieved 95% accuracy till now.