Step by Step Implementation explained : Vision Transformer for Image Classification
Github: https://github.com/AarohiSingla/Image...
*******************************************************
For queries: You can comment in comment section or you can mail me at [email protected]
*******************************************************
In 2020, Google Brain team introduced a Transformerbased model that can be used to solve an image classification task called Vision Transformer (ViT). Its performance is very competitive in comparison with conventional CNNs on several image classification benchmarks.
Vision transformer (ViT) is a transformer used in the field of computer vision that works based on the working nature of the transformers used in the field of natural language processing.
#transformers #computervision