Vision Transformer - Dog Breed Classifier


About This Project

A Vision Transformer fine-tuned on the Stanford Dogs dataset for 120-class breed classification. The model is deployed as a Gradio app on Hugging Face Spaces, where users can upload a photo and view the top predicted breeds with confidence scores.
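The "top predicted breeds with confidence scores" step can be sketched as a softmax over the classifier's raw outputs followed by a top-k selection. This is a minimal illustration, not the project's actual code: the function name `top_k_breeds`, the default `k=3`, and the example logits and labels are all assumptions made for demonstration, and it uses plain Python rather than PyTorch so it runs without extra dependencies.

```python
import math


def top_k_breeds(logits, labels, k=3):
    """Return the k most likely labels mapped to softmax confidences.

    `logits` are raw classifier scores, one per label (hypothetical input;
    the real model would produce 120 of them, one per breed).
    """
    if len(logits) != len(labels):
        raise ValueError("logits and labels must have the same length")

    # Subtract the largest logit before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Sort label/probability pairs from most to least confident, keep the top k.
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return dict(ranked[:k])


# Illustrative logits for four made-up outputs, not real model predictions.
example = top_k_breeds(
    [2.0, 0.5, 1.0, -1.0],
    ["beagle", "pug", "collie", "husky"],
    k=2,
)
print(example)
```

A dictionary mapping label to confidence is the shape that Gradio's `Label` output component accepts, so a function like this could plausibly sit between the model and the interface. Whether the deployed app is wired this way is not stated here.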

Key Features

  • ViT fine-tuning on Stanford Dogs
  • 120-class breed classification
  • Gradio app on Hugging Face Spaces

Technologies

  • PyTorch
  • Computer Vision
  • Hugging Face