Vision Transformer - Dog Breed Classifier

About This Project

This project fine-tunes a Vision Transformer (ViT) on the Stanford Dogs dataset for 120-class breed classification. The model is deployed as a Gradio app on Hugging Face Spaces, where users can upload a photo and view the top predicted breeds with confidence scores.
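
The "top predicted breeds with confidence scores" step can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `top_k_breeds`, the example logits, and the three breed labels are all hypothetical, and plain Python stands in for the PyTorch tensor operations the real app presumably uses. It assumes the model emits one raw score (logit) per breed, converts those scores to probabilities with a softmax, and keeps the highest-ranked entries.

```python
import math


def top_k_breeds(logits, labels, k=3):
    """Map the k most likely labels to their softmax confidences.

    Hypothetical helper: `logits` is a list of raw model scores and
    `labels` is the matching list of breed names.
    """
    if len(logits) != len(labels):
        raise ValueError("logits and labels must have the same length")

    # Subtract the largest logit before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank class indices by probability, highest first, and keep the top k.
    ranked = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)[:k]
    return {labels[i]: probs[i] for i in ranked}


# Illustrative values only; a real model would produce 120 logits.
scores = top_k_breeds([2.0, 0.5, 1.0], ["beagle", "pug", "collie"], k=2)
print(scores)
```

A dictionary of label to confidence is a convenient return shape here because Gradio's `Label` output component accepts that form and renders it as a ranked list of classes.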

Key Features

ViT fine-tuning on Stanford Dogs
120-class breed classification
Gradio app on Hugging Face Spaces

Technologies

PyTorch
Computer Vision
Hugging Face