Skip to yearly menu bar Skip to main content


Poster

Flaws of ImageNet, Computer Vision's Favourite Dataset

Nikita Kisel · Illia Volkov · Kateřina Hanzelková · Klara Janouskova · Jiri Matas

Hall 3 + Hall 2B #60
[ ] [ Project Page ]
Fri 25 Apr midnight PDT — 2:30 a.m. PDT

Abstract:

Since its release, ImageNet-1k dataset has become a gold standard for evaluating model performance. It has served as the foundation for numerous other datasets and training tasks in computer vision. As models have improved in accuracy, issues related to label correctness have become increasingly apparent. In this blog post, we analyze the issues in the ImageNet-1k dataset, including incorrect labels, overlapping or ambiguous class definitions, training-evaluation domain shifts, and image duplicates. The solutions for some problems are straightforward. For others, we hope to start a broader conversation about refining this influential dataset to better serve future research.

Live content is unavailable. Log in and register to view live content