Skip to yearly menu bar Skip to main content


Poster

CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

ZeMing Gong · Austin Wang · Xiaoliang Huo · Joakim Bruslund Haurum · Scott C Lowe · Graham W Taylor · Angel Chang

Hall 3 + Hall 2B #18
[ ] [ Project Page ]
Fri 25 Apr 7 p.m. PDT — 9:30 p.m. PDT

Abstract:

Measuring biodiversity is crucial for understanding ecosystem health. While prior works have developed machine learning models for taxonomic classification of photographic images and DNA separately, in this work, we introduce a multi-modal approach combining both, using CLIP-style contrastive learning to align images, barcode DNA, and text-based representations of taxonomic labels in a unified embedding space. This allows for accurate classification of both known and unknown insect species without task-specific fine-tuning, leveraging contrastive learning for the first time to fuse DNA and image data. Our method surpasses previous single-modality approaches in accuracy by over 8% on zero-shot learning tasks, showcasing its effectiveness in biodiversity studies.

Live content is unavailable. Log in and register to view live content