Workshop on Weakly Supervised Learning

Workshop

Workshop on Weakly Supervised Learning

Benjamin Roth · Barbara Plank · Alex Ratner · Katharina Kann · Dietrich Klakow · Michael Hedderich

Fri 7 May, 7 a.m. PDT

[ Abstract ] Workshop Website

Deep learning relies on massive training sets of labeled examples to learn from - often tens of thousands to millions to reach peak predictive performance. However, large amounts of training data are only available for very few standardized learning problems. Even small variations of the problem specification or changes in the data distribution would necessitate re-annotation of large amounts of data.

However, domain knowledge can often be expressed by sets of prototypical descriptions. These knowledge-based descriptions can be either used as rule-based predictors or as labeling functions for providing partial data annotations. The growing field of weak supervision provides methods for refining and generalizing such heuristic-based annotations in interaction with deep neural networks and large amounts of unannotated data.

In this workshop, we want to advance theory, methods and tools for allowing experts to express prior coded knowledge for automatic data annotations that can be used to train arbitrary deep neural networks for prediction. Learning with weak supervision is both studied from a theoretical perspective as well as applied to a variety of tasks from areas like natural language processing and computer vision. This workshop aims at bringing together researchers from this wide range of fields to facilitate discussions across research areas that share the common ground of using weak supervision. A target of this workshop is also to inspire applications of weak supervision to new scenarios and to enable researchers to work on tasks that so far have been considered too low-resource.

As weak supervision addresses one of the major issues of current machine learning techniques, the lack of labeled data, it has also started to obtain commercial interest. This workshop is an opportunity to bridge innovations from academia and the requirements of industry settings.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 7:00 a.m. - 7:10 a.m.	Introduction and Opening Remarks ( Introduction ) >	🔗
Fri 7:10 a.m. - 8:10 a.m.	Invited Speaker Dan Roth - Natural Language Understanding with Incidental Supervision ( Keynote Talk ) > SlidesLive Video	Dan Roth 🔗
Fri 8:10 a.m. - 8:25 a.m.	Invited Talk Dan Roth - Q&A ( Q&A ) >	🔗
Fri 8:25 a.m. - 9:10 a.m.	Invited Speaker Marine Carpuat - Weak Supervision for Cross-Lingual Semantic Analysis ( Keynote Talk ) > SlidesLive Video	Marine Carpuat 🔗
Fri 9:10 a.m. - 9:25 a.m.	Invited Speaker Marine Carpuat - Q&A ( Q&A ) >	🔗
Fri 9:25 a.m. - 9:40 a.m.	Dependency Structure Misspecification in Multi-Source Weak Supervision Models ( Contributed Talk ) > SlidesLive Video	Salva Rühling Cachay 🔗
Fri 9:40 a.m. - 9:50 a.m.	Dependency Structure Misspecification in Multi-Source Weak Supervision Models - Q&A ( Q&A ) >	🔗
Fri 9:50 a.m. - 9:53 a.m.	AutoTriggER: Named Entity Recognition with Auxiliary Trigger Extraction ( Poster Spotlight ) > SlidesLive Video	Dong-Ho Lee 🔗
Fri 9:53 a.m. - 9:56 a.m.	Handling Long-Tail Queries with Slice-Aware Conversational Systems ( Poster Spotlight ) > SlidesLive Video	Cheng Wang 🔗
Fri 9:56 a.m. - 9:59 a.m.	Tabular Data Modeling via Contextual Embeddings ( Poster Spotlight ) > SlidesLive Video	Xin Huang 🔗
Fri 9:59 a.m. - 10:02 a.m.	TADPOLE: Task ADapted Pre-training via anOmaLy dEtection ( Poster Spotlight ) > SlidesLive Video	Vivek Madan 🔗
Fri 10:02 a.m. - 10:05 a.m.	Active WeaSuL: Improving Weak Supervision with Active Learning ( Poster Spotlight ) > SlidesLive Video	Samantha Biegel 🔗
Fri 10:05 a.m. - 10:08 a.m.	Transformer Language Models as Universal Computation Engines ( Poster Spotlight ) > SlidesLive Video	Kevin Lu 🔗
Fri 10:15 a.m. - 11:15 a.m.	Poster Session 1 ( Poster Session ) > link Link	🔗
Fri 11:15 a.m. - 11:20 a.m.	Welcome Back ( Introduction ) >	🔗
Fri 11:20 a.m. - 11:50 a.m.	Invited Speaker Heng Ji - InfoSurgeon: Cross-media Weak Supervision for Knowledge-Element Level Fake News Detection ( Keynote Talk ) > SlidesLive Video	Heng Ji 🔗
Fri 11:50 a.m. - 12:05 p.m.	Invited Speaker Heng Ji - Q&A ( Q&A ) >	🔗
Fri 12:05 p.m. - 12:20 p.m.	Weakly Supervised Multi-task Learning for Concept-based Explainability ( Contributed Talk ) > SlidesLive Video	Vladimir Balayan 🔗
Fri 12:20 p.m. - 12:30 p.m.	Weakly Supervised Multi-task Learning for Concept-based Explainability - Q&A ( Q&A ) >	🔗
Fri 12:30 p.m. - 12:45 p.m.	Better Adaptation to Distribution Shifts with Robust Pseudo-Labeling ( Contributed Talk ) > SlidesLive Video	Evgenia Rusak 🔗
Fri 12:45 p.m. - 12:55 p.m.	Better Adaptation to Distribution Shifts with Robust Pseudo-Labeling - Q&A ( Q&A ) >	🔗
Fri 12:55 p.m. - 12:58 p.m.	Using system context information to complement weakly labeled data ( Poster Spotlight ) > SlidesLive Video	Matthias Meyer 🔗
Fri 12:58 p.m. - 1:01 p.m.	CIGMO: Learning categorical invariant deep generative models from grouped data ( Poster Spotlight ) > SlidesLive Video	Haruo Hosoya 🔗
Fri 1:01 p.m. - 1:04 p.m.	Pre-Training by Completing Points Cloud ( Poster Spotlight ) > SlidesLive Video	Hanchen Wang 🔗
Fri 1:04 p.m. - 1:07 p.m.	Weakly-Supervised Group Disentanglement using Total Correlation ( Poster Spotlight ) > SlidesLive Video	Linh Tran 🔗
Fri 1:07 p.m. - 1:10 p.m.	Is Disentanglement all you need? Comparing Concept-based & Disentanglement Approaches ( Poster Spotlight ) > SlidesLive Video	Dmitry Kazhdan 🔗
Fri 1:07 p.m. - 1:10 p.m.	Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks ( Poster Spotlight ) > SlidesLive Video	Curtis G Northcutt 🔗
Fri 1:20 p.m. - 2:20 p.m.	Poster Session 2 ( Poster Session ) > link Link	🔗
Fri 2:20 p.m. - 2:25 p.m.	Welcome Back ( Introduction ) >	🔗
Fri 2:25 p.m. - 3:10 p.m.	Invited Speaker Lu Jiang - Robust Deep Learning and Applications ( Keynote Talk ) > SlidesLive Video	Lu Jiang 🔗
Fri 3:10 p.m. - 3:25 p.m.	Invited Speaker Lu Jiang - Q&A ( Q&A ) >	🔗
Fri 3:25 p.m. - 3:55 p.m.	Invited Speaker Paroma Varma - Snorkel: Programmatically Labeling Training Data ( Keynote Talk ) > SlidesLive Video	Paroma Varma 🔗
Fri 3:55 p.m. - 4:10 p.m.	Invited Speaker Paroma Varma - Q&A ( Q&A ) >	🔗
Fri 4:10 p.m. - 5:10 p.m.	Panel Discussion ( Panel ) >	🔗
Fri 5:10 p.m. - 5:25 p.m.	Concluding Remarks ( Conclusion ) >	🔗
Fri 5:25 p.m. - 6:00 p.m.	Post Workshop Hangout ( Gather.Town ) > link Link	🔗