Workshop
Deep Learning for Code
Torsten Scholak · Gabriel Orlanski · Disha Shrivastava · Arun Raja · Dzmitry Bahdanau · Jonathan Herzig
Fri 29 Apr, 5 a.m. PDT
An exciting application area of machine learning and deep learning methods is completion, repair, synthesis, and automatic explanation of program code. This field has received a fair amount of attention in the last decade, yet arguably the recent application of large scale language modelling techniques to the domain of code holds a tremendous promise to completely revolutionize this area. The new large pretrained models excel at completing code and synthesizing code from natural language descriptions; they work across a wide range of domains, tasks, and programming languages. The excitement about new possibilities is spurring tremendous interest in both industry and academia. Yet, we are just beginning to explore the potential of large-scale deep learning for code, and state-of-the-art models still struggle with correctness and generalization. This calls for platforms to exchange ideas and discuss the challenges in this line of work. Deep Learning for Code (DL4C) is a workshop that will provide a platform for researchers to share their work on deep learning for code.DL4C welcomes researchers interested in a number of topics, including but not limited to: AI code assistants, representations and model architectures for code, pretraining methods, methods for producing code from natural language, static code analysis and evaluation of deep learning for code techniques.
Schedule
Fri 5:00 a.m. - 5:15 a.m.
|
Opening Remarks
(
Announcement
)
>
|
🔗 |
Fri 5:15 a.m. - 6:00 a.m.
|
Deep Learning Models for Bug Detection and Repair
(
Invited Talk
)
>
SlidesLive Video |
Miltiadis Allamanis 🔗 |
Fri 6:00 a.m. - 6:45 a.m.
|
Learning to Program by Learning to Read
(
Invited Talk
)
>
|
Jacob Andreas 🔗 |
Fri 6:45 a.m. - 7:00 a.m.
|
Coffee Break
|
🔗 |
Fri 7:00 a.m. - 7:10 a.m.
|
Learning to Superoptimize Real-World Programs
(
Best Paper Spotlight
)
>
|
Alexander Shypula · Pengcheng Yin · Jeremy Lacomis · Claire Le Goues · Edward Schwartz · Graham Neubig 🔗 |
Fri 7:10 a.m. - 7:20 a.m.
|
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
(
Spotlight
)
>
SlidesLive Video |
Nadezhda Chirkova · Sergei Troshin 🔗 |
Fri 7:20 a.m. - 7:30 a.m.
|
NS3: Neuro-Symbolic Semantic Code Search
(
Spotlight
)
>
|
Shushan Arakelyan · Anna Hakhverdyan · Miltiadis Allamanis · Christophe Hauser · Luis Garcia · Xiang Ren 🔗 |
Fri 7:30 a.m. - 8:15 a.m.
|
In-IDE Code Generation from Natural Language: Promise and Challenges
(
Invited Talk
)
>
|
Graham Neubig 🔗 |
Fri 8:15 a.m. - 9:00 a.m.
|
Competitive Programming with AlphaCode
(
Invited Talk
)
>
|
David Choi · Yujia Li 🔗 |
Fri 9:00 a.m. - 10:00 a.m.
|
Lunch Break
|
🔗 |
Fri 10:00 a.m. - 11:00 a.m.
|
Panel Discussion
(
Discussion Panel
)
>
|
Miltiadis Allamanis · Jacob Andreas · Graham Neubig · David Choi · Yujia Li · Jerry Tworek · Xinyun Chen 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Code Summarization: Do Transformers Really Understand Code? ( Poster ) > link | Ankita Sontakke · Manasi Patwardhan · Lovekesh Vig · Raveendra Kumar Medicherla · Ravindra Naik · Gautam Shroff 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Patch Generation with Language Models: Feasibility and Scaling Behavior ( Poster ) > link | Sophia Kolak · Ruben Martins · Claire Le Goues · Vincent Hellendoorn 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Compositional Generalization and Decomposition in Neural Program Synthesis ( Poster ) > link | Kensen Shi · Joey Hong · Manzil Zaheer · Pengcheng Yin · Charles Sutton 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Learning to Walk over Relational Graphs of Source Code ( Poster ) > link | Pardis Pashakhanloo · Aaditya Naik · Hanjun Dai · Petros Maniatis · Mayur Naik 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Scotch: A Semantic Code Search Engine for IDEs ( Poster ) > link | Samip Dahal · Adyasha Maharana · Mohit Bansal 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Generating Programming Puzzles to Train Language Models ( Poster ) > link | Patrick Haluptzok · Matthew Bowers · Adam Tauman Kalai 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar ( Poster ) > link | Yaojie Hu · Xingjian Shi · Qiang Zhou · Lee Pike 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Neural Instruction Combiner ( Poster ) > link | sandya mannarswamy · Dibyendu Das 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
On-the-fly Discovery of Local Bugs using Inconsistency Analysis ( Poster ) > link | Srinivasan Sengamedu · Qiang Zhou · Hangqi Zhao 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
COBRA: Enhancing DNN Latency Prediction with Language Models trained on Source Code ( Poster ) > link | Robin Zbinden · Lukas Mauch · Fabien Cardinaux 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Code Editing from Few Exemplars by Adaptive Multi-Extent Composition ( Poster ) > link | Peizhao Li · Xuchao Zhang · Ziyu Yao · Wei Cheng · Haifeng Chen · Hongfu Liu 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
A Systematic Evaluation of Large Language Models of Code ( Poster ) > link | Frank F Xu · Uri Alon · Graham Neubig · Vincent Hellendoorn 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions ( Poster ) > link | David Bieber · Rishab Goel · Daniel F Zheng · Hugo Larochelle · Danny Tarlow 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Show Your Work: Scratchpads for Intermediate Computation with Language Models ( Poster ) > link |
12 presentersMaxwell Nye · Anders J Andreassen · Guy Gur-Ari · Henryk Michalewski · Jacob Austin · David Bieber · David Dohan · Aitor Lewkowycz · Maarten Bosma · David Luan · Charles Sutton · Augustus Odena |
Fri 11:00 a.m. - 12:15 p.m.
|
ReGVD: Revisiting Graph Neural Networks for Vulnerability Detection ( Poster ) > link | Van-Anh Nguyen · Dai Quoc Nguyen · Van Nguyen · Trung Le · Quan Tran · Dinh Phung 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code ( Poster ) > link | Nadezhda Chirkova · Sergei Troshin 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
Learning to Superoptimize Real-World Programs ( Poster ) > link | Alexander Shypula · Pengcheng Yin · Jeremy Lacomis · Claire Le Goues · Edward Schwartz · Graham Neubig 🔗 |
Fri 11:00 a.m. - 12:15 p.m.
|
NS3: Neuro-Symbolic Semantic Code Search ( Poster ) > link | Shushan Arakelyan · Anna Hakhverdyan · Miltiadis Allamanis · Christophe Hauser · Luis Garcia · Xiang Ren 🔗 |
Fri 12:15 p.m. - 12:30 p.m.
|
Coffee Break
|
🔗 |
Fri 12:30 p.m. - 1:15 p.m.
|
Where generative models meet search: a brief history of recent advancements in neural program synthesis.
(
Invited Talk
)
>
|
Jerry Tworek 🔗 |
Fri 1:15 p.m. - 2:00 p.m.
|
Learning to Model Structures and Execution for Program Synthesis
(
Invited Talk
)
>
|
Xinyun Chen 🔗 |
Fri 2:00 p.m. - 2:15 p.m.
|
Closing Remarks
(
Announcement
)
>
|
🔗 |