Topic Keywords

[ $\ell_1$ norm ] [ $f-$divergence ] [ 3D Convolution ] [ 3D deep learning ] [ 3D generation ] [ 3d point cloud ] [ 3D Reconstruction ] [ 3D scene understanding ] [ 3D shape representations ] [ 3D shapes learning ] [ 3D vision ] [ 3D Vision ] [ abstract reasoning ] [ abstract rules ] [ Acceleration ] [ accuracy ] [ acoustic condition modeling ] [ Action localization ] [ action recognition ] [ activation maximization ] [ activation strategy. ] [ Active learning ] [ Active Learning ] [ AdaBoost ] [ adaptive heavy-ball methods ] [ Adaptive Learning ] [ adaptive methods ] [ adaptive optimization ] [ ADMM ] [ Adversarial Accuracy ] [ Adversarial Attack ] [ Adversarial Attacks ] [ adversarial attacks/defenses ] [ Adversarial computer programs ] [ Adversarial Defense ] [ Adversarial Example Detection ] [ Adversarial Examples ] [ Adversarial Learning ] [ Adversarial Machine Learning ] [ adversarial patch ] [ Adversarial robustness ] [ Adversarial Robustness ] [ Adversarial training ] [ Adversarial Training ] [ Adversarial Transferability ] [ aesthetic assessment ] [ affine parameters ] [ age estimation ] [ Aggregation Methods ] [ AI for earth science ] [ ALFRED ] [ Algorithm ] [ algorithmic fairness ] [ Algorithmic fairness ] [ Algorithms ] [ alignment ] [ alignment of semantic and visual space ] [ amortized inference ] [ Analogies ] [ annotation artifacts ] [ anomaly-detection ] [ Anomaly detection with deep neural networks ] [ anonymous walk ] [ appearance transfer ] [ approximate constrained optimization ] [ approximation ] [ Approximation ] [ Architectures ] [ argoverse ] [ Artificial Integlligence ] [ ASR ] [ assistive technology ] [ associative memory ] [ Associative Memory ] [ asynchronous parallel algorithm ] [ Atari ] [ Attention ] [ Attention Mechanism ] [ Attention Modules ] [ attractors ] [ attributed walks ] [ Auction Theory ] [ audio understanding ] [ Audio-Visual ] [ audio visual learning ] [ audio-visual representation ] [ audio-visual representation learning ] [ Audio-visual sound separation ] [ audiovisual synthesis ] [ augmented deep reinforcement learning ] [ autodiff ] [ Autoencoders ] [ automated data augmentation ] [ automated machine learning ] [ automatic differentiation ] [ AutoML ] [ autonomous learning ] [ autoregressive language model ] [ Autoregressive Models ] [ AutoRL ] [ auxiliary information ] [ auxiliary latent variable ] [ Auxiliary Learning ] [ auxiliary task ] [ Average-case Analysis ] [ aversarial examples ] [ avoid knowledge leaking ] [ backdoor attack ] [ Backdoor Attacks ] [ Backdoor Defense ] [ Backgrounds ] [ backprop ] [ back translation ] [ backward error analysis ] [ bagging ] [ batchnorm ] [ Batch Normalization ] [ batch reinforcement learning ] [ Batch Reinforcement Learning ] [ batch selection ] [ Bayesian ] [ Bayesian classification ] [ Bayesian inference ] [ Bayesian Inference ] [ Bayesian networks ] [ Bayesian Neural Networks ] [ behavior cloning ] [ belief-propagation ] [ Benchmark ] [ benchmarks ] [ benign overfitting ] [ bert ] [ BERT ] [ beta-VAE ] [ better generalization ] [ biased sampling ] [ biases ] [ Bias in Language Models ] [ bidirectional ] [ bilevel optimization ] [ Bilinear games ] [ Binary Embeddings ] [ Binary Neural Networks ] [ binaural audio ] [ binaural speech ] [ biologically plausible ] [ Biometrics ] [ bisimulation ] [ Bisimulation ] [ bisimulation metrics ] [ bit-flip ] [ bit-level sparsity ] [ blind denoising ] [ blind spots ] [ block mdp ] [ boosting ] [ bottleneck ] [ bptt ] [ branch and bound ] [ Brownian motion ] [ Budget-Aware Pruning ] [ Budget constraints ] [ Byzantine resilience ] [ Byzantine SGD ] [ CAD modeling ] [ calibration ] [ Calibration ] [ calibration measure ] [ cancer research ] [ Capsule Networks ] [ Catastrophic forgetting ] [ Catastrophic Forgetting ] [ Causal Inference ] [ Causality ] [ Causal network ] [ certificate ] [ certified defense ] [ Certified Robustness ] [ challenge sets ] [ change of measure ] [ change point detection ] [ channel suppressing ] [ Channel Tensorization ] [ Channel-Wise Approximated Activation ] [ Chaos ] [ chebyshev polynomial ] [ checkpointing ] [ Checkpointing ] [ chemistry ] [ CIFAR ] [ Classification ] [ class imbalance ] [ clean-label ] [ Clustering ] [ Clusters ] [ CNN ] [ CNNs ] [ Code Compilation ] [ Code Representations ] [ Code Structure ] [ code summarization ] [ Code Summarization ] [ Cognitively-inspired Learning ] [ cold posteriors ] [ collaborative learning ] [ Combinatorial optimization ] [ common object counting ] [ commonsense question answering ] [ Commonsense Reasoning ] [ Communication Compression ] [ co-modulation ] [ complete verifiers ] [ complex query answering ] [ Composition ] [ compositional generalization ] [ compositional learning ] [ compositional task ] [ Compressed videos ] [ Compressing Deep Networks ] [ Compression ] [ computation ] [ computational biology ] [ Computational Biology ] [ computational complexity ] [ Computational imaging ] [ Computational neuroscience ] [ Computational resources ] [ computer graphics ] [ Computer Vision ] [ concentration ] [ Concentration of Measure ] [ Concept-based Explanation ] [ concept drift ] [ Concept Learning ] [ conditional expectation ] [ Conditional GANs ] [ Conditional Generation ] [ Conditional generative adversarial networks ] [ conditional layer normalization ] [ Conditional Neural Processes ] [ Conditional Risk Minimization ] [ Conditional Sampling ] [ conditional text generation ] [ Conferrability ] [ confidentiality ] [ conformal inference ] [ conformal prediction ] [ conjugacy ] [ conservation law ] [ consistency ] [ consistency training ] [ Consistency Training ] [ constellation models ] [ constrained beam search ] [ Constrained optimization ] [ constrained RL ] [ constraints ] [ constraint satisfaction ] [ contact tracing ] [ Contextual Bandits ] [ Contextual embedding space ] [ Continual learning ] [ Continual Learning ] [ continuation method ] [ continuous and scalar conditions ] [ continuous case ] [ Continuous Control ] [ continuous convolution ] [ continuous games ] [ continuous normalizing flow ] [ continuous time ] [ Continuous-time System ] [ continuous treatment effect ] [ contrastive divergence ] [ Contrastive learning ] [ Contrastive Learning ] [ Contrastive Methods ] [ contrastive representation learning ] [ control barrier function ] [ controlled generation ] [ Controlled NLG ] [ Convergence ] [ Convergence Analysis ] [ convex duality ] [ Convex optimization ] [ ConvNets ] [ convolutional kernel methods ] [ Convolutional Layer ] [ convolutional models ] [ Convolutional Networks ] [ copositive programming ] [ corruptions ] [ COST ] [ Counterfactual inference ] [ counterfactuals ] [ Counterfactuals ] [ covariant neural networks ] [ covid-19 ] [ COVID-19 ] [ Cross-domain ] [ cross-domain few-shot learning ] [ cross-domain video generation ] [ cross-episode attention ] [ cross-fitting ] [ cross-lingual pretraining ] [ Cryptographic inference ] [ cultural transmission ] [ Curriculum Learning ] [ curse of memory ] [ curvature estimates ] [ custom voice ] [ cycle-consistency regularization ] [ cycle-consistency regularizer ] [ DAG ] [ DARTS stability ] [ Data augmentation ] [ Data Augmentation ] [ data cleansing ] [ Data-driven modeling ] [ data-efficient learning ] [ data-efficient RL ] [ Data Flow ] [ data labeling ] [ data parallelism ] [ Data Poisoning ] [ Data Protection ] [ Dataset ] [ dataset bias ] [ dataset compression ] [ dataset condensation ] [ dataset corruption ] [ dataset distillation ] [ dataset summarization ] [ data structures ] [ debiased training ] [ debugging ] [ Decentralized Optimization ] [ decision boundary geometry ] [ decision trees ] [ declarative knowledge ] [ deep-anomaly-detection ] [ Deep Architectures ] [ Deep denoising priors ] [ deep embedding ] [ Deep Ensembles ] [ deep equilibrium models ] [ Deep Equilibrium Models ] [ Deepfake ] [ deep FBSDEs ] [ Deep Gaussian Processes ] [ Deep generative model ] [ Deep generative modeling ] [ Deep generative models ] [ deeplearning ] [ Deep learning ] [ Deep Learning ] [ deep learning dynamics ] [ Deep Learning Theory ] [ deep network training ] [ deep neural network ] [ deep neural networks. ] [ Deep Neural Networks ] [ deep one-class classification ] [ deep Q-learning ] [ Deep reinforcement learning ] [ Deep Reinforcement Learning ] [ deep ReLU networks ] [ Deep residual neural networks ] [ deep RL ] [ deep sequence model ] [ deepset ] [ Deep Sets ] [ Deformation Modeling ] [ delay ] [ Delay differential equations ] [ denoising score matching ] [ Dense Retrieval ] [ Density estimation ] [ Density Estimation ] [ Density ratio estimation ] [ dependency based method ] [ deployment-efficiency ] [ depression ] [ depth separation ] [ descent ] [ description length ] [ determinantal point processes ] [ Device Placement ] [ dialogue state tracking ] [ differentiable optimization ] [ Differentiable physics ] [ Differentiable Physics ] [ Differentiable program generator ] [ differentiable programming ] [ Differentiable rendering ] [ Differentiable simulation ] [ differential dynamica programming ] [ differential equations ] [ Differential Geometry ] [ differentially private deep learning ] [ Differential Privacy ] [ diffusion probabilistic models ] [ diffusion process ] [ dimension ] [ Directed Acyclic Graphs ] [ Dirichlet form ] [ Discrete Optimization ] [ discretization error ] [ disentangled representation learning ] [ Disentangled representation learning ] [ Disentanglement ] [ distance ] [ Distillation ] [ distinct elements ] [ Distributed ] [ distributed deep learning ] [ distributed inference ] [ Distributed learning ] [ distributed machine learning ] [ Distributed ML ] [ Distributed Optimization ] [ distributional robust optimization ] [ distribution estimation ] [ distribution shift ] [ diverse strategies ] [ diverse video generation ] [ Diversity denoising ] [ Diversity Regularization ] [ DNN ] [ DNN compression ] [ document analysis ] [ document classification ] [ document retrieval ] [ domain adaptation theory ] [ Domain Adaption ] [ Domain Generalization ] [ domain randomization ] [ Domain Translation ] [ double descent ] [ Double Descent ] [ doubly robustness ] [ Doubly-weighted Laplace operator ] [ Dropout ] [ drug discovery ] [ Drug discovery ] [ dst ] [ Dual-mode ASR ] [ Dueling structure ] [ Dynamical Systems ] [ dynamic computation graphs ] [ dynamics ] [ dynamics prediction ] [ dynamic systems ] [ Early classification ] [ Early pruning ] [ early stopping ] [ EBM ] [ Edit ] [ EEG ] [ effective learning rate ] [ Efficiency ] [ Efficient Attention Mechanism ] [ efficient deep learning ] [ Efficient Deep Learning ] [ Efficient Deep Learning Inference ] [ Efficient ensembles ] [ efficient inference ] [ efficient inference methods ] [ Efficient Inference Methods ] [ EfficientNets ] [ efficient network ] [ Efficient Networks ] [ Efficient training ] [ Efficient Training ] [ efficient training and inference. ] [ egocentric ] [ eigendecomposition ] [ Eigenspectrum ] [ ELBO ] [ electroencephalography ] [ EM ] [ Embedding Models ] [ Embedding Size ] [ Embodied Agents ] [ embodied vision ] [ emergent behavior ] [ empirical analysis ] [ Empirical Game Theory ] [ empirical investigation ] [ Empirical Investigation ] [ empirical study ] [ empowerment ] [ Encoder layer fusion ] [ end-to-end entity linking ] [ End-to-End Object Detection ] [ Energy ] [ Energy-Based GANs ] [ energy based model ] [ energy-based model ] [ Energy-based model ] [ energy based models ] [ Energy-based Models ] [ Energy Based Models ] [ Energy-Based Models ] [ Energy Score ] [ ensemble ] [ Ensemble ] [ ensemble learning ] [ ensembles ] [ Ensembles ] [ entity disambiguation ] [ entity linking ] [ entity retrieval ] [ entropic algorithms ] [ Entropy Maximization ] [ Entropy Model ] [ entropy regularization ] [ epidemiology ] [ episode-level pretext task ] [ episodic training ] [ equilibrium ] [ equivariant ] [ equivariant neural network ] [ ERP ] [ Evaluation ] [ evaluation of interpretability ] [ Event localization ] [ evolution ] [ Evolutionary algorithm ] [ Evolutionary Algorithm ] [ Evolutionary Algorithms ] [ Excess risk ] [ experience replay buffer ] [ experimental evaluation ] [ Expert Models ] [ Explainability ] [ explainable ] [ Explainable AI ] [ Explainable Model ] [ explaining decision-making ] [ explanation method ] [ explanations ] [ Explanations ] [ Exploration ] [ Exponential Families ] [ exponential tilting ] [ exposition ] [ external memory ] [ Extrapolation ] [ extremal sector ] [ facial recognition ] [ factor analysis ] [ factored MDP ] [ Factored MDP ] [ fairness ] [ Fairness ] [ faithfulness ] [ fast DNN inference ] [ fast learning rate ] [ fast-mapping ] [ fast weights ] [ FAVOR ] [ Feature Attribution ] [ feature propagation ] [ features ] [ feature visualization ] [ Feature Visualization ] [ Federated learning ] [ Federated Learning ] [ Few Shot ] [ few-shot concept learning ] [ few-shot domain generalization ] [ Few-shot learning ] [ Few Shot Learning ] [ fine-tuning ] [ finetuning ] [ Fine-tuning ] [ Finetuning ] [ fine-tuning stability ] [ Fingerprinting ] [ First-order Methods ] [ first-order optimization ] [ fisher ratio ] [ flat minima ] [ Flexibility ] [ flow graphs ] [ Fluid Dynamics ] [ Follow-the-Regularized-Leader ] [ Formal Verification ] [ forward mode ] [ Fourier Features ] [ Fourier transform ] [ framework ] [ Frobenius norm ] [ from-scratch ] [ frontend ] [ fruit fly ] [ fully-connected ] [ Fully-Connected Networks ] [ future frame generation ] [ future link prediction ] [ fuzzy tiling activation function ] [ Game Decomposition ] [ Game Theory ] [ GAN ] [ GAN compression ] [ GANs ] [ Garbled Circuits ] [ Gaussian Copula ] [ Gaussian Graphical Model ] [ Gaussian Isoperimetric Inequality ] [ Gaussian mixture model ] [ Gaussian process ] [ Gaussian Process ] [ Gaussian Processes ] [ gaussian process priors ] [ GBDT ] [ generalisation ] [ Generalization ] [ Generalization Bounds ] [ generalization error ] [ Generalization Measure ] [ Generalization of Reinforcement Learning ] [ generalized ] [ generalized Girsanov theorem ] [ Generalized PageRank ] [ Generalized zero-shot learning ] [ Generation ] [ Generative Adversarial Network ] [ Generative Adversarial Networks ] [ generative art ] [ Generative Flow ] [ Generative Model ] [ Generative modeling ] [ Generative Modeling ] [ generative modelling ] [ Generative Modelling ] [ Generative models ] [ Generative Models ] [ genetic programming ] [ Geodesic-Aware FC Layer ] [ geometric ] [ Geometric Deep Learning ] [ G-invariance regularization ] [ global ] [ global optima ] [ Global Reference ] [ glue ] [ GNN ] [ GNNs ] [ goal-conditioned reinforcement learning ] [ goal-conditioned RL ] [ goal reaching ] [ gradient ] [ gradient alignment ] [ Gradient Alignment ] [ gradient boosted decision trees ] [ gradient boosting ] [ gradient decomposition ] [ Gradient Descent ] [ gradient descent-ascent ] [ gradient flow ] [ Gradient flow ] [ gradient flows ] [ gradient redundancy ] [ Gradient stability ] [ Grammatical error correction ] [ Granger causality ] [ Graph ] [ graph classification ] [ graph coarsening ] [ Graph Convolutional Network ] [ Graph Convolutional Neural Networks ] [ graph edit distance ] [ Graph Generation ] [ Graph Generative Model ] [ graph-level prediction ] [ graph networks ] [ Graph neural network ] [ Graph Neural Network ] [ Graph neural networks ] [ Graph Neural Networks ] [ Graph pooling ] [ graph representation learning ] [ Graph representation learning ] [ Graph Representation Learning ] [ graph shift operators ] [ graph-structured data ] [ graph structure learning ] [ Greedy Learning ] [ grid cells ] [ grounding ] [ group disparities ] [ group equivariance ] [ Group Equivariance ] [ Group Equivariant Convolution ] [ group equivariant self-attention ] [ group equivariant transformers ] [ group sparsity ] [ Group-supervised learning ] [ gumbel-softmax ] [ Hamiltonian systems ] [ hard-label attack ] [ hard negative mining ] [ hard negative sampling ] [ Hardware-Aware Neural Architecture Search ] [ Harmonic Analysis ] [ harmonic distortion analysis ] [ healthcare ] [ Healthcare ] [ heap allocation ] [ Hessian matrix ] [ Heterogeneity ] [ Heterogeneous ] [ heterogeneous data ] [ Heterogeneous data ] [ Heterophily ] [ heteroscedasticity ] [ heuristic search ] [ hidden-parameter mdp ] [ hierarchical contrastive learning ] [ Hierarchical Imitation Learning ] [ Hierarchical Multi-Agent Learning ] [ Hierarchical Networks ] [ Hierarchical Reinforcement Learning ] [ Hierarchy-Aware Classification ] [ high-dimensional asymptotics ] [ high-dimensional statistic ] [ high-resolution video generation ] [ hindsight relabeling ] [ histogram binning ] [ historical color image classification ] [ HMC ] [ homomorphic encryption ] [ Homophily ] [ Hopfield layer ] [ Hopfield networks ] [ Hopfield Networks ] [ human-AI collaboration ] [ human cognition ] [ human-computer interaction ] [ human preferences ] [ human psychophysics ] [ humans in the loop ] [ hybrid systems ] [ Hyperbolic ] [ hyperbolic deep learning ] [ Hyperbolic Geometry ] [ hypercomplex representation learning ] [ hypergradients ] [ Hypernetworks ] [ hyperparameter ] [ Hyperparameter Optimization ] [ Hyper-Parameter Optimization ] [ HYPERPARAMETER OPTIMIZATION ] [ Image Classification ] [ image completion ] [ Image compression ] [ Image Editing ] [ Image Generation ] [ Image manipulation ] [ Image Modeling ] [ ImageNet ] [ image reconstruction ] [ Image segmentation ] [ Image Synthesis ] [ image-to-action learning ] [ Image-to-Image Translation ] [ image translation ] [ image warping ] [ imbalanced learning ] [ Imitation Learning ] [ Impartial Learning ] [ implicit bias ] [ Implicit Bias ] [ Implicit Deep Learning ] [ implicit differentiation ] [ implicit functions ] [ implicit neural representations ] [ Implicit Neural Representations ] [ Implicit Representation ] [ Importance Weighting ] [ impossibility ] [ incoherence ] [ Incompatible Environments ] [ Incremental Tree Transformations ] [ independent component analysis ] [ indirection ] [ Individual mediation effects ] [ Inductive Bias ] [ inductive biases ] [ inductive representation learning ] [ infinitely wide neural network ] [ Infinite-Width Limit ] [ infinite-width networks ] [ influence functions ] [ Influence Functions ] [ Information bottleneck ] [ Information Bottleneck ] [ Information Geometry ] [ information-theoretical probing ] [ Information theory ] [ Information Theory ] [ Initialization ] [ input-adaptive multi-exit neural networks ] [ input convex neural networks ] [ input-convex neural networks ] [ InstaHide ] [ Instance adaptation ] [ instance-based label noise ] [ Instance learning ] [ Instance-wise Learning ] [ Instrumental Variable Regression ] [ integral probability metric ] [ intention ] [ interaction networks ] [ Interactions ] [ interactive fiction ] [ Internet of Things ] [ Interpolation Peak ] [ Interpretability ] [ interpretable latent representation ] [ Interpretable Machine Learning ] [ interpretable policy learning ] [ in-the-wild data ] [ Intrinsically Motivated Reinforcement Learning ] [ Intrinsic Motivation ] [ intrinsic motivations ] [ Intrinsic Reward ] [ Invariance and Equivariance ] [ invariance penalty ] [ invariances ] [ Invariant and equivariant deep networks ] [ Invariant Representations ] [ invariant risk minimization ] [ Invariant subspaces ] [ inverse graphics ] [ Inverse reinforcement learning ] [ Inverse Reinforcement Learning ] [ Inverted Index ] [ irl ] [ IRM ] [ irregularly spaced time series ] [ irregular-observed data modelling ] [ isometric ] [ Isotropy ] [ iterated learning ] [ iterative training ] [ JEM ] [ Johnson-Lindenstrauss Transforms ] [ kernel ] [ Kernel Learning ] [ kernel method ] [ kernel-ridge regression ] [ kernels ] [ keypoint localization ] [ Knowledge distillation ] [ Knowledge Distillation ] [ Knowledge factorization ] [ Knowledge Graph Reasoning ] [ knowledge uncertainty ] [ Kullback-Leibler divergence ] [ Kurdyka-Łojasiewicz geometry ] [ label noise robustness ] [ Label Representation ] [ Label shift ] [ label smoothing ] [ Langevin dynamics ] [ Langevin sampling ] [ Language Grounding ] [ Language Model ] [ Language modeling ] [ Language Modeling ] [ Language Modelling ] [ Language Model Pre-training ] [ language processing ] [ language-specific modeling ] [ Laplace kernel ] [ Large-scale ] [ Large-scale Deep Learning ] [ large scale learning ] [ Large-scale Machine Learning ] [ large-scale pre-trained language models ] [ large-scale training ] [ large vocabularies ] [ Last-iterate Convergence ] [ Latency-aware Neural Architecture Search ] [ Latent Simplex ] [ latent space of GANs ] [ Latent Variable Models ] [ lattices ] [ Layer order ] [ layerwise sparsity ] [ learnable ] [ learned algorithms ] [ Learned compression ] [ learned ISTA ] [ Learning ] [ learning action representations ] [ learning-based ] [ learning dynamics ] [ Learning Dynamics ] [ Learning in Games ] [ learning mechanisms ] [ Learning physical laws ] [ Learning Theory ] [ Learning to Hash ] [ learning to optimize ] [ Learning to Optimize ] [ learning to rank ] [ Learning to Rank ] [ learning to teach ] [ learning with noisy labels ] [ Learning with noisy labels ] [ library ] [ lifelong ] [ Lifelong learning ] [ Lifelong Learning ] [ lifted inference ] [ likelihood-based models ] [ likelihood-free inference ] [ limitations ] [ limited data ] [ linear bandits ] [ Linear Convergence ] [ linear estimator ] [ Linear Regression ] [ linear terms ] [ linformer ] [ Lipschitz constants ] [ Lipschitz constrained networks ] [ Local Explanations ] [ locality sensitive hashing ] [ Locally supervised training ] [ local Rademacher complexity ] [ log-concavity ] [ Logic ] [ Logic Rules ] [ logsignature ] [ Long-Tailed Recognition ] [ long-tail learning ] [ Long-term dependencies ] [ long-term prediction ] [ long-term stability ] [ loss correction ] [ Loss function search ] [ Loss Function Search ] [ lossless source compression ] [ Lottery Ticket ] [ Lottery Ticket Hypothesis ] [ lottery tickets ] [ low-dimensional structure ] [ lower bound ] [ lower bounds ] [ Low-latency ASR ] [ low precision training ] [ low rank ] [ low-rank approximation ] [ low-rank tensors ] [ L-smoothness ] [ LSTM ] [ Lyapunov Chaos ] [ Machine learning ] [ Machine Learning ] [ machine learning for code ] [ Machine Learning for Robotics ] [ Machine Learning (ML) for Programming Languages (PL)/Software Engineering (SE) ] [ machine learning systems ] [ Machine translation ] [ Machine Translation ] [ magnitude-based pruning ] [ Manifold clustering ] [ Manifolds ] [ Many-task ] [ mapping ] [ Markov chain Monte Carlo ] [ Markov Chain Monte Carlo ] [ Markov jump process ] [ Masked Reconstruction ] [ mathematical reasoning ] [ Matrix and Tensor Factorization ] [ matrix completion ] [ matrix decomposition ] [ Matrix Factorization ] [ max-margin ] [ MCMC ] [ MCMC sampling ] [ mean estimation ] [ mean-field dynamics ] [ mean separation ] [ Mechanism Design ] [ medical time series ] [ mel-filterbanks ] [ memorization ] [ Memorization ] [ Memory ] [ memory efficient ] [ memory efficient training ] [ Memory Mapping ] [ memory optimized training ] [ Memory-saving ] [ mesh ] [ Message Passing ] [ Message Passing GNNs ] [ meta-gradients ] [ Meta-learning ] [ Meta Learning ] [ Meta-Learning ] [ Metric Surrogate ] [ minimax optimal rate ] [ Minimax Optimization ] [ minimax risk ] [ Minmax ] [ min-max optimization ] [ mirror-prox ] [ Missing Data Inference ] [ Missing value imputation ] [ Missing Values ] [ misssing data ] [ mixed precision ] [ Mixed Precision ] [ Mixed-precision quantization ] [ mixture density nets ] [ mixture of experts ] [ mixup ] [ Mixup ] [ MixUp ] [ MLaaS ] [ MoCo ] [ Model Attribution ] [ model-based control ] [ model-based learning ] [ Model-based Reinforcement Learning ] [ Model-Based Reinforcement Learning ] [ model-based RL ] [ Model-based RL ] [ Model Biases ] [ Model compression ] [ model extraction ] [ model fairness ] [ Model Inversion ] [ model order reduction ] [ model ownership ] [ model predictive control ] [ model-predictive control ] [ Model Predictive Control ] [ Model privacy ] [ Models for code ] [ models of learning and generalization ] [ Model stealing ] [ Modern Hopfield Network ] [ modern Hopfield networks ] [ modified equation analysis ] [ modular architectures ] [ Modular network ] [ modular networks ] [ modular neural networks ] [ modular representations ] [ modulated convolution ] [ Molecular conformation generation ] [ molecular design ] [ Molecular Dynamics ] [ molecular graph generation ] [ Molecular Representation ] [ Molecule Design ] [ Momentum ] [ momentum methods ] [ momentum optimizer ] [ monotonicity ] [ Monte Carlo ] [ Monte-Carlo tree search ] [ Monte Carlo Tree Search ] [ morphology ] [ Morse theory ] [ mpc ] [ Multi-agent ] [ Multi-agent games ] [ Multiagent Learning ] [ multi-agent platform ] [ Multi-Agent Policy Gradients ] [ Multi-agent reinforcement learning ] [ Multi-agent Reinforcement Learning ] [ Multi-Agent Reinforcement Learning ] [ Multi-Agent Transfer Learning ] [ multiclass classification ] [ multi-dimensional discrete action spaces ] [ Multi-domain ] [ multi-domain disentanglement ] [ multi-head attention ] [ Multi-Hop ] [ multi-hop question answering ] [ Multi-hop Reasoning ] [ Multilingual Modeling ] [ multilingual representations ] [ multilingual transformer ] [ multilingual translation ] [ Multimodal ] [ Multi-Modal ] [ Multimodal Attention ] [ multi-modal learning ] [ Multimodal Learning ] [ Multi-Modal Learning ] [ Multimodal Spaces ] [ Multi-objective optimization ] [ multi-player ] [ Multiplicative Weights Update ] [ Multi-scale Representation ] [ multitask ] [ Multi-task ] [ Multi-task Learning ] [ Multi Task Learning ] [ Multi-Task Learning ] [ multi-task learning theory ] [ Multitask Reinforcement Learning ] [ Multi-view Learning ] [ Multi-View Learning ] [ Multi-view Representation Learning ] [ Mutual Information ] [ MuZero ] [ Named Entity Recognition ] [ NAS ] [ nash ] [ natural gradient descent ] [ Natural Language Processing ] [ natural scene statistics ] [ natural sparsity ] [ Negative Sampling ] [ negotiation ] [ nested optimization ] [ network architecture ] [ Network Architecture ] [ Network Inductive Bias ] [ network motif ] [ Network pruning ] [ Network Pruning ] [ networks ] [ network trainability ] [ network width ] [ Neural Architecture Search ] [ Neural Attention Distillation ] [ neural collapse ] [ Neural data compression ] [ Neural IR ] [ neural kernels ] [ neural link prediction ] [ Neural Model Explanation ] [ neural module network ] [ Neural Network ] [ Neural Network Bounding ] [ neural network calibration ] [ Neural Network Gaussian Process ] [ neural network robustness ] [ Neural networks ] [ Neural Networks ] [ neural network training ] [ Neural Network Verification ] [ neural ode ] [ Neural ODE ] [ Neural ODEs ] [ Neural operators ] [ Neural Physics Engines ] [ Neural Processes ] [ neural reconstruction ] [ neural sound synthesis ] [ neural spike train ] [ neural symbolic reasoning ] [ neural tangent kernel ] [ Neural tangent kernel ] [ Neural Tangent Kernel ] [ neural tangent kernels ] [ Neural text decoding ] [ neurobiology ] [ Neuroevolution ] [ Neuro symbolic ] [ Neuro-Symbolic Learning ] [ neuro-symbolic models ] [ NLI ] [ NLP ] [ Node Embeddings ] [ noise contrastive estimation ] [ Noise-contrastive learning ] [ Noise model ] [ noise robust learning ] [ Noisy Demonstrations ] [ noisy label ] [ Noisy Label ] [ Noisy Labels ] [ Non-asymptotic Confidence Intervals ] [ non-autoregressive generation ] [ nonconvex ] [ non-convex learning ] [ Non-Convex Optimization ] [ Non-IID ] [ nonlinear control theory ] [ nonlinear dynamical systems ] [ nonlinear Hawkes process ] [ nonlinear walk ] [ Non-Local Modules ] [ non-minimax optimization ] [ nonnegative PCA ] [ nonseparable Hailtonian system ] [ non-smooth models ] [ non-stationary stochastic processes ] [ no-regret learning ] [ normalized maximum likelihood ] [ normalize layer ] [ normalizers ] [ Normalizing Flow ] [ normalizing flows ] [ Normalizing flows ] [ Normalizing Flows ] [ normative models ] [ novelty-detection ] [ ntk ] [ number of linear regions ] [ numerical errors ] [ numerical linear algebra ] [ object-centric representations ] [ Object detection ] [ Object Detection ] [ object-keypoint representations ] [ ObjectNet ] [ Object Permanence ] [ Observational Imitation ] [ ODE ] [ offline ] [ offline/batch reinforcement learning ] [ off-line reinforcement learning ] [ offline reinforcement learning ] [ Offline Reinforcement Learning ] [ offline RL ] [ off-policy evaluation ] [ Off Policy Evaluation ] [ Off-policy policy evaluation ] [ Off-Policy Reinforcement Learning ] [ off-policy RL ] [ one-class-classification ] [ one-to-many mapping ] [ Open-domain ] [ open domain complex question answering ] [ open source ] [ Optimal Control Theory ] [ optimal convergence ] [ optimal power flow ] [ Optimal Transport ] [ optimal transport maps ] [ Optimisation for Deep Learning ] [ optimism ] [ Optimistic Gradient Descent Ascent ] [ Optimistic Mirror Decent ] [ Optimistic Multiplicative Weights Update ] [ Optimization ] [ order learning ] [ ordinary differential equation ] [ orthogonal ] [ orthogonal layers ] [ orthogonal machine learning ] [ Orthogonal Polynomials ] [ Oscillators ] [ outlier detection ] [ outlier-detection ] [ Outlier detection ] [ out-of-distribution ] [ Out-of-distribution detection in deep learning ] [ out-of-distribution generalization ] [ Out-of-domain ] [ over-fitting ] [ Overfitting ] [ overparameterisation ] [ over-parameterization ] [ Over-parameterization ] [ Overparameterization ] [ overparameterized neural networks ] [ Over-smoothing ] [ Oversmoothing ] [ over-squashing ] [ PAC Bayes ] [ padding ] [ parallel Monte Carlo Tree Search (MCTS) ] [ parallel tempering ] [ Parameter-Reduced MLR ] [ part-based ] [ Partial Amortization ] [ Partial differential equation ] [ partial differential equations ] [ partially observed environments ] [ particle inference ] [ pca ] [ pde ] [ pdes ] [ PDEs ] [ performer ] [ persistence diagrams ] [ personalized learning ] [ perturbation sets ] [ Peter-Weyl Theorem ] [ phase retrieval ] [ Physical parameter estimation ] [ physical reasoning ] [ physical scene understanding ] [ Physical Simulation ] [ physical symbol grounding ] [ physics ] [ physics-guided deep learning ] [ piecewise linear function ] [ pipeline toolkit ] [ plan-based reward shaping ] [ Planning ] [ Poincaré Ball Model ] [ Point cloud ] [ Point clouds ] [ point processes ] [ pointwise mutual information ] [ poisoning ] [ poisoning attack ] [ poisson matrix factorization ] [ policy learning ] [ Policy Optimization ] [ polynomial time ] [ Pose Estimation ] [ Position Embedding ] [ Position Encoding ] [ post-hoc calibration ] [ Post-Hoc Correction ] [ Post Training Quantization ] [ power grid management ] [ Predictive Modeling ] [ predictive uncertainty ] [ Predictive Uncertainty Estimation ] [ pretrained language model ] [ pretrained language model. ] [ pre-trained language model fine-tuning ] [ Pretrained Language Models ] [ Pretrained Text Encoders ] [ pre-training ] [ Pre-training ] [ Primitive Discovery ] [ principal components analysis ] [ Privacy ] [ privacy leakage from gradients ] [ privacy preserving machine learning ] [ Privacy-utility tradeoff ] [ probabelistic models ] [ probabilistic generative models ] [ probabilistic inference ] [ probabilistic matrix factorization ] [ Probabilistic Methods ] [ probabilistic multivariate forecasting ] [ probabilistic numerics ] [ probabilistic programs ] [ probably approximated correct guarantee ] [ Probe ] [ probing ] [ procedural generation ] [ procedural knowledge ] [ product of experts ] [ Product Quantization ] [ Program obfuscation ] [ Program Synthesis ] [ Proper Scoring Rules ] [ protein ] [ prototype propagation ] [ Provable Robustness ] [ provable sample efficiency ] [ proximal gradient descent-ascent ] [ proxy ] [ Pruning ] [ Pruning at initialization ] [ pseudo-labeling ] [ Pseudo-Labeling ] [ QA ] [ Q-learning ] [ Quantization ] [ quantum machine learning ] [ quantum mechanics ] [ Quantum Mechanics ] [ Question Answering ] [ random ] [ Random Feature ] [ Random Features ] [ Randomized Algorithms ] [ Random Matrix Theory ] [ Random Weights Neural Networks ] [ rank-collapse ] [ rank-constrained convex optimization ] [ rao ] [ rao-blackwell ] [ Rate-distortion optimization ] [ raven's progressive matrices ] [ real time recurrent learning ] [ real-world ] [ Real-world image denoising ] [ reasoning paths ] [ recommendation systems ] [ recommender system ] [ Recommender Systems ] [ recovery likelihood ] [ rectified linear unit ] [ Recurrent Generative Model ] [ Recurrent Neural Network ] [ Recurrent neural networks ] [ Recurrent Neural Networks ] [ recursive dense retrieval ] [ reformer ] [ regime agnostic methods ] [ Regression ] [ Regression without correspondence ] [ regret analysis ] [ regret minimization ] [ Regularization ] [ Regularization by denoising ] [ regularized markov decision processes ] [ Reinforcement ] [ Reinforcement learning ] [ Reinforcement Learning ] [ Reinforcement Learnings ] [ Reinforcement learning theory ] [ relabelling ] [ Relational regularized autoencoder ] [ Relation Extraction ] [ relaxed regularization ] [ relu network ] [ ReLU networks ] [ Rematerialization ] [ Render-and-Compare ] [ Reparameterization ] [ repetitions ] [ replica exchange ] [ representational learning ] [ representation analysis ] [ Representation learning ] [ Representation Learning ] [ representation learning for computer vision ] [ representation learning for robotics ] [ representation of dynamical systems ] [ Representation Theory ] [ reproducibility ] [ reproducible research ] [ Reproducing kernel Hilbert space ] [ resampling ] [ reset-free ] [ residual ] [ ResNets ] [ resource constrained ] [ Restricted Boltzmann Machines ] [ retraining ] [ Retrieval ] [ reverse accuracy ] [ reverse engineering ] [ reward learning ] [ reward randomization ] [ reward shaping ] [ reweighting ] [ Rich observation ] [ rich observations ] [ risk-averse ] [ Risk bound ] [ Risk Estimation ] [ risk sensitive ] [ rl ] [ RMSprop ] [ RNA-protein interaction prediction ] [ RNA structure ] [ RNA structure embedding ] [ RNN ] [ RNNs ] [ robotic manipulation ] [ robust ] [ robust control ] [ robust deep learning ] [ Robust Deep Learning ] [ robust learning ] [ Robust Learning ] [ Robust Machine Learning ] [ Robustness ] [ Robustness certificates ] [ Robust Overfitting ] [ ROC ] [ Role-Based Learning ] [ rooted graphs ] [ Rotation invariance ] [ rtrl ] [ Runtime Systems ] [ Saddle-point Optimization ] [ safe ] [ Safe exploration ] [ safe planning ] [ Saliency ] [ Saliency Guided Data Augmentation ] [ saliency maps ] [ SaliencyMix ] [ sample complexity separation ] [ Sample Efficiency ] [ sample information ] [ sample reweighting ] [ Sampling ] [ sampling algorithms ] [ Scalability ] [ Scale ] [ scale-invariant weights ] [ Scale of initialization ] [ scene decomposition ] [ scene generation ] [ Scene Understanding ] [ Science ] [ science of deep learning ] [ score-based generative models ] [ score matching ] [ score-matching ] [ SDE ] [ Second-order analysis ] [ second-order approximation ] [ second-order optimization ] [ Security ] [ segmented models ] [ selective classification ] [ Self-Imitation ] [ self supervised learning ] [ Self-supervised learning ] [ Self-supervised Learning ] [ Self Supervised Learning ] [ Self-Supervised Learning ] [ self-supervision ] [ self-training ] [ self-training theory ] [ semantic anomaly detection ] [ semantic directions in latent space ] [ semantic graphs ] [ Semantic Image Synthesis ] [ semantic parsing ] [ semantic role labeling ] [ semantic-segmentation ] [ Semantic Segmentation ] [ Semantic Textual Similarity ] [ semi-infinite duality ] [ semi-nonnegative matrix factorization ] [ semiparametric inference ] [ semi-supervised ] [ Semi-supervised Learning ] [ Semi-Supervised Learning ] [ semi-supervised learning theory ] [ Sentence Embeddings ] [ Sentence Representations ] [ Sentiment ] [ separation of variables ] [ Sequence Data ] [ Sequence Modeling ] [ sequence models ] [ Sequence-to-sequence learning ] [ sequence-to-sequence models ] [ sequential data ] [ Sequential probability ratio test ] [ Sequential Representation Learning ] [ set prediction ] [ set transformer ] [ SGD ] [ SGD noise ] [ sgld ] [ Shape ] [ shape bias ] [ Shape Bias ] [ Shape Encoding ] [ shapes ] [ Shapley values ] [ Sharpness Minimization ] [ side channel analysis ] [ Sigma Delta Quantization ] [ sign agnostic learning ] [ signal propagation ] [ signature ] [ sim2real ] [ sim2real transfer ] [ simple ] [ Singularity analysis ] [ singular value decomposition ] [ Sinkhorn algorithm ] [ skeleton-based action recognition ] [ sketch-based modeling ] [ sketches ] [ Skill Discovery ] [ SLAM ] [ sliced fused Gromov Wasserstein ] [ Sliced Wasserstein ] [ Slowdown attacks ] [ slowness ] [ Smooth games ] [ smoothing ] [ SMT Solvers ] [ social perception ] [ Soft Body ] [ soft labels ] [ software ] [ sound classification ] [ sound spatialization ] [ Source Code ] [ sparse Bayesian learning ] [ Sparse Embedding ] [ sparse embeddings ] [ sparse reconstruction ] [ sparse representation ] [ sparse representations ] [ sparse stochastic gates ] [ Sparsity ] [ Sparsity Learning ] [ spatial awareness ] [ spatial bias ] [ spatial uncertainty ] [ spatio-temporal forecasting ] [ spatio-temporal graph ] [ spatio-temporal modeling ] [ spatio-temporal modelling ] [ spatiotemporal prediction ] [ Spatiotemporal Understanding ] [ Spectral Analysis ] [ Spectral Distribution ] [ Spectral Graph Filter ] [ spectral regularization ] [ speech generation ] [ speech-impaired ] [ speech processing ] [ speech recognition. ] [ Speech Recognition ] [ spherical distributions ] [ spiking neural network ] [ spurious correlations ] [ square loss vs cross-entropy ] [ stability theory ] [ State abstraction ] [ state abstractions ] [ state-space models ] [ statistical learning theory ] [ Statistical Learning Theory ] [ statistical physics ] [ Statistical Physics ] [ statistical physics methods ] [ Steerable Kernel ] [ Stepsize optimization ] [ stochastic asymptotics ] [ stochastic control ] [ (stochastic) gradient descent ] [ Stochastic Gradient Descent ] [ stochastic gradient Langevin dynamics ] [ stochastic process ] [ Stochastic Processes ] [ stochastic subgradient method ] [ Storage Capacity ] [ straight-through ] [ straightthrough ] [ strategic behavior ] [ Streaming ASR ] [ structural biology ] [ structural credit assignment ] [ structural inductive bias ] [ Structured Pruning ] [ Structure learning ] [ structure prediction ] [ structures prediction ] [ Style Mixing ] [ Style Transfer ] [ subgraph reasoning. ] [ sublinear ] [ submodular optimization ] [ Subspace clustering ] [ Summarization ] [ summary statistics ] [ superpixel ] [ supervised contrastive learning ] [ Supervised Deep Networks ] [ Supervised Learning ] [ support estimation ] [ surprisal ] [ surrogate models ] [ svd ] [ SVD ] [ Symbolic Methods ] [ symbolic regression ] [ symbolic representations ] [ Symmetry ] [ symplectic networks ] [ Syntax ] [ Synthetic benchmark dataset ] [ synthetic-to-real generalization ] [ Systematic generalisation ] [ Systematicity ] [ System identification ] [ Tabular ] [ tabular data ] [ Tabular Data ] [ targeted attack ] [ Task Embeddings ] [ task generation ] [ task-oriented dialogue ] [ Task-oriented Dialogue System ] [ task reduction ] [ Task Segmentation ] [ Teacher-Student Learning ] [ teacher-student model ] [ temporal context ] [ Temporal knowledge graph ] [ temporal networks ] [ tensor product ] [ Text-based Games ] [ Text Representation ] [ Text Retrieval ] [ Text to speech ] [ Text to speech synthesis ] [ text-to-sql ] [ Texture ] [ Texture Bias ] [ Textworld ] [ Theorem proving ] [ theoretical issues in deep learning ] [ theoretical limits ] [ theoretical study ] [ Theory ] [ Theory of deep learning ] [ theory of mind ] [ Third-Person Imitation ] [ Thompson sampling ] [ time-frequency representations ] [ timescale ] [ timescales ] [ Time Series ] [ Time series forecasting ] [ time series prediction ] [ topic modelling ] [ Topology ] [ training dynamics ] [ Training Method ] [ trajectory ] [ trajectory optimization ] [ trajectory prediction ] [ Transferability ] [ Transfer learning ] [ Transfer Learning ] [ transformation invariance ] [ Transformer ] [ Transformers ] [ traveling salesperson problem ] [ Tree-structured Data ] [ trembl ] [ tropical function ] [ trust region ] [ two-layer neural network ] [ Uncertainty ] [ uncertainty calibration ] [ Uncertainty estimates ] [ Uncertainty estimation ] [ Uncertainty Machine Learning ] [ understanding ] [ understanding CNNs ] [ Understanding Data Augmentation ] [ understanding decision-making ] [ understanding deep learning ] [ Understanding Deep Learning ] [ understanding neural networks ] [ U-Net ] [ unidirectional ] [ uniprot ] [ universal approximation ] [ Universal approximation ] [ Universality ] [ universal representation learning ] [ universal sound separation ] [ unlabeled data ] [ Unlabeled Entity Problem ] [ Unlearnable Examples ] [ unrolled algorithms ] [ Unsupervised denoising ] [ Unsupervised Domain Translation ] [ unsupervised image denoising ] [ Unsupervised learning ] [ Unsupervised Learning ] [ unsupervised learning theory ] [ unsupervised loss ] [ Unsupervised Meta-learning ] [ unsupervised object discovery ] [ Unsupervised reinforcement learning ] [ unsupervised skill discovery ] [ unsupervised stabilization ] [ Upper Confidence bound applied to Trees (UCT) ] [ Usable Information ] [ VAE ] [ Value factorization ] [ value learning ] [ vanishing gradient problem ] [ variable binding ] [ variable convergence ] [ Variable Embeddings ] [ Variance Networks ] [ Variational Auto-encoder ] [ Variational autoencoders ] [ Variational Autoencoders ] [ Variational inference ] [ variational information bottleneck ] [ Verification ] [ video analysis ] [ Video Classification ] [ Video Compression ] [ video generation ] [ video-grounded dialogues ] [ Video prediction ] [ Video Reasoning ] [ video recognition ] [ Video Recognition ] [ video representation learning ] [ video synthesis ] [ video-text learning ] [ views ] [ virtual environment ] [ vision-and-language-navigation ] [ visual counting ] [ visualization ] [ visual perception ] [ Visual Reasoning ] [ visual reinforcement learning ] [ visual representation learning ] [ visual saliency ] [ vocoder ] [ voice conversion ] [ Volume Analysis ] [ VQA ] [ vulnerability of RL ] [ wanet ] [ warping functions ] [ Wasserstein ] [ wasserstein-2 barycenters ] [ wasserstein-2 distance ] [ Wasserstein distance ] [ waveform generation ] [ weakly-supervised learning ] [ weakly supervised representation learning ] [ Weak supervision ] [ Weak-supervision ] [ webly-supervised learning ] [ weight attack ] [ weight balance ] [ Weight quantization ] [ weight-sharing ] [ wide local minima ] [ Wigner-Eckart Theorem ] [ winning tickets ] [ wireframe model ] [ word-learning ] [ world models ] [ World Models ] [ worst-case generalisation ] [ xai ] [ XAI ] [ zero-order optimization ] [ zero-shot learning ] [ Zero-shot learning ] [ Zero-shot Learning ] [ Zero-shot synthesis ]

185 Results

Poster
Mon 1:00 Interpreting and Boosting Dropout from a Game-Theoretic View
Hao Zhang, Sen Li, YinChao Ma, Mingjie Li, Yichen Xie, Quanshi Zhang
Poster
Mon 1:00 MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space
Tsz Him Cheung, Dit-Yan Yeung
Poster
Mon 1:00 On Learning Universal Representations Across Languages
Xiangpeng Wei, Rongxiang Weng, Yue Hu, Luxi Xing, Heng Yu, Weihua Luo
Poster
Mon 1:00 Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang
Poster
Mon 1:00 Improve Object Detection with Feature-based Knowledge Distillation: Towards Accurate and Efficient Detectors
Linfeng Zhang, Kaisheng Ma
Poster
Mon 1:00 Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits
Jiawang Bai, Baoyuan Wu, Yong Zhang, Yiming Li, Zhifeng Li, Shu-Tao Xia
Poster
Mon 1:00 Batch Reinforcement Learning Through Continuation Method
Yijie Guo, Shengyu Feng, Nicolas Le Roux, Ed H. Chi, Honglak Lee, Minmin Chen
Poster
Mon 1:00 Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch
Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li
Poster
Mon 1:00 Wasserstein-2 Generative Networks
Alexander Korotin, Vage Egiazarian, Arip Asadulaev, Alexander Safin, Evgeny Burnaev
Poster
Mon 1:00 Domain Generalization with MixStyle
Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang
Poster
Mon 1:00 Exploring Balanced Feature Spaces for Representation Learning
Bingyi Kang, Yu Li, Sain Xie, Zehuan Yuan, Jiashi Feng
Poster
Mon 1:00 SALD: Sign Agnostic Learning with Derivatives
Matan Atzmon, Yaron Lipman
Poster
Mon 1:00 Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets
Hayeon Lee, Eunyoung Hyung, Sung Ju Hwang
Spotlight
Mon 3:40 Generalization in data-driven models of primary visual cortex
Konstantin-Klemens Lurz, Mohammad Bashiri, Konstantin Willeke, Akshay Jagadish, Eric Wang, Edgar Walker, Santiago Cadena Cadena, Taliah Muhammad, Erick M Cobos, Andreas Tolias, Alexander S Ecker, Fabian Sinz
Oral
Mon 5:00 Geometry-aware Instance-reweighted Adversarial Training
Jingfeng Zhang, Jianing ZHU, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli
Mon 6:00 WiML@ICLR 2021 Virtual Panel
Poster
Mon 9:00 MultiModalQA: complex question answering over text, tables and images
Alon Talmor, Ori Yoran, Amnon Catav, Dan Lahav, Yizhong Wang, Akari Asai, Gabriel Ilharco, Hannaneh Hajishirzi, Jonathan Berant
Poster
Mon 9:00 What Can You Learn From Your Muscles? Learning Visual Representation from Human Interactions
Kiana Ehsani, Daniel Gordon, Thomas H Nguyen, Roozbeh Mottaghi, Ali Farhadi
Poster
Mon 9:00 WrapNet: Neural Net Inference with Ultra-Low-Precision Arithmetic
Renkun Ni, Hong-Min Chu, Oscar Castaneda, Ping-yeh Chiang, Christoph Studer, Tom Goldstein
Poster
Mon 9:00 Effective Distributed Learning with Random Features: Improved Bounds and Algorithms
Yong Liu, Jiankun Liu, Shuqiang Wang
Poster
Mon 9:00 Training GANs with Stronger Augmentations via Contrastive Discriminator
Jongheon Jeong, Jinwoo Shin
Poster
Mon 9:00 Reset-Free Lifelong Learning with Skill-Space Planning
Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch
Poster
Mon 9:00 Pruning Neural Networks at Initialization: Why Are We Missing the Mark?
Jonathan Frankle, Gintare Dziugaite, Anonymous A Author, Michael Carbin
Poster
Mon 9:00 What Should Not Be Contrastive in Contrastive Learning
Tete Xiao, Xiaolong Wang, Alyosha Efros, trevor darrell
Poster
Mon 17:00 Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control
Zhuang Liu, Xuanlin Li, Bingyi Kang, trevor darrell
Poster
Mon 17:00 Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
Tianlong Chen, Zhenyu Zhang, Sijia Liu, Shiyu Chang, Zhangyang Wang
Poster
Mon 17:00 Random Feature Attention
Hao Peng, Nikolaos Pappas, Dani Yogatama, Roy Schwartz, Noah Smith, Lingpeng Kong
Poster
Mon 17:00 The Role of Momentum Parameters in the Optimal Convergence of Adaptive Polyak's Heavy-ball Methods
Wei Tao, sheng long, Gaowei Wu, Qing Tao
Poster
Mon 17:00 Incorporating Symmetry into Deep Dynamics Models for Improved Generalization
Rui Wang, Robin Walters, Rose Yu
Poster
Mon 17:00 Optimal Regularization can Mitigate Double Descent
Preetum Nakkiran, Prayaag Venkat, Sham M Kakade, Tengyu Ma
Poster
Mon 17:00 Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting
Sayna Ebrahimi, Suzanne Petryk, Akash Gokul, William Gan, Joseph E Gonzalez, Marcus Rohrbach, trevor darrell
Poster
Mon 17:00 Learning a Latent Simplex in Input Sparsity Time
Ainesh Bakshi, Chiranjib Bhattacharyya, Ravi Kannan, David Woodruff, Samson Zhou
Poster
Mon 17:00 Layer-adaptive Sparsity for the Magnitude-based Pruning
Jaeho Lee, Sejun Park, Sangwoo Mo, Sungsoo Ahn, Jinwoo Shin
Poster
Mon 17:00 MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training
Beidi Chen, Zichang Liu, Binghui Peng, Zhaozhuo Xu, Jonathan L Li, Tri Dao, Zhao Song, Anshumali Shrivastava, Christopher Re
Poster
Mon 17:00 Zero-shot Synthesis with Group-Supervised Learning
Yunhao Ge, Sami Abu-El-Haija, Gan Xin, Laurent Itti
Oral
Mon 19:00 SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments
Glen Berseth, Daniel Geng, Coline M Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine
Spotlight
Mon 21:56 Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang
Invited Talk
Tue 0:00 Geometric Deep Learning: the Erlangen Programme of ML
Michael Bronstein
Poster
Tue 1:00 Learning Accurate Entropy Model with Global Reference for Image Compression
Yichen Qian, Zhiyu Tan, Xiuyu Sun, Ming Lin, Dongyang Li, Zhenhong Sun, Li Hao, Rong Jin
Poster
Tue 1:00 Probing BERT in Hyperbolic Spaces
Boli Chen, Yao Fu, Guangwei Xu, Pengjun Xie, Chuanqi Tan, Mosha Chen, Liping Jing
Poster
Tue 1:00 BOIL: Towards Representation Change for Few-shot Learning
Jaehoon Oh, Hyungjun Yoo, ChangHwan Kim, Se-Young Yun
Poster
Tue 1:00 Identifying nonlinear dynamical systems with multiple time scales and long-range dependencies
Dominik Schmidt, Georgia Koppe, Zahra Monfared, Max Beutelspacher, Daniel Durstewitz
Poster
Tue 1:00 On Self-Supervised Image Representations for GAN Evaluation
Stanislav Morozov, Andrey Voynov, Artem Babenko
Poster
Tue 1:00 Accurate Learning of Graph Representations with Graph Multiset Pooling
Jinheon Baek, Minki Kang, Sung Ju Hwang
Poster
Tue 1:00 Prediction and generalisation over directed actions by grid cells
Changmin Yu, Timothy Behrens, Neil Burgess
Poster
Tue 1:00 Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering
Yuxuan Zhang, Wenzheng Chen, Huan Ling, Jun Gao, Yinan Zhang, Antonio Torralba, Sanja Fidler
Poster
Tue 1:00 Multiscale Score Matching for Out-of-Distribution Detection
Ahsan Mahmood, Junier Oliva, Martin A Styner
Poster
Tue 1:00 FedMix: Approximation of Mixup under Mean Augmented Federated Learning
Tehrim Yoon, Sumin Shin, Sung Ju Hwang, Eunho Yang
Poster
Tue 1:00 Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic
Deunsol Yoon, Sunghoon Hong, Byung-Jun Lee, Kee-Eung Kim
Poster
Tue 1:00 Generalization in data-driven models of primary visual cortex
Konstantin-Klemens Lurz, Mohammad Bashiri, Konstantin Willeke, Akshay Jagadish, Eric Wang, Edgar Walker, Santiago Cadena Cadena, Taliah Muhammad, Erick M Cobos, Andreas Tolias, Alexander S Ecker, Fabian Sinz
Spotlight
Tue 5:28 Identifying nonlinear dynamical systems with multiple time scales and long-range dependencies
Dominik Schmidt, Georgia Koppe, Zahra Monfared, Max Beutelspacher, Daniel Durstewitz
Tue 6:00 Lapsed Physicists Wine-and-Cheese (#1)
Poster
Tue 9:00 On the mapping between Hopfield networks and Restricted Boltzmann Machines
Matthew Smart, Anton Zilman
Poster
Tue 9:00 Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
Zhiyuan Li, Yuping Luo, Kaifeng Lyu
Poster
Tue 9:00 Ringing ReLUs: Harmonic Distortion Analysis of Nonlinear Feedforward Networks
Christian Ali Mehmeti-Göpel, David Hartmann, Michael Wand
Poster
Tue 9:00 Mapping the Timescale Organization of Neural Language Models
Hsiang-Yun Sherry Chien, Jinhan Zhang, Christopher Honey
Poster
Tue 9:00 Large Associative Memory Problem in Neurobiology and Machine Learning
Dmitry Krotov, John J Hopfield
Poster
Tue 9:00 Tent: Fully Test-Time Adaptation by Entropy Minimization
Dequan Wang, Evan Shelhamer, Shaoteng Liu, Bruno Olshausen, trevor darrell
Poster
Tue 9:00 Understanding Over-parameterization in Generative Adversarial Networks
Yogesh Balaji, Mohammadmahdi Sajedi, Neha Kalibhat, Mucong Ding, Dominik Stöger, Mahdi Soltanolkotabi, Soheil Feizi
Poster
Tue 9:00 SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments
Glen Berseth, Daniel Geng, Coline M Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine
Poster
Tue 9:00 Learning a Latent Search Space for Routing Problems using Variational Autoencoders
André Hottung, Bhanu Bhandari, Kevin Tierney
Oral
Tue 11:00 Iterated learning for emergent systematicity in VQA
Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville
Oral
Tue 12:15 Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering
Yuxuan Zhang, Wenzheng Chen, Huan Ling, Jun Gao, Yinan Zhang, Antonio Torralba, Sanja Fidler
Oral
Tue 13:13 On the mapping between Hopfield networks and Restricted Boltzmann Machines
Matthew Smart, Anton Zilman
Poster
Tue 17:00 Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability
Jeremy Cohen, Simran Kaur, Yuanzhi Li, Zico Kolter, Ameet Talwalkar
Poster
Tue 17:00 Diverse Video Generation using a Gaussian Process Trigger
Gaurav Shrivastava, Abhinav Shrivastava
Poster
Tue 17:00 Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh, Abhishek Gupta, Ashwin D Reddy, Justin Fu, Coline M Devin, Ben Eysenbach, Sergey Levine
Poster
Tue 17:00 Discrete Graph Structure Learning for Forecasting Multiple Time Series
Chao Shang, Jie Chen, Jinbo Bi
Poster
Tue 17:00 Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning
Wonyong Jeong, Jaehong Yoon, Eunho Yang, Sung Ju Hwang
Poster
Tue 17:00 Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Zhang, Tom Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, ziyu wang, Mohammad Norouzi
Poster
Tue 17:00 Usable Information and Evolution of Optimal Representations During Training
Michael Kleinman, Alessandro Achille, Daksh Idnani, Jonathan Kao
Poster
Tue 17:00 Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Xuanlin Li, Brandon Trabucco, Dong Huk Park, Michael Luo, Sheng Shen, trevor darrell, Yang Gao
Poster
Tue 17:00 Can a Fruit Fly Learn Word Embeddings?
Yuchen Liang, Chaitanya Ryali, Ben Hoover, Leopold Grinberg, Saket Navlakha, Mohammed J Zaki, Dmitry Krotov
Poster
Tue 17:00 Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?
Zhiyuan Li, Yi Zhang, Sanjeev Arora
Poster
Tue 17:00 Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis
Zhipeng Bao, Yu-Xiong Wang, Martial Hebert
Oral
Tue 21:18 MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training
Beidi Chen, Zichang Liu, Binghui Peng, Zhaozhuo Xu, Jonathan L Li, Tri Dao, Zhao Song, Anshumali Shrivastava, Christopher Re
Poster
Wed 1:00 Geometry-aware Instance-reweighted Adversarial Training
Jingfeng Zhang, Jianing ZHU, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli
Poster
Wed 1:00 Communication in Multi-Agent Reinforcement Learning: Intention Sharing
WOOJUN KIM, Jongeui Park, Youngchul Sung
Poster
Wed 1:00 Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Li Jian, Nenghai Yu, Tie-Yan Liu
Poster
Wed 1:00 New Bounds For Distributed Mean Estimation and Variance Reduction
Peter Davies, Vijaykrishna Gurunathan, Niusha Moshrefi, Saleh Ashkboos, Dan Alistarh
Poster
Wed 1:00 Reweighting Augmented Samples by Minimizing the Maximal Expected Loss
Mingyang Yi, LU HOU, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma
Poster
Wed 1:00 BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction
Yuhang Li, Ruihao Gong, Xu Tan, Yang Yang, Peng Hu, Qi Zhang, fengwei yu, Wei Wang, Shi Gu
Spotlight
Wed 5:25 Tent: Fully Test-Time Adaptation by Entropy Minimization
Dequan Wang, Evan Shelhamer, Shaoteng Liu, Bruno Olshausen, trevor darrell
Wed 6:00 The Untraditional Path to Data Science
Poster
Wed 9:00 Unsupervised Audiovisual Synthesis via Exemplar Autoencoders
Kangle Deng, Aayush Bansal, Deva Ramanan
Poster
Wed 9:00 Continuous Wasserstein-2 Barycenter Estimation without Minimax Optimization
Alexander Korotin, Lingxiao Li, Justin Solomon, Evgeny Burnaev
Poster
Wed 9:00 Graph Information Bottleneck for Subgraph Recognition
Junchi Yu, Tingyang Xu, Yu Rong, Yatao Bian, Junzhou Huang, Ran He
Poster
Wed 9:00 Neural Synthesis of Binaural Speech From Mono Audio
Alexander Richard, Dejan Markovic, Israel Gebru, Steven Krenn, Gladstone A Butler, Fernando Torre, Yaser Sheikh
Poster
Wed 9:00 Learning Mesh-Based Simulation with Graph Networks
Tobias Pfaff, Meire Fortunato, Alvaro Sanchez Gonzalez, Peter Battaglia
Poster
Wed 9:00 Graph Traversal with Tensor Functionals: A Meta-Algorithm for Scalable Learning
Elan Markowitz, Keshav Balasubramanian, Mehrnoosh Mirtaheri, Sami Abu-El-Haija, Bryan Perozzi, Greg Ver Steeg, Aram Galstyan
Poster
Wed 9:00 Anytime Sampling for Autoregressive Models via Ordered Autoencoding
Yilun Xu, Yang Song, Sahaj Garg, Linyuan Gong, Rui Shu, Aditya Grover, Stefano Ermon
Poster
Wed 9:00 Iterated learning for emergent systematicity in VQA
Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville
Poster
Wed 9:00 DARTS-: Robustly Stepping out of Performance Collapse Without Indicators
Xiangxiang Chu, Victor Wang, Bo Zhang, Shun Lu, Xiaolin Wei, Junchi Yan
Poster
Wed 9:00 SEED: Self-supervised Distillation For Visual Representation
Jacob Zhiyuan Fang, Jianfeng Wang, Lijuan Wang, Lei Zhang, 'YZ' Yezhou Yang, Zicheng Liu
Poster
Wed 9:00 IsarStep: a Benchmark for High-level Mathematical Reasoning
Wenda Li, Lei Yu, Yuhuai Wu, Lawrence Paulson
Poster
Wed 9:00 Average-case Acceleration for Bilinear Games and Normal Matrices
Carles Domingo i Enrich, Fabian Pedregosa, Damien Scieur
Poster
Wed 9:00 Theoretical bounds on estimation error for meta-learning
James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI, Toniann Pitassi, Richard Zemel
Poster
Wed 9:00 Benchmarks for Deep Off-Policy Evaluation
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, ziyu wang, Alexander Novikov, Sherry Yang, Michael Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Paine
Poster
Wed 9:00 Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang, Ruosong Wang, Simon Du, Akshay Krishnamurthy
Wed 10:00 What can AI researchers do to help prevent Lethal Autonomous Weapons?
Oral
Wed 11:15 Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh, Abhishek Gupta, Ashwin D Reddy, Justin Fu, Coline M Devin, Ben Eysenbach, Sergey Levine
Spotlight
Wed 13:38 Dynamic Tensor Rematerialization
Marisa Kirisame, Steven S. Lyubomirsky, Altan Haan, Jennifer Brennan, Mike He, Jared G Roesch, Tianqi Chen, Zachary Tatlock
Oral
Wed 16:00 Neural Synthesis of Binaural Speech From Mono Audio
Alexander Richard, Dejan Markovic, Israel Gebru, Steven Krenn, Gladstone A Butler, Fernando Torre, Yaser Sheikh
Spotlight
Wed 16:45 Learning Mesh-Based Simulation with Graph Networks
Tobias Pfaff, Meire Fortunato, Alvaro Sanchez Gonzalez, Peter Battaglia
Poster
Wed 17:00 Measuring Massive Multitask Language Understanding
Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt
Poster
Wed 17:00 Robust Overfitting may be mitigated by properly learned smoothening
Tianlong Chen, Zhenyu Zhang, Sijia Liu, Shiyu Chang, Zhangyang Wang
Poster
Wed 17:00 A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks
Renjie Liao, Raquel Urtasun, Richard Zemel
Poster
Wed 17:00 Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation
Jungo Kasai, Nikolaos Pappas, Hao Peng, James Cross, Noah Smith
Poster
Wed 17:00 AdaSpeech: Adaptive Text to Speech for Custom Voice
Mingjian Chen, Xu Tan, Bohan Li, Eric Liu, Tao Qin, sheng zhao, Tie-Yan Liu
Poster
Wed 17:00 INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
Yuhuai Wu, Albert Jiang, Jimmy Ba, Roger Grosse
Poster
Wed 17:00 Estimating Lipschitz constants of monotone deep equilibrium models
Chirag Pabbaraju, Ezra Winston, Zico Kolter
Poster
Wed 17:00 Estimating informativeness of samples with Smooth Unique Information
Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto
Poster
Wed 17:00 On the Critical Role of Conventions in Adaptive Human-AI Collaboration
Andy Shih, Arjun Sawhney, Jovana Kondic, Stefano Ermon, Dorsa Sadigh
Poster
Wed 17:00 Meta Back-Translation
Hieu Pham, Xinyi Wang, Yiming Yang, Graham Neubig
Poster
Wed 17:00 Wandering within a world: Online contextualized few-shot learning
Mengye Ren, Michael L Iuzzolino, Mike Mozer, Richard Zemel
Poster
Wed 17:00 GANs Can Play Lottery Tickets Too
Xuxi Chen, Zhenyu Zhang, Yongduo Sui, Tianlong Chen
Poster
Wed 17:00 In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness
Sang Michael Xie, Ananya Kumar, Robbie Jones, Fereshte Khani, Tengyu Ma, Percy Liang
Poster
Wed 17:00 Efficient Wasserstein Natural Gradients for Reinforcement Learning
Ted Moskovitz, Michael Arbel, Ferenc Huszar, Arthur Gretton
Spotlight
Wed 19:25 Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric Chang, Yan Xu
Oral
Wed 19:55 Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai
Spotlight
Wed 21:25 Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control
Zhuang Liu, Xuanlin Li, Bingyi Kang, trevor darrell
Poster
Thu 1:00 Efficient Generalized Spherical CNNs
Oliver Cobb, Christopher Wallis, Augustine Mavor-Parker, Augustin Marignier, Matthew Price, Mayeul d'Avezac, Jason McEwen
Poster
Thu 1:00 An Unsupervised Deep Learning Approach for Real-World Image Denoising
Dihan Zheng, Sia Huat Tan, Xiaowen Zhang, Zuoqiang Shi, Kaisheng Ma, Chenglong Bao
Poster
Thu 1:00 Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai
Poster
Thu 1:00 The inductive bias of ReLU networks on orthogonally separable data
Mary Phuong, Christoph H Lampert
Poster
Thu 1:00 Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks
Shikuang Deng, Shi Gu
Poster
Thu 1:00 Private Image Reconstruction from System Side Channels Using Generative Models
Yuanyuan Yuan, Shuai Wang, Junping Zhang
Poster
Thu 1:00 Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Wonkwang Lee, Whie Jung, Han Zhang, Ting Chen, Jing Yu Koh, Thomas E Huang, Hyungsuk Yoon, Honglak Lee, Seunghoon Hong
Poster
Thu 1:00 IOT: Instance-wise Layer Reordering for Transformer Structures
Jinhua Zhu, Lijun Wu, Yingce Xia, Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu
Poster
Thu 1:00 Learnable Embedding sizes for Recommender Systems
Siyi Liu, Chen Gao, Yihong Chen, Depeng Jin, Yong Li
Poster
Thu 1:00 Contrastive Learning with Adversarial Perturbations for Conditional Text Generation
Seanie Lee, Dong Bok Lee, Sung Ju Hwang
Spotlight
Thu 3:15 Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic
Deunsol Yoon, Sunghoon Hong, Byung-Jun Lee, Kee-Eung Kim
Spotlight
Thu 4:55 On Self-Supervised Image Representations for GAN Evaluation
Stanislav Morozov, Andrey Voynov, Artem Babenko
Poster
Thu 9:00 Learning to live with Dale's principle: ANNs with separate excitatory and inhibitory units
Jonathan Cornford, Damjan Kalajdzievski, Marco Leite, Amélie Lamarquette, Dimitri Kullmann, Blake A Richards
Poster
Thu 9:00 Robust early-learning: Hindering the memorization of noisy labels
Xiaobo Xia, Tongliang Liu, Bo Han, Chen Gong, Nannan Wang, Zongyuan Ge, Yi Chang
Poster
Thu 9:00 CaPC Learning: Confidential and Private Collaborative Learning
Christopher Choquette-Choo, Natalie Dullerud, Adam Dziedzic, Yunxiang Zhang, Somesh Jha, Nicolas Papernot, Xiao Wang
Poster
Thu 9:00 Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes
Jake Snell, Richard Zemel
Poster
Thu 9:00 Dynamic Tensor Rematerialization
Marisa Kirisame, Steven S. Lyubomirsky, Altan Haan, Jennifer Brennan, Mike He, Jared G Roesch, Tianqi Chen, Zachary Tatlock
Poster
Thu 9:00 Blending MPC & Value Function Approximation for Efficient Reinforcement Learning
Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots
Poster
Thu 9:00 Neural Spatio-Temporal Point Processes
Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel
Poster
Thu 9:00 Directed Acyclic Graph Neural Networks
Veronika Thost, Jie Chen
Oral
Thu 11:45 Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?
Zhiyuan Li, Yi Zhang, Sanjeev Arora
Poster
Thu 17:00 Prototypical Representation Learning for Relation Extraction
Ning Ding, Xiaobin Wang, Yao Fu, Guangwei Xu, Rui Wang, Pengjun Xie, Ying Shen, Fei Huang, Hai-Tao Zheng, Rui Zhang
Poster
Thu 17:00 Learning to Sample with Local and Global Contexts in Experience Replay Buffer
Youngmin Oh, Kimin Lee, Jinwoo Shin, Eunho Yang, Sung Ju Hwang
Poster
Thu 17:00 No MCMC for me: Amortized sampling for fast and stable training of energy-based models
Will Grathwohl, Jacob Kelly, Milad Hashemi, Mohammad Norouzi, Kevin Swersky, David Duvenaud
Poster
Thu 17:00 Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data
Colin Wei, Kendrick Shen, Yining Chen, Tengyu Ma
Poster
Thu 17:00 Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric Chang, Yan Xu
Poster
Thu 17:00 Self-supervised Representation Learning with Relative Predictive Coding
Yao-Hung Hubert Tsai, Martin Q Ma, Muqiao Yang, Han Zhao, LP Morency, Ruslan Salakhutdinov
Thu 17:00 Lapsed Physicists Wine-and-Cheese (#2)
Poster
Thu 17:00 Molecule Optimization by Explainable Evolution
Binghong Chen, Tianzhe Wang, Chengtao Li, Hanjun Dai, Le Song
Poster
Thu 17:00 Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization
Juntae Lee, Mihir Jain, Hyoungwoo Park, Sungrack Yun
Poster
Thu 17:00 Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Wenhan Xiong, Lorraine Li, Srini Iyer, Jingfei Du, Patrick Lewis, William Yang Wang, Yashar Mehdad, Scott Yih, Sebastian Riedel, Douwe Kiela, Barlas Oguz
Poster
Thu 17:00 CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li, xianhang li, Yali Wang, Jun Wang, Yu Qiao
Poster
Thu 17:00 Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization
Kaidi Cao, Yining Chen, Junwei Lu, Nikos Arechiga, Adrien Gaidon, Tengyu Ma
Oral
Thu 19:00 Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data
Colin Wei, Kendrick Shen, Yining Chen, Tengyu Ma
Spotlight
Thu 20:15 Random Feature Attention
Hao Peng, Nikolaos Pappas, Dani Yogatama, Roy Schwartz, Noah Smith, Lingpeng Kong
Spotlight
Thu 20:58 Learning a Latent Simplex in Input Sparsity Time
Ainesh Bakshi, Chiranjib Bhattacharyya, Ravi Kannan, David Woodruff, Samson Zhou
Workshop
Fri 2:30 Science and Engineering of Deep Learning
Levent Sagun, Caglar Gulcehre, Adriana Romero, Negar Rostamzadeh, Stefano Sarao Mannelli, Lenka Zdeborova, Samy Bengio
Workshop
Fri 5:00 Geometric and Topological Representation Learning
Guy Wolf, Xiuyuan Cheng, Smita Krishnaswamy, Jure Leskovec, Bastian Rieck, Soledad Villar
Workshop
Fri 5:55 AI for Public Health
Bryan Wilder, Ioana Bica, Marie-Laure Charpignon, Emma Pierson
Workshop
Fri 5:55 The Role of Mathematical Reasoning in General Artificial Intelligence
Yuhuai Wu, Kshitij Bansal, Wenda Li, Melanie Mitchell, David McAllester, John Harrison
Workshop
Fri 6:00 AIMOCC -- AI: Modeling Oceans and Climate Change
Luis Martí, Nayat Sánchez-Pi
Workshop
Fri 6:00 A Roadmap to Never-Ending RL
Feryal Behbahani, Khimya Khetarpal, Louis Kirsch, Rose Wang, Annie Xie, Adam White, Doina Precup
Workshop
Fri 6:00 Invited talk by Aisha Walcott
Aisha Walcott-Bryant
Workshop
Fri 6:30 How Can Findings About The Brain Improve AI Systems?
Shinji Nishimoto, Leila Wehbe, Alexander Huth, Javier Turek, Nicole Beckage, Vy Vo, Mariya Toneva, Hsiang-Yun Chien, Shailee Jain, Richard Antonello
Workshop
Fri 7:00 Workshop on Learning to Learn
Sarah Bechtle, Todor Davchev, Yevgen Chebotar, Timothy Hospedales, Franziska Meier
Workshop
Fri 7:00 2nd Workshop on Practical ML for Developing Countries: Learning Under Limited/low Resource Scenarios
Esube Bekele, Waheeda Saib, Timnit Gebru, Meareg Hailemariam, Vukosi Marivate, Judy Gichoya
Workshop
Fri 7:10 Invited Speaker Dan Roth - Natural Language Understanding with Incidental Supervision
Dan Roth
Workshop
Fri 7:27 Nitesh Chawla, Frank M. Freimann Professor of Computer Science & Engineering and Director of Lucy Family Institute for Data and Society at the University of Notre Dame
Nitesh Chawla
Workshop
Fri 7:45 Coffee break and short paper presentations and discussion.
Hernán Lira, Björn Lütjens, Mark Veillette, Dava Newman, Konstantin Klemmer, Sudipan Saha, Matthias Kahl, Lin Xu, Xiaoxiang Zhu, Hiske Overweg, Ioannis N. Athanasiadis, Nayat Sánchez-Pi, Luis Martí
Workshop
Fri 7:55 ICLR 2021 Workshop on Embodied Multimodal Learning (EML)
Ruohan Gao, Andrew Owens, Dinesh Jayaraman, Yuke Zhu, Jiajun Wu, Kristen Grauman
Workshop
Fri 8:01 "Generative Models for Image Synthesis" by Jan Kautz, NVIDIA
Jan Kautz
Workshop
Fri 8:03 Data Science to fight against COVID-19 by Nuria Oliver
Nuria Oliver
Workshop
Fri 8:25 Invited Speaker Marine Carpuat - Weak Supervision for Cross-Lingual Semantic Analysis
Marine Carpuat
Workshop
Fri 8:40 Biased Client Selection for Improved Convergence of Federated Learning
Gauri Joshi
Workshop
Fri 9:40 Inference Risks for Machine Learning
David Evans
Workshop
Fri 9:51 "Towards Financial Synthetic Data" by Manuela M. Veloso, J.P.Morgan, CMU
Manuela Veloso
Workshop
Fri 10:00 Panel: Values in science and engineering of ML research
Danielle Belgrave, Meredith Broussard, Silvia Chiappa, Jonathan Frankle, Sandra Wachter, Shakir Mohamed, Emily Dinan
Workshop
Fri 10:32 Bharath Hariharan, Assistant Professor of Computer Science at Cornell University
Bharath Hariharan
Workshop
Fri 10:51 "Bias and Generalization of Deep Generative Models" by Stefano Ermon, Stanford University
Stefano Ermon
Workshop
Fri 11:00 Jack Gallant, UC Berkeley: Neuroscience and AI/ML: Examples from studies of navigation and attention
gallant, Alexander Huth
Workshop
Fri 11:20 Invited Speaker Heng Ji - InfoSurgeon: Cross-media Weak Supervision for Knowledge-Element Level Fake News Detection
Heng Ji
Workshop
Fri 11:51 "Generative Modeling for Music Generation" by Sander Dieleman, DeepMind
Sander Dieleman
Workshop
Fri 12:51 "Ethical Considerations of Generative AI" by Emily Denton, Google’s Ethical AI team
Emily Denton
Workshop
Fri 15:25 Invited Speaker Paroma Varma - Snorkel: Programmatically Labeling Training Data
Paroma Varma