Topic Keywords

[ $\ell_1$ norm ] [ $f-$divergence ] [ 3D Convolution ] [ 3D deep learning ] [ 3D generation ] [ 3d point cloud ] [ 3D Reconstruction ] [ 3D scene understanding ] [ 3D shape representations ] [ 3D shapes learning ] [ 3D vision ] [ 3D Vision ] [ abstract reasoning ] [ abstract rules ] [ Acceleration ] [ accuracy ] [ acoustic condition modeling ] [ Action localization ] [ action recognition ] [ activation maximization ] [ activation strategy. ] [ Active learning ] [ Active Learning ] [ AdaBoost ] [ adaptive heavy-ball methods ] [ Adaptive Learning ] [ adaptive methods ] [ adaptive optimization ] [ ADMM ] [ Adversarial Accuracy ] [ Adversarial Attack ] [ Adversarial Attacks ] [ adversarial attacks/defenses ] [ Adversarial computer programs ] [ Adversarial Defense ] [ Adversarial Example Detection ] [ Adversarial Examples ] [ Adversarial Learning ] [ Adversarial Machine Learning ] [ adversarial patch ] [ Adversarial robustness ] [ Adversarial Robustness ] [ Adversarial training ] [ Adversarial Training ] [ Adversarial Transferability ] [ aesthetic assessment ] [ affine parameters ] [ age estimation ] [ Aggregation Methods ] [ AI for earth science ] [ ALFRED ] [ Algorithm ] [ algorithmic fairness ] [ Algorithmic fairness ] [ Algorithms ] [ alignment ] [ alignment of semantic and visual space ] [ amortized inference ] [ Analogies ] [ annotation artifacts ] [ anomaly-detection ] [ Anomaly detection with deep neural networks ] [ anonymous walk ] [ appearance transfer ] [ approximate constrained optimization ] [ approximation ] [ Approximation ] [ Architectures ] [ argoverse ] [ Artificial Integlligence ] [ ASR ] [ assistive technology ] [ associative memory ] [ Associative Memory ] [ asynchronous parallel algorithm ] [ Atari ] [ Attention ] [ Attention Mechanism ] [ Attention Modules ] [ attractors ] [ attributed walks ] [ Auction Theory ] [ audio understanding ] [ Audio-Visual ] [ audio visual learning ] [ audio-visual representation ] [ audio-visual representation learning ] [ Audio-visual sound separation ] [ audiovisual synthesis ] [ augmented deep reinforcement learning ] [ autodiff ] [ Autoencoders ] [ automated data augmentation ] [ automated machine learning ] [ automatic differentiation ] [ AutoML ] [ autonomous learning ] [ autoregressive language model ] [ Autoregressive Models ] [ AutoRL ] [ auxiliary information ] [ auxiliary latent variable ] [ Auxiliary Learning ] [ auxiliary task ] [ Average-case Analysis ] [ aversarial examples ] [ avoid knowledge leaking ] [ backdoor attack ] [ Backdoor Attacks ] [ Backdoor Defense ] [ Backgrounds ] [ backprop ] [ back translation ] [ backward error analysis ] [ bagging ] [ batchnorm ] [ Batch Normalization ] [ batch reinforcement learning ] [ Batch Reinforcement Learning ] [ batch selection ] [ Bayesian ] [ Bayesian classification ] [ Bayesian inference ] [ Bayesian Inference ] [ Bayesian networks ] [ Bayesian Neural Networks ] [ behavior cloning ] [ belief-propagation ] [ Benchmark ] [ benchmarks ] [ benign overfitting ] [ bert ] [ BERT ] [ beta-VAE ] [ better generalization ] [ biased sampling ] [ biases ] [ Bias in Language Models ] [ bidirectional ] [ bilevel optimization ] [ Bilinear games ] [ Binary Embeddings ] [ Binary Neural Networks ] [ binaural audio ] [ binaural speech ] [ biologically plausible ] [ Biometrics ] [ bisimulation ] [ Bisimulation ] [ bisimulation metrics ] [ bit-flip ] [ bit-level sparsity ] [ blind denoising ] [ blind spots ] [ block mdp ] [ boosting ] [ bottleneck ] [ bptt ] [ branch and bound ] [ Brownian motion ] [ Budget-Aware Pruning ] [ Budget constraints ] [ Byzantine resilience ] [ Byzantine SGD ] [ CAD modeling ] [ calibration ] [ Calibration ] [ calibration measure ] [ cancer research ] [ Capsule Networks ] [ Catastrophic forgetting ] [ Catastrophic Forgetting ] [ Causal Inference ] [ Causality ] [ Causal network ] [ certificate ] [ certified defense ] [ Certified Robustness ] [ challenge sets ] [ change of measure ] [ change point detection ] [ channel suppressing ] [ Channel Tensorization ] [ Channel-Wise Approximated Activation ] [ Chaos ] [ chebyshev polynomial ] [ checkpointing ] [ Checkpointing ] [ chemistry ] [ CIFAR ] [ Classification ] [ class imbalance ] [ clean-label ] [ Clustering ] [ Clusters ] [ CNN ] [ CNNs ] [ Code Compilation ] [ Code Representations ] [ Code Structure ] [ code summarization ] [ Code Summarization ] [ Cognitively-inspired Learning ] [ cold posteriors ] [ collaborative learning ] [ Combinatorial optimization ] [ common object counting ] [ commonsense question answering ] [ Commonsense Reasoning ] [ Communication Compression ] [ co-modulation ] [ complete verifiers ] [ complex query answering ] [ Composition ] [ compositional generalization ] [ compositional learning ] [ compositional task ] [ Compressed videos ] [ Compressing Deep Networks ] [ Compression ] [ computation ] [ computational biology ] [ Computational Biology ] [ computational complexity ] [ Computational imaging ] [ Computational neuroscience ] [ Computational resources ] [ computer graphics ] [ Computer Vision ] [ concentration ] [ Concentration of Measure ] [ Concept-based Explanation ] [ concept drift ] [ Concept Learning ] [ conditional expectation ] [ Conditional GANs ] [ Conditional Generation ] [ Conditional generative adversarial networks ] [ conditional layer normalization ] [ Conditional Neural Processes ] [ Conditional Risk Minimization ] [ Conditional Sampling ] [ conditional text generation ] [ Conferrability ] [ confidentiality ] [ conformal inference ] [ conformal prediction ] [ conjugacy ] [ conservation law ] [ consistency ] [ consistency training ] [ Consistency Training ] [ constellation models ] [ constrained beam search ] [ Constrained optimization ] [ constrained RL ] [ constraints ] [ constraint satisfaction ] [ contact tracing ] [ Contextual Bandits ] [ Contextual embedding space ] [ Continual learning ] [ Continual Learning ] [ continuation method ] [ continuous and scalar conditions ] [ continuous case ] [ Continuous Control ] [ continuous convolution ] [ continuous games ] [ continuous normalizing flow ] [ continuous time ] [ Continuous-time System ] [ continuous treatment effect ] [ contrastive divergence ] [ Contrastive learning ] [ Contrastive Learning ] [ Contrastive Methods ] [ contrastive representation learning ] [ control barrier function ] [ controlled generation ] [ Controlled NLG ] [ Convergence ] [ Convergence Analysis ] [ convex duality ] [ Convex optimization ] [ ConvNets ] [ convolutional kernel methods ] [ Convolutional Layer ] [ convolutional models ] [ Convolutional Networks ] [ copositive programming ] [ corruptions ] [ COST ] [ Counterfactual inference ] [ counterfactuals ] [ Counterfactuals ] [ covariant neural networks ] [ covid-19 ] [ COVID-19 ] [ Cross-domain ] [ cross-domain few-shot learning ] [ cross-domain video generation ] [ cross-episode attention ] [ cross-fitting ] [ cross-lingual pretraining ] [ Cryptographic inference ] [ cultural transmission ] [ Curriculum Learning ] [ curse of memory ] [ curvature estimates ] [ custom voice ] [ cycle-consistency regularization ] [ cycle-consistency regularizer ] [ DAG ] [ DARTS stability ] [ Data augmentation ] [ Data Augmentation ] [ data cleansing ] [ Data-driven modeling ] [ data-efficient learning ] [ data-efficient RL ] [ Data Flow ] [ data labeling ] [ data parallelism ] [ Data Poisoning ] [ Data Protection ] [ Dataset ] [ dataset bias ] [ dataset compression ] [ dataset condensation ] [ dataset corruption ] [ dataset distillation ] [ dataset summarization ] [ data structures ] [ debiased training ] [ debugging ] [ Decentralized Optimization ] [ decision boundary geometry ] [ decision trees ] [ declarative knowledge ] [ deep-anomaly-detection ] [ Deep Architectures ] [ Deep denoising priors ] [ deep embedding ] [ Deep Ensembles ] [ deep equilibrium models ] [ Deep Equilibrium Models ] [ Deepfake ] [ deep FBSDEs ] [ Deep Gaussian Processes ] [ Deep generative model ] [ Deep generative modeling ] [ Deep generative models ] [ deeplearning ] [ Deep learning ] [ Deep Learning ] [ deep learning dynamics ] [ Deep Learning Theory ] [ deep network training ] [ deep neural network ] [ deep neural networks. ] [ Deep Neural Networks ] [ deep one-class classification ] [ deep Q-learning ] [ Deep reinforcement learning ] [ Deep Reinforcement Learning ] [ deep ReLU networks ] [ Deep residual neural networks ] [ deep RL ] [ deep sequence model ] [ deepset ] [ Deep Sets ] [ Deformation Modeling ] [ delay ] [ Delay differential equations ] [ denoising score matching ] [ Dense Retrieval ] [ Density estimation ] [ Density Estimation ] [ Density ratio estimation ] [ dependency based method ] [ deployment-efficiency ] [ depression ] [ depth separation ] [ descent ] [ description length ] [ determinantal point processes ] [ Device Placement ] [ dialogue state tracking ] [ differentiable optimization ] [ Differentiable physics ] [ Differentiable Physics ] [ Differentiable program generator ] [ differentiable programming ] [ Differentiable rendering ] [ Differentiable simulation ] [ differential dynamica programming ] [ differential equations ] [ Differential Geometry ] [ differentially private deep learning ] [ Differential Privacy ] [ diffusion probabilistic models ] [ diffusion process ] [ dimension ] [ Directed Acyclic Graphs ] [ Dirichlet form ] [ Discrete Optimization ] [ discretization error ] [ disentangled representation learning ] [ Disentangled representation learning ] [ Disentanglement ] [ distance ] [ Distillation ] [ distinct elements ] [ Distributed ] [ distributed deep learning ] [ distributed inference ] [ Distributed learning ] [ distributed machine learning ] [ Distributed ML ] [ Distributed Optimization ] [ distributional robust optimization ] [ distribution estimation ] [ distribution shift ] [ diverse strategies ] [ diverse video generation ] [ Diversity denoising ] [ Diversity Regularization ] [ DNN ] [ DNN compression ] [ document analysis ] [ document classification ] [ document retrieval ] [ domain adaptation theory ] [ Domain Adaption ] [ Domain Generalization ] [ domain randomization ] [ Domain Translation ] [ double descent ] [ Double Descent ] [ doubly robustness ] [ Doubly-weighted Laplace operator ] [ Dropout ] [ drug discovery ] [ Drug discovery ] [ dst ] [ Dual-mode ASR ] [ Dueling structure ] [ Dynamical Systems ] [ dynamic computation graphs ] [ dynamics ] [ dynamics prediction ] [ dynamic systems ] [ Early classification ] [ Early pruning ] [ early stopping ] [ EBM ] [ Edit ] [ EEG ] [ effective learning rate ] [ Efficiency ] [ Efficient Attention Mechanism ] [ efficient deep learning ] [ Efficient Deep Learning ] [ Efficient Deep Learning Inference ] [ Efficient ensembles ] [ efficient inference ] [ efficient inference methods ] [ Efficient Inference Methods ] [ EfficientNets ] [ efficient network ] [ Efficient Networks ] [ Efficient training ] [ Efficient Training ] [ efficient training and inference. ] [ egocentric ] [ eigendecomposition ] [ Eigenspectrum ] [ ELBO ] [ electroencephalography ] [ EM ] [ Embedding Models ] [ Embedding Size ] [ Embodied Agents ] [ embodied vision ] [ emergent behavior ] [ empirical analysis ] [ Empirical Game Theory ] [ empirical investigation ] [ Empirical Investigation ] [ empirical study ] [ empowerment ] [ Encoder layer fusion ] [ end-to-end entity linking ] [ End-to-End Object Detection ] [ Energy ] [ Energy-Based GANs ] [ energy based model ] [ energy-based model ] [ Energy-based model ] [ energy based models ] [ Energy-based Models ] [ Energy Based Models ] [ Energy-Based Models ] [ Energy Score ] [ ensemble ] [ Ensemble ] [ ensemble learning ] [ ensembles ] [ Ensembles ] [ entity disambiguation ] [ entity linking ] [ entity retrieval ] [ entropic algorithms ] [ Entropy Maximization ] [ Entropy Model ] [ entropy regularization ] [ epidemiology ] [ episode-level pretext task ] [ episodic training ] [ equilibrium ] [ equivariant ] [ equivariant neural network ] [ ERP ] [ Evaluation ] [ evaluation of interpretability ] [ Event localization ] [ evolution ] [ Evolutionary algorithm ] [ Evolutionary Algorithm ] [ Evolutionary Algorithms ] [ Excess risk ] [ experience replay buffer ] [ experimental evaluation ] [ Expert Models ] [ Explainability ] [ explainable ] [ Explainable AI ] [ Explainable Model ] [ explaining decision-making ] [ explanation method ] [ explanations ] [ Explanations ] [ Exploration ] [ Exponential Families ] [ exponential tilting ] [ exposition ] [ external memory ] [ Extrapolation ] [ extremal sector ] [ facial recognition ] [ factor analysis ] [ factored MDP ] [ Factored MDP ] [ fairness ] [ Fairness ] [ faithfulness ] [ fast DNN inference ] [ fast learning rate ] [ fast-mapping ] [ fast weights ] [ FAVOR ] [ Feature Attribution ] [ feature propagation ] [ features ] [ feature visualization ] [ Feature Visualization ] [ Federated learning ] [ Federated Learning ] [ Few Shot ] [ few-shot concept learning ] [ few-shot domain generalization ] [ Few-shot learning ] [ Few Shot Learning ] [ fine-tuning ] [ finetuning ] [ Fine-tuning ] [ Finetuning ] [ fine-tuning stability ] [ Fingerprinting ] [ First-order Methods ] [ first-order optimization ] [ fisher ratio ] [ flat minima ] [ Flexibility ] [ flow graphs ] [ Fluid Dynamics ] [ Follow-the-Regularized-Leader ] [ Formal Verification ] [ forward mode ] [ Fourier Features ] [ Fourier transform ] [ framework ] [ Frobenius norm ] [ from-scratch ] [ frontend ] [ fruit fly ] [ fully-connected ] [ Fully-Connected Networks ] [ future frame generation ] [ future link prediction ] [ fuzzy tiling activation function ] [ Game Decomposition ] [ Game Theory ] [ GAN ] [ GAN compression ] [ GANs ] [ Garbled Circuits ] [ Gaussian Copula ] [ Gaussian Graphical Model ] [ Gaussian Isoperimetric Inequality ] [ Gaussian mixture model ] [ Gaussian process ] [ Gaussian Process ] [ Gaussian Processes ] [ gaussian process priors ] [ GBDT ] [ generalisation ] [ Generalization ] [ Generalization Bounds ] [ generalization error ] [ Generalization Measure ] [ Generalization of Reinforcement Learning ] [ generalized ] [ generalized Girsanov theorem ] [ Generalized PageRank ] [ Generalized zero-shot learning ] [ Generation ] [ Generative Adversarial Network ] [ Generative Adversarial Networks ] [ generative art ] [ Generative Flow ] [ Generative Model ] [ Generative modeling ] [ Generative Modeling ] [ generative modelling ] [ Generative Modelling ] [ Generative models ] [ Generative Models ] [ genetic programming ] [ Geodesic-Aware FC Layer ] [ geometric ] [ Geometric Deep Learning ] [ G-invariance regularization ] [ global ] [ global optima ] [ Global Reference ] [ glue ] [ GNN ] [ GNNs ] [ goal-conditioned reinforcement learning ] [ goal-conditioned RL ] [ goal reaching ] [ gradient ] [ gradient alignment ] [ Gradient Alignment ] [ gradient boosted decision trees ] [ gradient boosting ] [ gradient decomposition ] [ Gradient Descent ] [ gradient descent-ascent ] [ gradient flow ] [ Gradient flow ] [ gradient flows ] [ gradient redundancy ] [ Gradient stability ] [ Grammatical error correction ] [ Granger causality ] [ Graph ] [ graph classification ] [ graph coarsening ] [ Graph Convolutional Network ] [ Graph Convolutional Neural Networks ] [ graph edit distance ] [ Graph Generation ] [ Graph Generative Model ] [ graph-level prediction ] [ graph networks ] [ Graph neural network ] [ Graph Neural Network ] [ Graph neural networks ] [ Graph Neural Networks ] [ Graph pooling ] [ graph representation learning ] [ Graph representation learning ] [ Graph Representation Learning ] [ graph shift operators ] [ graph-structured data ] [ graph structure learning ] [ Greedy Learning ] [ grid cells ] [ grounding ] [ group disparities ] [ group equivariance ] [ Group Equivariance ] [ Group Equivariant Convolution ] [ group equivariant self-attention ] [ group equivariant transformers ] [ group sparsity ] [ Group-supervised learning ] [ gumbel-softmax ] [ Hamiltonian systems ] [ hard-label attack ] [ hard negative mining ] [ hard negative sampling ] [ Hardware-Aware Neural Architecture Search ] [ Harmonic Analysis ] [ harmonic distortion analysis ] [ healthcare ] [ Healthcare ] [ heap allocation ] [ Hessian matrix ] [ Heterogeneity ] [ Heterogeneous ] [ heterogeneous data ] [ Heterogeneous data ] [ Heterophily ] [ heteroscedasticity ] [ heuristic search ] [ hidden-parameter mdp ] [ hierarchical contrastive learning ] [ Hierarchical Imitation Learning ] [ Hierarchical Multi-Agent Learning ] [ Hierarchical Networks ] [ Hierarchical Reinforcement Learning ] [ Hierarchy-Aware Classification ] [ high-dimensional asymptotics ] [ high-dimensional statistic ] [ high-resolution video generation ] [ hindsight relabeling ] [ histogram binning ] [ historical color image classification ] [ HMC ] [ homomorphic encryption ] [ Homophily ] [ Hopfield layer ] [ Hopfield networks ] [ Hopfield Networks ] [ human-AI collaboration ] [ human cognition ] [ human-computer interaction ] [ human preferences ] [ human psychophysics ] [ humans in the loop ] [ hybrid systems ] [ Hyperbolic ] [ hyperbolic deep learning ] [ Hyperbolic Geometry ] [ hypercomplex representation learning ] [ hypergradients ] [ Hypernetworks ] [ hyperparameter ] [ Hyperparameter Optimization ] [ Hyper-Parameter Optimization ] [ HYPERPARAMETER OPTIMIZATION ] [ Image Classification ] [ image completion ] [ Image compression ] [ Image Editing ] [ Image Generation ] [ Image manipulation ] [ Image Modeling ] [ ImageNet ] [ image reconstruction ] [ Image segmentation ] [ Image Synthesis ] [ image-to-action learning ] [ Image-to-Image Translation ] [ image translation ] [ image warping ] [ imbalanced learning ] [ Imitation Learning ] [ Impartial Learning ] [ implicit bias ] [ Implicit Bias ] [ Implicit Deep Learning ] [ implicit differentiation ] [ implicit functions ] [ implicit neural representations ] [ Implicit Neural Representations ] [ Implicit Representation ] [ Importance Weighting ] [ impossibility ] [ incoherence ] [ Incompatible Environments ] [ Incremental Tree Transformations ] [ independent component analysis ] [ indirection ] [ Individual mediation effects ] [ Inductive Bias ] [ inductive biases ] [ inductive representation learning ] [ infinitely wide neural network ] [ Infinite-Width Limit ] [ infinite-width networks ] [ influence functions ] [ Influence Functions ] [ Information bottleneck ] [ Information Bottleneck ] [ Information Geometry ] [ information-theoretical probing ] [ Information theory ] [ Information Theory ] [ Initialization ] [ input-adaptive multi-exit neural networks ] [ input convex neural networks ] [ input-convex neural networks ] [ InstaHide ] [ Instance adaptation ] [ instance-based label noise ] [ Instance learning ] [ Instance-wise Learning ] [ Instrumental Variable Regression ] [ integral probability metric ] [ intention ] [ interaction networks ] [ Interactions ] [ interactive fiction ] [ Internet of Things ] [ Interpolation Peak ] [ Interpretability ] [ interpretable latent representation ] [ Interpretable Machine Learning ] [ interpretable policy learning ] [ in-the-wild data ] [ Intrinsically Motivated Reinforcement Learning ] [ Intrinsic Motivation ] [ intrinsic motivations ] [ Intrinsic Reward ] [ Invariance and Equivariance ] [ invariance penalty ] [ invariances ] [ Invariant and equivariant deep networks ] [ Invariant Representations ] [ invariant risk minimization ] [ Invariant subspaces ] [ inverse graphics ] [ Inverse reinforcement learning ] [ Inverse Reinforcement Learning ] [ Inverted Index ] [ irl ] [ IRM ] [ irregularly spaced time series ] [ irregular-observed data modelling ] [ isometric ] [ Isotropy ] [ iterated learning ] [ iterative training ] [ JEM ] [ Johnson-Lindenstrauss Transforms ] [ kernel ] [ Kernel Learning ] [ kernel method ] [ kernel-ridge regression ] [ kernels ] [ keypoint localization ] [ Knowledge distillation ] [ Knowledge Distillation ] [ Knowledge factorization ] [ Knowledge Graph Reasoning ] [ knowledge uncertainty ] [ Kullback-Leibler divergence ] [ Kurdyka-Łojasiewicz geometry ] [ label noise robustness ] [ Label Representation ] [ Label shift ] [ label smoothing ] [ Langevin dynamics ] [ Langevin sampling ] [ Language Grounding ] [ Language Model ] [ Language modeling ] [ Language Modeling ] [ Language Modelling ] [ Language Model Pre-training ] [ language processing ] [ language-specific modeling ] [ Laplace kernel ] [ Large-scale ] [ Large-scale Deep Learning ] [ large scale learning ] [ Large-scale Machine Learning ] [ large-scale pre-trained language models ] [ large-scale training ] [ large vocabularies ] [ Last-iterate Convergence ] [ Latency-aware Neural Architecture Search ] [ Latent Simplex ] [ latent space of GANs ] [ Latent Variable Models ] [ lattices ] [ Layer order ] [ layerwise sparsity ] [ learnable ] [ learned algorithms ] [ Learned compression ] [ learned ISTA ] [ Learning ] [ learning action representations ] [ learning-based ] [ learning dynamics ] [ Learning Dynamics ] [ Learning in Games ] [ learning mechanisms ] [ Learning physical laws ] [ Learning Theory ] [ Learning to Hash ] [ learning to optimize ] [ Learning to Optimize ] [ learning to rank ] [ Learning to Rank ] [ learning to teach ] [ learning with noisy labels ] [ Learning with noisy labels ] [ library ] [ lifelong ] [ Lifelong learning ] [ Lifelong Learning ] [ lifted inference ] [ likelihood-based models ] [ likelihood-free inference ] [ limitations ] [ limited data ] [ linear bandits ] [ Linear Convergence ] [ linear estimator ] [ Linear Regression ] [ linear terms ] [ linformer ] [ Lipschitz constants ] [ Lipschitz constrained networks ] [ Local Explanations ] [ locality sensitive hashing ] [ Locally supervised training ] [ local Rademacher complexity ] [ log-concavity ] [ Logic ] [ Logic Rules ] [ logsignature ] [ Long-Tailed Recognition ] [ long-tail learning ] [ Long-term dependencies ] [ long-term prediction ] [ long-term stability ] [ loss correction ] [ Loss function search ] [ Loss Function Search ] [ lossless source compression ] [ Lottery Ticket ] [ Lottery Ticket Hypothesis ] [ lottery tickets ] [ low-dimensional structure ] [ lower bound ] [ lower bounds ] [ Low-latency ASR ] [ low precision training ] [ low rank ] [ low-rank approximation ] [ low-rank tensors ] [ L-smoothness ] [ LSTM ] [ Lyapunov Chaos ] [ Machine learning ] [ Machine Learning ] [ machine learning for code ] [ Machine Learning for Robotics ] [ Machine Learning (ML) for Programming Languages (PL)/Software Engineering (SE) ] [ machine learning systems ] [ Machine translation ] [ Machine Translation ] [ magnitude-based pruning ] [ Manifold clustering ] [ Manifolds ] [ Many-task ] [ mapping ] [ Markov chain Monte Carlo ] [ Markov Chain Monte Carlo ] [ Markov jump process ] [ Masked Reconstruction ] [ mathematical reasoning ] [ Matrix and Tensor Factorization ] [ matrix completion ] [ matrix decomposition ] [ Matrix Factorization ] [ max-margin ] [ MCMC ] [ MCMC sampling ] [ mean estimation ] [ mean-field dynamics ] [ mean separation ] [ Mechanism Design ] [ medical time series ] [ mel-filterbanks ] [ memorization ] [ Memorization ] [ Memory ] [ memory efficient ] [ memory efficient training ] [ Memory Mapping ] [ memory optimized training ] [ Memory-saving ] [ mesh ] [ Message Passing ] [ Message Passing GNNs ] [ meta-gradients ] [ Meta-learning ] [ Meta Learning ] [ Meta-Learning ] [ Metric Surrogate ] [ minimax optimal rate ] [ Minimax Optimization ] [ minimax risk ] [ Minmax ] [ min-max optimization ] [ mirror-prox ] [ Missing Data Inference ] [ Missing value imputation ] [ Missing Values ] [ misssing data ] [ mixed precision ] [ Mixed Precision ] [ Mixed-precision quantization ] [ mixture density nets ] [ mixture of experts ] [ mixup ] [ Mixup ] [ MixUp ] [ MLaaS ] [ MoCo ] [ Model Attribution ] [ model-based control ] [ model-based learning ] [ Model-based Reinforcement Learning ] [ Model-Based Reinforcement Learning ] [ model-based RL ] [ Model-based RL ] [ Model Biases ] [ Model compression ] [ model extraction ] [ model fairness ] [ Model Inversion ] [ model order reduction ] [ model ownership ] [ model predictive control ] [ model-predictive control ] [ Model Predictive Control ] [ Model privacy ] [ Models for code ] [ models of learning and generalization ] [ Model stealing ] [ Modern Hopfield Network ] [ modern Hopfield networks ] [ modified equation analysis ] [ modular architectures ] [ Modular network ] [ modular networks ] [ modular neural networks ] [ modular representations ] [ modulated convolution ] [ Molecular conformation generation ] [ molecular design ] [ Molecular Dynamics ] [ molecular graph generation ] [ Molecular Representation ] [ Molecule Design ] [ Momentum ] [ momentum methods ] [ momentum optimizer ] [ monotonicity ] [ Monte Carlo ] [ Monte-Carlo tree search ] [ Monte Carlo Tree Search ] [ morphology ] [ Morse theory ] [ mpc ] [ Multi-agent ] [ Multi-agent games ] [ Multiagent Learning ] [ multi-agent platform ] [ Multi-Agent Policy Gradients ] [ Multi-agent reinforcement learning ] [ Multi-agent Reinforcement Learning ] [ Multi-Agent Reinforcement Learning ] [ Multi-Agent Transfer Learning ] [ multiclass classification ] [ multi-dimensional discrete action spaces ] [ Multi-domain ] [ multi-domain disentanglement ] [ multi-head attention ] [ Multi-Hop ] [ multi-hop question answering ] [ Multi-hop Reasoning ] [ Multilingual Modeling ] [ multilingual representations ] [ multilingual transformer ] [ multilingual translation ] [ Multimodal ] [ Multi-Modal ] [ Multimodal Attention ] [ multi-modal learning ] [ Multimodal Learning ] [ Multi-Modal Learning ] [ Multimodal Spaces ] [ Multi-objective optimization ] [ multi-player ] [ Multiplicative Weights Update ] [ Multi-scale Representation ] [ multitask ] [ Multi-task ] [ Multi-task Learning ] [ Multi Task Learning ] [ Multi-Task Learning ] [ multi-task learning theory ] [ Multitask Reinforcement Learning ] [ Multi-view Learning ] [ Multi-View Learning ] [ Multi-view Representation Learning ] [ Mutual Information ] [ MuZero ] [ Named Entity Recognition ] [ NAS ] [ nash ] [ natural gradient descent ] [ Natural Language Processing ] [ natural scene statistics ] [ natural sparsity ] [ Negative Sampling ] [ negotiation ] [ nested optimization ] [ network architecture ] [ Network Architecture ] [ Network Inductive Bias ] [ network motif ] [ Network pruning ] [ Network Pruning ] [ networks ] [ network trainability ] [ network width ] [ Neural Architecture Search ] [ Neural Attention Distillation ] [ neural collapse ] [ Neural data compression ] [ Neural IR ] [ neural kernels ] [ neural link prediction ] [ Neural Model Explanation ] [ neural module network ] [ Neural Network ] [ Neural Network Bounding ] [ neural network calibration ] [ Neural Network Gaussian Process ] [ neural network robustness ] [ Neural networks ] [ Neural Networks ] [ neural network training ] [ Neural Network Verification ] [ neural ode ] [ Neural ODE ] [ Neural ODEs ] [ Neural operators ] [ Neural Physics Engines ] [ Neural Processes ] [ neural reconstruction ] [ neural sound synthesis ] [ neural spike train ] [ neural symbolic reasoning ] [ neural tangent kernel ] [ Neural tangent kernel ] [ Neural Tangent Kernel ] [ neural tangent kernels ] [ Neural text decoding ] [ neurobiology ] [ Neuroevolution ] [ Neuro symbolic ] [ Neuro-Symbolic Learning ] [ neuro-symbolic models ] [ NLI ] [ NLP ] [ Node Embeddings ] [ noise contrastive estimation ] [ Noise-contrastive learning ] [ Noise model ] [ noise robust learning ] [ Noisy Demonstrations ] [ noisy label ] [ Noisy Label ] [ Noisy Labels ] [ Non-asymptotic Confidence Intervals ] [ non-autoregressive generation ] [ nonconvex ] [ non-convex learning ] [ Non-Convex Optimization ] [ Non-IID ] [ nonlinear control theory ] [ nonlinear dynamical systems ] [ nonlinear Hawkes process ] [ nonlinear walk ] [ Non-Local Modules ] [ non-minimax optimization ] [ nonnegative PCA ] [ nonseparable Hailtonian system ] [ non-smooth models ] [ non-stationary stochastic processes ] [ no-regret learning ] [ normalized maximum likelihood ] [ normalize layer ] [ normalizers ] [ Normalizing Flow ] [ normalizing flows ] [ Normalizing flows ] [ Normalizing Flows ] [ normative models ] [ novelty-detection ] [ ntk ] [ number of linear regions ] [ numerical errors ] [ numerical linear algebra ] [ object-centric representations ] [ Object detection ] [ Object Detection ] [ object-keypoint representations ] [ ObjectNet ] [ Object Permanence ] [ Observational Imitation ] [ ODE ] [ offline ] [ offline/batch reinforcement learning ] [ off-line reinforcement learning ] [ offline reinforcement learning ] [ Offline Reinforcement Learning ] [ offline RL ] [ off-policy evaluation ] [ Off Policy Evaluation ] [ Off-policy policy evaluation ] [ Off-Policy Reinforcement Learning ] [ off-policy RL ] [ one-class-classification ] [ one-to-many mapping ] [ Open-domain ] [ open domain complex question answering ] [ open source ] [ Optimal Control Theory ] [ optimal convergence ] [ optimal power flow ] [ Optimal Transport ] [ optimal transport maps ] [ Optimisation for Deep Learning ] [ optimism ] [ Optimistic Gradient Descent Ascent ] [ Optimistic Mirror Decent ] [ Optimistic Multiplicative Weights Update ] [ Optimization ] [ order learning ] [ ordinary differential equation ] [ orthogonal ] [ orthogonal layers ] [ orthogonal machine learning ] [ Orthogonal Polynomials ] [ Oscillators ] [ outlier detection ] [ outlier-detection ] [ Outlier detection ] [ out-of-distribution ] [ Out-of-distribution detection in deep learning ] [ out-of-distribution generalization ] [ Out-of-domain ] [ over-fitting ] [ Overfitting ] [ overparameterisation ] [ over-parameterization ] [ Over-parameterization ] [ Overparameterization ] [ overparameterized neural networks ] [ Over-smoothing ] [ Oversmoothing ] [ over-squashing ] [ PAC Bayes ] [ padding ] [ parallel Monte Carlo Tree Search (MCTS) ] [ parallel tempering ] [ Parameter-Reduced MLR ] [ part-based ] [ Partial Amortization ] [ Partial differential equation ] [ partial differential equations ] [ partially observed environments ] [ particle inference ] [ pca ] [ pde ] [ pdes ] [ PDEs ] [ performer ] [ persistence diagrams ] [ personalized learning ] [ perturbation sets ] [ Peter-Weyl Theorem ] [ phase retrieval ] [ Physical parameter estimation ] [ physical reasoning ] [ physical scene understanding ] [ Physical Simulation ] [ physical symbol grounding ] [ physics ] [ physics-guided deep learning ] [ piecewise linear function ] [ pipeline toolkit ] [ plan-based reward shaping ] [ Planning ] [ Poincaré Ball Model ] [ Point cloud ] [ Point clouds ] [ point processes ] [ pointwise mutual information ] [ poisoning ] [ poisoning attack ] [ poisson matrix factorization ] [ policy learning ] [ Policy Optimization ] [ polynomial time ] [ Pose Estimation ] [ Position Embedding ] [ Position Encoding ] [ post-hoc calibration ] [ Post-Hoc Correction ] [ Post Training Quantization ] [ power grid management ] [ Predictive Modeling ] [ predictive uncertainty ] [ Predictive Uncertainty Estimation ] [ pretrained language model ] [ pretrained language model. ] [ pre-trained language model fine-tuning ] [ Pretrained Language Models ] [ Pretrained Text Encoders ] [ pre-training ] [ Pre-training ] [ Primitive Discovery ] [ principal components analysis ] [ Privacy ] [ privacy leakage from gradients ] [ privacy preserving machine learning ] [ Privacy-utility tradeoff ] [ probabelistic models ] [ probabilistic generative models ] [ probabilistic inference ] [ probabilistic matrix factorization ] [ Probabilistic Methods ] [ probabilistic multivariate forecasting ] [ probabilistic numerics ] [ probabilistic programs ] [ probably approximated correct guarantee ] [ Probe ] [ probing ] [ procedural generation ] [ procedural knowledge ] [ product of experts ] [ Product Quantization ] [ Program obfuscation ] [ Program Synthesis ] [ Proper Scoring Rules ] [ protein ] [ prototype propagation ] [ Provable Robustness ] [ provable sample efficiency ] [ proximal gradient descent-ascent ] [ proxy ] [ Pruning ] [ Pruning at initialization ] [ pseudo-labeling ] [ Pseudo-Labeling ] [ QA ] [ Q-learning ] [ Quantization ] [ quantum machine learning ] [ quantum mechanics ] [ Quantum Mechanics ] [ Question Answering ] [ random ] [ Random Feature ] [ Random Features ] [ Randomized Algorithms ] [ Random Matrix Theory ] [ Random Weights Neural Networks ] [ rank-collapse ] [ rank-constrained convex optimization ] [ rao ] [ rao-blackwell ] [ Rate-distortion optimization ] [ raven's progressive matrices ] [ real time recurrent learning ] [ real-world ] [ Real-world image denoising ] [ reasoning paths ] [ recommendation systems ] [ recommender system ] [ Recommender Systems ] [ recovery likelihood ] [ rectified linear unit ] [ Recurrent Generative Model ] [ Recurrent Neural Network ] [ Recurrent neural networks ] [ Recurrent Neural Networks ] [ recursive dense retrieval ] [ reformer ] [ regime agnostic methods ] [ Regression ] [ Regression without correspondence ] [ regret analysis ] [ regret minimization ] [ Regularization ] [ Regularization by denoising ] [ regularized markov decision processes ] [ Reinforcement ] [ Reinforcement learning ] [ Reinforcement Learning ] [ Reinforcement Learnings ] [ Reinforcement learning theory ] [ relabelling ] [ Relational regularized autoencoder ] [ Relation Extraction ] [ relaxed regularization ] [ relu network ] [ ReLU networks ] [ Rematerialization ] [ Render-and-Compare ] [ Reparameterization ] [ repetitions ] [ replica exchange ] [ representational learning ] [ representation analysis ] [ Representation learning ] [ Representation Learning ] [ representation learning for computer vision ] [ representation learning for robotics ] [ representation of dynamical systems ] [ Representation Theory ] [ reproducibility ] [ reproducible research ] [ Reproducing kernel Hilbert space ] [ resampling ] [ reset-free ] [ residual ] [ ResNets ] [ resource constrained ] [ Restricted Boltzmann Machines ] [ retraining ] [ Retrieval ] [ reverse accuracy ] [ reverse engineering ] [ reward learning ] [ reward randomization ] [ reward shaping ] [ reweighting ] [ Rich observation ] [ rich observations ] [ risk-averse ] [ Risk bound ] [ Risk Estimation ] [ risk sensitive ] [ rl ] [ RMSprop ] [ RNA-protein interaction prediction ] [ RNA structure ] [ RNA structure embedding ] [ RNN ] [ RNNs ] [ robotic manipulation ] [ robust ] [ robust control ] [ robust deep learning ] [ Robust Deep Learning ] [ robust learning ] [ Robust Learning ] [ Robust Machine Learning ] [ Robustness ] [ Robustness certificates ] [ Robust Overfitting ] [ ROC ] [ Role-Based Learning ] [ rooted graphs ] [ Rotation invariance ] [ rtrl ] [ Runtime Systems ] [ Saddle-point Optimization ] [ safe ] [ Safe exploration ] [ safe planning ] [ Saliency ] [ Saliency Guided Data Augmentation ] [ saliency maps ] [ SaliencyMix ] [ sample complexity separation ] [ Sample Efficiency ] [ sample information ] [ sample reweighting ] [ Sampling ] [ sampling algorithms ] [ Scalability ] [ Scale ] [ scale-invariant weights ] [ Scale of initialization ] [ scene decomposition ] [ scene generation ] [ Scene Understanding ] [ Science ] [ science of deep learning ] [ score-based generative models ] [ score matching ] [ score-matching ] [ SDE ] [ Second-order analysis ] [ second-order approximation ] [ second-order optimization ] [ Security ] [ segmented models ] [ selective classification ] [ Self-Imitation ] [ self supervised learning ] [ Self-supervised learning ] [ Self-supervised Learning ] [ Self Supervised Learning ] [ Self-Supervised Learning ] [ self-supervision ] [ self-training ] [ self-training theory ] [ semantic anomaly detection ] [ semantic directions in latent space ] [ semantic graphs ] [ Semantic Image Synthesis ] [ semantic parsing ] [ semantic role labeling ] [ semantic-segmentation ] [ Semantic Segmentation ] [ Semantic Textual Similarity ] [ semi-infinite duality ] [ semi-nonnegative matrix factorization ] [ semiparametric inference ] [ semi-supervised ] [ Semi-supervised Learning ] [ Semi-Supervised Learning ] [ semi-supervised learning theory ] [ Sentence Embeddings ] [ Sentence Representations ] [ Sentiment ] [ separation of variables ] [ Sequence Data ] [ Sequence Modeling ] [ sequence models ] [ Sequence-to-sequence learning ] [ sequence-to-sequence models ] [ sequential data ] [ Sequential probability ratio test ] [ Sequential Representation Learning ] [ set prediction ] [ set transformer ] [ SGD ] [ SGD noise ] [ sgld ] [ Shape ] [ shape bias ] [ Shape Bias ] [ Shape Encoding ] [ shapes ] [ Shapley values ] [ Sharpness Minimization ] [ side channel analysis ] [ Sigma Delta Quantization ] [ sign agnostic learning ] [ signal propagation ] [ signature ] [ sim2real ] [ sim2real transfer ] [ simple ] [ Singularity analysis ] [ singular value decomposition ] [ Sinkhorn algorithm ] [ skeleton-based action recognition ] [ sketch-based modeling ] [ sketches ] [ Skill Discovery ] [ SLAM ] [ sliced fused Gromov Wasserstein ] [ Sliced Wasserstein ] [ Slowdown attacks ] [ slowness ] [ Smooth games ] [ smoothing ] [ SMT Solvers ] [ social perception ] [ Soft Body ] [ soft labels ] [ software ] [ sound classification ] [ sound spatialization ] [ Source Code ] [ sparse Bayesian learning ] [ Sparse Embedding ] [ sparse embeddings ] [ sparse reconstruction ] [ sparse representation ] [ sparse representations ] [ sparse stochastic gates ] [ Sparsity ] [ Sparsity Learning ] [ spatial awareness ] [ spatial bias ] [ spatial uncertainty ] [ spatio-temporal forecasting ] [ spatio-temporal graph ] [ spatio-temporal modeling ] [ spatio-temporal modelling ] [ spatiotemporal prediction ] [ Spatiotemporal Understanding ] [ Spectral Analysis ] [ Spectral Distribution ] [ Spectral Graph Filter ] [ spectral regularization ] [ speech generation ] [ speech-impaired ] [ speech processing ] [ speech recognition. ] [ Speech Recognition ] [ spherical distributions ] [ spiking neural network ] [ spurious correlations ] [ square loss vs cross-entropy ] [ stability theory ] [ State abstraction ] [ state abstractions ] [ state-space models ] [ statistical learning theory ] [ Statistical Learning Theory ] [ statistical physics ] [ Statistical Physics ] [ statistical physics methods ] [ Steerable Kernel ] [ Stepsize optimization ] [ stochastic asymptotics ] [ stochastic control ] [ (stochastic) gradient descent ] [ Stochastic Gradient Descent ] [ stochastic gradient Langevin dynamics ] [ stochastic process ] [ Stochastic Processes ] [ stochastic subgradient method ] [ Storage Capacity ] [ straight-through ] [ straightthrough ] [ strategic behavior ] [ Streaming ASR ] [ structural biology ] [ structural credit assignment ] [ structural inductive bias ] [ Structured Pruning ] [ Structure learning ] [ structure prediction ] [ structures prediction ] [ Style Mixing ] [ Style Transfer ] [ subgraph reasoning. ] [ sublinear ] [ submodular optimization ] [ Subspace clustering ] [ Summarization ] [ summary statistics ] [ superpixel ] [ supervised contrastive learning ] [ Supervised Deep Networks ] [ Supervised Learning ] [ support estimation ] [ surprisal ] [ surrogate models ] [ svd ] [ SVD ] [ Symbolic Methods ] [ symbolic regression ] [ symbolic representations ] [ Symmetry ] [ symplectic networks ] [ Syntax ] [ Synthetic benchmark dataset ] [ synthetic-to-real generalization ] [ Systematic generalisation ] [ Systematicity ] [ System identification ] [ Tabular ] [ tabular data ] [ Tabular Data ] [ targeted attack ] [ Task Embeddings ] [ task generation ] [ task-oriented dialogue ] [ Task-oriented Dialogue System ] [ task reduction ] [ Task Segmentation ] [ Teacher-Student Learning ] [ teacher-student model ] [ temporal context ] [ Temporal knowledge graph ] [ temporal networks ] [ tensor product ] [ Text-based Games ] [ Text Representation ] [ Text Retrieval ] [ Text to speech ] [ Text to speech synthesis ] [ text-to-sql ] [ Texture ] [ Texture Bias ] [ Textworld ] [ Theorem proving ] [ theoretical issues in deep learning ] [ theoretical limits ] [ theoretical study ] [ Theory ] [ Theory of deep learning ] [ theory of mind ] [ Third-Person Imitation ] [ Thompson sampling ] [ time-frequency representations ] [ timescale ] [ timescales ] [ Time Series ] [ Time series forecasting ] [ time series prediction ] [ topic modelling ] [ Topology ] [ training dynamics ] [ Training Method ] [ trajectory ] [ trajectory optimization ] [ trajectory prediction ] [ Transferability ] [ Transfer learning ] [ Transfer Learning ] [ transformation invariance ] [ Transformer ] [ Transformers ] [ traveling salesperson problem ] [ Tree-structured Data ] [ trembl ] [ tropical function ] [ trust region ] [ two-layer neural network ] [ Uncertainty ] [ uncertainty calibration ] [ Uncertainty estimates ] [ Uncertainty estimation ] [ Uncertainty Machine Learning ] [ understanding ] [ understanding CNNs ] [ Understanding Data Augmentation ] [ understanding decision-making ] [ understanding deep learning ] [ Understanding Deep Learning ] [ understanding neural networks ] [ U-Net ] [ unidirectional ] [ uniprot ] [ universal approximation ] [ Universal approximation ] [ Universality ] [ universal representation learning ] [ universal sound separation ] [ unlabeled data ] [ Unlabeled Entity Problem ] [ Unlearnable Examples ] [ unrolled algorithms ] [ Unsupervised denoising ] [ Unsupervised Domain Translation ] [ unsupervised image denoising ] [ Unsupervised learning ] [ Unsupervised Learning ] [ unsupervised learning theory ] [ unsupervised loss ] [ Unsupervised Meta-learning ] [ unsupervised object discovery ] [ Unsupervised reinforcement learning ] [ unsupervised skill discovery ] [ unsupervised stabilization ] [ Upper Confidence bound applied to Trees (UCT) ] [ Usable Information ] [ VAE ] [ Value factorization ] [ value learning ] [ vanishing gradient problem ] [ variable binding ] [ variable convergence ] [ Variable Embeddings ] [ Variance Networks ] [ Variational Auto-encoder ] [ Variational autoencoders ] [ Variational Autoencoders ] [ Variational inference ] [ variational information bottleneck ] [ Verification ] [ video analysis ] [ Video Classification ] [ Video Compression ] [ video generation ] [ video-grounded dialogues ] [ Video prediction ] [ Video Reasoning ] [ video recognition ] [ Video Recognition ] [ video representation learning ] [ video synthesis ] [ video-text learning ] [ views ] [ virtual environment ] [ vision-and-language-navigation ] [ visual counting ] [ visualization ] [ visual perception ] [ Visual Reasoning ] [ visual reinforcement learning ] [ visual representation learning ] [ visual saliency ] [ vocoder ] [ voice conversion ] [ Volume Analysis ] [ VQA ] [ vulnerability of RL ] [ wanet ] [ warping functions ] [ Wasserstein ] [ wasserstein-2 barycenters ] [ wasserstein-2 distance ] [ Wasserstein distance ] [ waveform generation ] [ weakly-supervised learning ] [ weakly supervised representation learning ] [ Weak supervision ] [ Weak-supervision ] [ webly-supervised learning ] [ weight attack ] [ weight balance ] [ Weight quantization ] [ weight-sharing ] [ wide local minima ] [ Wigner-Eckart Theorem ] [ winning tickets ] [ wireframe model ] [ word-learning ] [ world models ] [ World Models ] [ worst-case generalisation ] [ xai ] [ XAI ] [ zero-order optimization ] [ zero-shot learning ] [ Zero-shot learning ] [ Zero-shot Learning ] [ Zero-shot synthesis ]

204 Results

Poster
Mon 1:00 Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Xinyue Chen, Che Wang, Zijian Zhou, Keith Ross
Poster
Mon 1:00 Isometric Transformation Invariant and Equivariant Graph Convolutional Networks
Masanobu Horie, Naoki Morita, Toshiaki Hishinuma, Yu Ihara, Naoto Mitsume
Poster
Mon 1:00 Batch Reinforcement Learning Through Continuation Method
Yijie Guo, Shengyu Feng, Nicolas Le Roux, Ed H. Chi, Honglak Lee, Minmin Chen
Poster
Mon 1:00 On the Transfer of Disentangled Representations in Realistic Settings
Andrea Dittadi, Frederik Träuble, Francesco Locatello, Manuel Wuthrich, Vaibhav Agrawal, Ole Winther, Stefan Bauer, Bernhard Schoelkopf
Poster
Mon 1:00 Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets
Hayeon Lee, Eunyoung Hyung, Sung Ju Hwang
Poster
Mon 1:00 Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs
Cheng Wang, Carolin Lawrence, Mathias Niepert
Poster
Mon 1:00 Signatory: differentiable computations of the signature and logsignature transforms, on both CPU and GPU
Patrick Kidger, Terry Lyons
Poster
Mon 1:00 Trusted Multi-View Classification
Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou
Poster
Mon 1:00 MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space
Tsz Him Cheung, Dit-Yan Yeung
Poster
Mon 1:00 Wasserstein Embedding for Graph Learning
Soheil Kolouri, Navid Naderializadeh, Gustavo K Rohde, Heiko Hoffmann
Oral
Mon 3:00 Dataset Condensation with Gradient Matching
Bo ZHAO, Konda Reddy Mopuri, Hakan Bilen
Oral
Mon 5:00 Geometry-aware Instance-reweighted Adversarial Training
Jingfeng Zhang, Jianing ZHU, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli
Mon 6:00 WiML@ICLR 2021 Virtual Panel
Invited Talk
Mon 8:00 Moving beyond the fairness rhetoric in machine learning
Timnit Gebru
Poster
Mon 9:00 Understanding the failure modes of out-of-distribution generalization
Vaishnavh Nagarajan, Anders J Andreassen, Behnam Neyshabur
Poster
Mon 9:00 LambdaNetworks: Modeling long-range Interactions without Attention
Irwan Bello
Poster
Mon 9:00 Multi-Level Local SGD: Distributed SGD for Heterogeneous Hierarchical Networks
Timothy Castiglia, Anirban Das, Stacy Patterson
Poster
Mon 9:00 Fast convergence of stochastic subgradient method under interpolation
Huang Fang, Zhenan Fan, Michael Friedlander
Poster
Mon 9:00 Shapley Explanation Networks
Rui Wang, Xiaoqian Wang, David Inouye
Poster
Mon 9:00 Learning Hyperbolic Representations of Topological Features
Panagiotis Kyriakis, Iordanis Fostiropoulos, Paul Bogdan
Poster
Mon 9:00 Single-Photon Image Classification
Thomas Fischbacher, Luciano Sbaiz
Poster
Mon 9:00 The Traveling Observer Model: Multi-task Learning Through Spatial Variable Embeddings
Elliot Meyerson, Risto Miikkulainen
Poster
Mon 9:00 Language-Agnostic Representation Learning of Source Code from Structure and Context
Daniel Zügner, Tobias Kirschstein, Michele Catasta, Jure Leskovec, Stephan Günnemann
Mon 9:00 Philosophy and AGI (#1)
Poster
Mon 9:00 Adaptive Federated Optimization
Sashank Reddi, Zachary Charles, Manzil Zaheer, Zachary Garrett, Keith Rush, Jakub Konečný, Sanjiv Kumar, Brendan McMahan
Spotlight
Mon 11:45 Geometry-Aware Gradient Algorithms for Neural Architecture Search
Liam Li, Misha Khodak, Nina Balcan, Ameet Talwalkar
Poster
Mon 17:00 Unlearnable Examples: Making Personal Data Unexploitable
Hanxun Huang, Daniel Ma, Sarah Erfani, James Bailey, Yisen Wang
Poster
Mon 17:00 SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness
Mikhail Yurochkin, Yuekai Sun
Poster
Mon 17:00 Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine
Poster
Mon 17:00 Deep Partition Aggregation: Provable Defenses against General Poisoning Attacks
Alexander Levine, Soheil Feizi
Poster
Mon 17:00 Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
Karan Goel, Albert Gu, Yixuan Li, Christopher Re
Poster
Mon 17:00 PseudoSeg: Designing Pseudo Labels for Semantic Segmentation
Yuliang Zou, Zizhao Zhang, Han Zhang, Chun-Liang Li, Xiao Bian, Jia-Bin Huang, Tomas Pfister
Poster
Mon 17:00 Explaining the Efficacy of Counterfactually Augmented Data
Divyansh Kaushik, Amrith Setlur, Eduard H Hovy, Zachary Lipton
Poster
Mon 17:00 Undistillable: Making A Nasty Teacher That CANNOT teach students
Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Chenyu You, Xiaohui Xie, Zhangyang Wang
Poster
Mon 17:00 On Dyadic Fairness: Exploring and Mitigating Bias in Graph Connections
Peizhao Li, Yifei Wang, Han Zhao, Pengyu Hong, Hongfu Liu
Poster
Mon 17:00 Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients
Jing An, Lexing Ying, Yuhua Zhu
Poster
Mon 17:00 MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training
Beidi Chen, Zichang Liu, Binghui Peng, Zhaozhuo Xu, Jonathan L Li, Tri Dao, Zhao Song, Anshumali Shrivastava, Christopher Re
Oral
Mon 19:30 Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine
Spotlight
Mon 20:38 Information Laundering for Model Privacy
Xinran Wang, Yu Xiang, Jun Gao, Jie Ding
Spotlight
Mon 20:48 Dataset Inference: Ownership Resolution in Machine Learning
Pratyush Maini, Mohammad Yaghini, Nicolas Papernot
Spotlight
Mon 21:46 The Traveling Observer Model: Multi-task Learning Through Spatial Variable Embeddings
Elliot Meyerson, Risto Miikkulainen
Poster
Tue 1:00 Conformation-Guided Molecular Representation with Hamiltonian Neural Networks
Ziyao Li, Shuwen Yang, Guojie Song, Lingsheng Cai
Poster
Tue 1:00 Calibration tests beyond classification
David Widmann, Fredrik Lindsten, Dave Zachariah
Poster
Tue 1:00 PDE-Driven Spatiotemporal Disentanglement
Jérémie DONA, Jean-Yves Franceschi, sylvain lamprier, patrick gallinari
Poster
Tue 1:00 Generalized Multimodal ELBO
Thomas Sutter, Imant Daunhawer, Julia E Vogt
Poster
Tue 1:00 Bayesian Context Aggregation for Neural Processes
Michael Volpp, Fabian Flürenbrock, Lukas Grossberger, Christian Daniel, Gerhard Neumann
Poster
Tue 1:00 Learning the Pareto Front with Hypernetworks
Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik
Poster
Tue 1:00 Hyperbolic Neural Networks++
Ryohei Shimizu, YUSUKE Mukuta, Tatsuya Harada
Oral
Tue 4:23 Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes
Mike Gartrell, Insu Han, Elvis Dohmatob, Jennifer Gillenwater, Victor-Emmanuel Brunel
Tue 6:00 Lapsed Physicists Wine-and-Cheese (#1)
Poster
Tue 9:00 Direction Matters: On the Implicit Bias of Stochastic Gradient Descent with Moderate Learning Rate
Jingfeng Wu, Difan Zou, vladimir braverman, Quanquan Gu
Poster
Tue 9:00 Robust Pruning at Initialization
Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh
Poster
Tue 9:00 Statistical inference for individual fairness
Subha Maity, Songkai Xue, Mikhail Yurochkin, Yuekai Sun
Poster
Tue 9:00 Getting a CLUE: A Method for Explaining Uncertainty Estimates
Javier Antorán, Umang Bhatt, Tameem Adel, Adrian Weller, José Miguel Hernández Lobato
Poster
Tue 9:00 Learning Parametrised Graph Shift Operators
George Dasoulas, Johannes Lutzeyer, Michalis Vazirgiannis
Poster
Tue 9:00 Sharper Generalization Bounds for Learning with Gradient-dominated Objective Functions
Yunwen Lei, Yiming Ying
Poster
Tue 9:00 FairBatch: Batch Selection for Model Fairness
Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh
Poster
Tue 9:00 Learning from Protein Structure with Geometric Vector Perceptrons
Bowen Jing, Stephan Eismann, Patricia Suriana, Raphael J Townshend, Ron Dror
Poster
Tue 9:00 Large Associative Memory Problem in Neurobiology and Machine Learning
Dmitry Krotov, John J Hopfield
Poster
Tue 9:00 On the mapping between Hopfield networks and Restricted Boltzmann Machines
Matthew Smart, Anton Zilman
Poster
Tue 9:00 Learning a Latent Search Space for Routing Problems using Variational Autoencoders
André Hottung, Bhanu Bhandari, Kevin Tierney
Oral
Tue 11:00 Iterated learning for emergent systematicity in VQA
Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville
Oral
Tue 12:00 Randomized Automatic Differentiation
Deniz Oktay, Nick McGreivy, Joshua Aduol, Alex Beatson, Ryan P Adams
Spotlight
Tue 12:50 Learning from Protein Structure with Geometric Vector Perceptrons
Bowen Jing, Stephan Eismann, Patricia Suriana, Raphael J Townshend, Ron Dror
Oral
Tue 13:13 On the mapping between Hopfield networks and Restricted Boltzmann Machines
Matthew Smart, Anton Zilman
Expo Talk Panel
Tue 14:00 Interpretability with skeptical and user-centric mind
Been Kim
Poster
Tue 17:00 Dataset Inference: Ownership Resolution in Machine Learning
Pratyush Maini, Mohammad Yaghini, Nicolas Papernot
Poster
Tue 17:00 Information Laundering for Model Privacy
Xinran Wang, Yu Xiang, Jun Gao, Jie Ding
Poster
Tue 17:00 Achieving Linear Speedup with Partial Worker Participation in Non-IID Federated Learning
Haibo Yang, Minghong Fang, Jia Liu
Poster
Tue 17:00 Monotonic Kronecker-Factored Lattice
William Bakst, Nobuyuki Morioka, Erez Louidor
Poster
Tue 17:00 Knowledge Distillation as Semiparametric Inference
Tri Dao, Govinda Kamath, Vasilis Syrgkanis, Lester Mackey
Poster
Tue 17:00 Generating Adversarial Computer Programs using Optimized Obfuscations
Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, Una-May O'Reilly
Poster
Tue 17:00 Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees?
Zhen Qin, Le Yan, Honglei Zhuang, Yi Tay, Rama Kumar Pasumarthi, Xuanhui Wang, Michael Bendersky, Marc Najork
Poster
Tue 17:00 RMSprop converges with proper hyper-parameter
Naichen Shi, Dawei Li, Mingyi Hong, Ruoyu Sun
Oral
Tue 19:00 Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden Petersen, Mikel Landajuela Larma, Terrell N Mundhenk, Claudio Santiago, Soo Kim, Joanne Kim
Spotlight
Tue 20:30 Individually Fair Gradient Boosting
Alexander Vargo, Fan Zhang, Mikhail Yurochkin, Yuekai Sun
Spotlight
Tue 20:40 Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees?
Zhen Qin, Le Yan, Honglei Zhuang, Yi Tay, Rama Kumar Pasumarthi, Xuanhui Wang, Michael Bendersky, Marc Najork
Oral
Tue 21:18 MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training
Beidi Chen, Zichang Liu, Binghui Peng, Zhaozhuo Xu, Jonathan L Li, Tri Dao, Zhao Song, Anshumali Shrivastava, Christopher Re
Invited Talk
Wed 0:00 Perceiving the 3D World from Images and Video
Lourdes Agapito
Poster
Wed 1:00 Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning
Ruozi Huang, Huang Hu, Wei Wu, Kei Sawada, Mi Zhang, Daxin Jiang
Poster
Wed 1:00 New Bounds For Distributed Mean Estimation and Variance Reduction
Peter Davies, Vijaykrishna Gurunathan, Niusha Moshrefi, Saleh Ashkboos, Dan Alistarh
Poster
Wed 1:00 Deep Neural Network Fingerprinting by Conferrable Adversarial Examples
Nils Lukas, Yuxuan Zhang, Florian Kerschbaum
Poster
Wed 1:00 Byzantine-Resilient Non-Convex Stochastic Gradient Descent
Zeyuan Allen-Zhu, Faeze Ebrahimianghazani, Jerry Li, Dan Alistarh
Poster
Wed 1:00 Geometry-aware Instance-reweighted Adversarial Training
Jingfeng Zhang, Jianing ZHU, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli
Poster
Wed 1:00 A Better Alternative to Error Feedback for Communication-Efficient Distributed Learning
Samuel Horváth, Peter Richtarik
Oral
Wed 4:05 Getting a CLUE: A Method for Explaining Uncertainty Estimates
Javier Antorán, Umang Bhatt, Tameem Adel, Adrian Weller, José Miguel Hernández Lobato
Spotlight
Wed 4:20 Influence Estimation for Generative Adversarial Networks
Naoyuki Terashita, Hiroki Ohashi, Yuichi Nonaka, Takashi Kanemaru
Spotlight
Wed 4:40 Deep Neural Network Fingerprinting by Conferrable Adversarial Examples
Nils Lukas, Yuxuan Zhang, Florian Kerschbaum
Invited Talk
Wed 8:00 Is My Dataset Biased?
Kate Saenko
Poster
Wed 9:00 Variational State-Space Models for Localisation and Dense 3D Mapping in 6 DoF
Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt, Justin Bayer
Poster
Wed 9:00 Iterated learning for emergent systematicity in VQA
Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville
Poster
Wed 9:00 Boost then Convolve: Gradient Boosting Meets Graph Neural Networks
Sergei Ivanov, Liudmila Prokhorenkova
Poster
Wed 9:00 IsarStep: a Benchmark for High-level Mathematical Reasoning
Wenda Li, Lei Yu, Yuhuai Wu, Lawrence Paulson
Poster
Wed 9:00 Geometry-Aware Gradient Algorithms for Neural Architecture Search
Liam Li, Misha Khodak, Nina Balcan, Ameet Talwalkar
Poster
Wed 9:00 More or Less: When and How to Build Convolutional Neural Network Ensembles
Abdul Wasay, Stratos Idreos
Poster
Wed 9:00 Modeling the Second Player in Distributionally Robust Optimization
Paul Michel, Tatsunori Hashimoto, Graham Neubig
Poster
Wed 9:00 Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden Petersen, Mikel Landajuela Larma, Terrell N Mundhenk, Claudio Santiago, Soo Kim, Joanne Kim
Poster
Wed 9:00 Few-Shot Bayesian Optimization with Deep Kernel Surrogates
Martin Wistuba, Josif Grabocka
Poster
Wed 9:00 Theoretical bounds on estimation error for meta-learning
James Lucas, Mengye Ren, Irene Raissa KAMENI KAMENI, Toniann Pitassi, Richard Zemel
Poster
Wed 9:00 Learning Task-General Representations with Generative Neuro-Symbolic Modeling
Reuben Feinman, Brenden Lake
Wed 12:00 Women in Artificial Intelligence & Machine Learning (WinAIML)
Spotlight
Wed 12:48 LambdaNetworks: Modeling long-range Interactions without Attention
Irwan Bello
Spotlight
Wed 13:58 Differentially Private Learning Needs Better Features (or Much More Data)
Florian Tramer, Dan Boneh
Poster
Wed 17:00 Estimating Lipschitz constants of monotone deep equilibrium models
Chirag Pabbaraju, Ezra Winston, Zico Kolter
Poster
Wed 17:00 Protecting DNNs from Theft using an Ensemble of Diverse Models
Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi
Poster
Wed 17:00 AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin, Tianyi Zhou, Liangyu Zhao, Yibo Zhu, Chuanxiong Guo, Marco Canini, Arvind Krishnamurthy
Poster
Wed 17:00 Individually Fair Gradient Boosting
Alexander Vargo, Fan Zhang, Mikhail Yurochkin, Yuekai Sun
Poster
Wed 17:00 Influence Functions in Deep Learning Are Fragile
Samyadeep Basu, Phil Pope, Soheil Feizi
Poster
Wed 17:00 Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
Benedikt Boecking, Willie Neiswanger, Eric P Xing, Artur Dubrawski
Poster
Wed 17:00 Learning and Evaluating Representations for Deep One-Class Classification
Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon, Minho Jin, Tomas Pfister
Poster
Wed 17:00 NBDT: Neural-Backed Decision Tree
Alvin Wan, Lisa Dunlap, Daniel Ho, Jihan Yin, Scott Lee, Suzanne Petryk, Sarah A Bargal, Joseph E Gonzalez
Poster
Wed 17:00 Combining Physics and Machine Learning for Network Flow Estimation
Arlei Lopes da Silva, Furkan Kocayusufoglu, Saber Jafarpour, Francesco Bullo, Ananthram Swami, Ambuj K Singh
Spotlight
Wed 20:40 Undistillable: Making A Nasty Teacher That CANNOT teach students
Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Chenyu You, Xiaohui Xie, Zhangyang Wang
Poster
Thu 1:00 Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models
Justin Bayer, Maximilian Soelch, Atanas Mirchev, Baris Kayalibay, Patrick van der Smagt
Poster
Thu 1:00 Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning
Enrico Marchesini, Davide Corsi, Alessandro Farinelli
Poster
Thu 1:00 Learning continuous-time PDEs from sparse data with graph neural networks
Valerii Iakovlev, Markus Heinonen, Harri Lähdesmäki
Poster
Thu 1:00 GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin, HyoukJoong Lee, Yuanzhong Xu, Dehao Chen, Orhan Firat, Yanping Huang, Maxim Krikun, Noam Shazeer, Zhifeng Chen
Poster
Thu 1:00 Influence Estimation for Generative Adversarial Networks
Naoyuki Terashita, Hiroki Ohashi, Yuichi Nonaka, Takashi Kanemaru
Poster
Thu 1:00 Private Image Reconstruction from System Side Channels Using Generative Models
Yuanyuan Yuan, Shuai Wang, Junping Zhang
Poster
Thu 1:00 AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights
Byeongho Heo, Sanghyuk Chun, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Gyuwan Kim, Youngjung Uh, Jung-Woo Ha
Poster
Thu 1:00 Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning
Da Yu, Huishuai Zhang, Wei Chen, Tie-Yan Liu
Poster
Thu 1:00 Hopfield Networks is All You Need
Hubert Ramsauer, Bernhard Schäfl, Johannes Lehner, Philipp Seidl, Michael Widrich, Lukas Gruber, Markus Holzleitner, Thomas Adler, David Kreil, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter
Poster
Thu 1:00 Learning Neural Generative Dynamics for Molecular Conformation Generation
Minkai Xu, Shitong Luo, Yoshua Bengio, Jian Peng, Jian Tang
Poster
Thu 1:00 Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes
Mike Gartrell, Insu Han, Elvis Dohmatob, Jennifer Gillenwater, Victor-Emmanuel Brunel
Poster
Thu 1:00 Conditional Generative Modeling via Learning the Latent Space
Sameera Ramasinghe, Kanchana Ranasinghe, Salman Khan, Nick Barnes, Stephen Gould
Spotlight
Thu 4:35 Unlearnable Examples: Making Personal Data Unexploitable
Hanxun Huang, Daniel Ma, Sarah Erfani, James Bailey, Yisen Wang
Thu 9:00 Machine Learning for Software Engineering
Poster
Thu 9:00 Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes
Jake Snell, Richard Zemel
Poster
Thu 9:00 CaPC Learning: Confidential and Private Collaborative Learning
Christopher Choquette-Choo, Natalie Dullerud, Adam Dziedzic, Yunxiang Zhang, Somesh Jha, Nicolas Papernot, Xiao Wang
Poster
Thu 9:00 Dataset Meta-Learning from Kernel Ridge-Regression
Timothy Nguyen, Zhourong Chen, Jaehoon Lee
Poster
Thu 9:00 Noise or Signal: The Role of Image Backgrounds in Object Recognition
Kai Xiao, Logan Engstrom, Andrew Ilyas, Aleksander Madry
Poster
Thu 9:00 Private Post-GAN Boosting
Marcel Neunhoeffer, Steven Wu, Cynthia Dwork
Poster
Thu 9:00 Differentially Private Learning Needs Better Features (or Much More Data)
Florian Tramer, Dan Boneh
Poster
Thu 9:00 Domain-Robust Visual Imitation Learning with Mutual Information Constraints
Edoardo Cetin, Oya Celiktutan
Poster
Thu 9:00 Dataset Condensation with Gradient Matching
Bo ZHAO, Konda Reddy Mopuri, Hakan Bilen
Poster
Thu 9:00 Cut out the annotator, keep the cutout: better segmentation with weak supervision
Sarah Hooper, Michael Wornow, Ying Seah, Peter Kellman, Hui Xue, Frederic Sala, Curtis Langlotz, Christopher Re
Oral
Thu 11:15 SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness
Mikhail Yurochkin, Yuekai Sun
Thu 12:00 Philosophy and AGI (#2)
Poster
Thu 17:00 Evaluation of Similarity-based Explanations
Kazuaki Hanawa, Sho Yokoi, Satoshi Hara, Kentaro Inui
Poster
Thu 17:00 Learning perturbation sets for robust machine learning
Eric Wong, Zico Kolter
Poster
Thu 17:00 Estimating and Evaluating Regression Predictive Uncertainty in Deep Object Detectors
Ali Harakeh, Steven L Waslander
Poster
Thu 17:00 Randomized Automatic Differentiation
Deniz Oktay, Nick McGreivy, Joshua Aduol, Alex Beatson, Ryan P Adams
Poster
Thu 17:00 A Learning Theoretic Perspective on Local Explainability
Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb, Ameet Talwalkar
Poster
Thu 17:00 Clustering-friendly Representation Learning via Instance Discrimination and Feature Decorrelation
Yaling Tao, Kentaro Takagi, Kouta Nakata
Poster
Thu 17:00 $i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning
Kibok Lee, Yian Zhu, Kihyuk Sohn, Chun-Liang Li, Jinwoo Shin, Honglak Lee
Poster
Thu 17:00 Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou
Poster
Thu 17:00 Neural representation and generation for RNA secondary structures
Zichao Yan, Will Hamilton, Mathieu Blanchette
Poster
Thu 17:00 Nonseparable Symplectic Neural Networks
Shiying Xiong, Yunjin Tong, Xingzhe He, Shuqi Yang, Cheng Yang, Bo Zhu
Poster
Thu 17:00 HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients
Enmao Diao, Jie Ding, VAHID TAROKH
Poster
Thu 17:00 Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers
Kaidi Xu, Huan Zhang, Shiqi Wang, Yihan Wang, Suman Jana, Xue Lin, Cho-Jui Hsieh
Thu 17:00 Lapsed Physicists Wine-and-Cheese (#2)
Spotlight
Thu 19:55 RMSprop converges with proper hyper-parameter
Naichen Shi, Dawei Li, Mingyi Hong, Ruoyu Sun
Workshop
Fri 2:30 Science and Engineering of Deep Learning
Levent Sagun, Caglar Gulcehre, Adriana Romero, Negar Rostamzadeh, Stefano Sarao Mannelli, Lenka Zdeborova, Samy Bengio
Workshop
Fri 2:45 Ideas for machine learning from psychology's reproducibility crisis
Samuel J Bell
Workshop
Fri 3:30 Neural Compression: From Information Theory to Applications
Stephan Mandt, Robert Bamler, Yingzhen Li, Christopher Schroers, Yang Yang, Max Welling, Taco Cohen
Workshop
Fri 5:00 Keynote 1: Warren Gross. Title: Stochastic Computing for Machine Learning towards an Intelligent Edge
Workshop
Fri 5:15 Beyond Static Papers: Rethinking How We Share Scientific Understanding in ML
Krishna Murthy Jatavallabhula, Bhairav Mehta, Tegan Maharaj, Amy Tabb, Khimya Khetarpal, Aditya Kusupati, Anna Rogers, Sara Hooker, Breandan Considine, Devi Parikh, Derek Nowrouzezahrai, Yoshua Bengio
Workshop
Fri 5:55 AI for Public Health
Bryan Wilder, Ioana Bica, Marie-Laure Charpignon, Emma Pierson
Workshop
Fri 6:00 Workshop on Neural Architecture Search
Arber Zela, Aaron Klein, Frank Hutter, Liam Li, Jan Hendrik Metzen, Jovita Lukasik
Workshop
Fri 6:18 Adversarial Data Augmentation Improves Unsupervised Machine Learning
Chia-Yi Hsu
Workshop
Fri 6:30 Break & Poster session 1
Workshop
Fri 6:45 Responsible AI (RAI)
Ahmad Beirami, Emily Black, Krishna Gummadi, Hoda Heidari, Baharan Mirzasoleiman, Meisam Razaviyayn, Joshua Williams
Workshop
Fri 7:00 2nd Workshop on Practical ML for Developing Countries: Learning Under Limited/low Resource Scenarios
Esube Bekele, Waheeda Saib, Timnit Gebru, Meareg Hailemariam, Vukosi Marivate, Judy Gichoya
Workshop
Fri 7:00 Workshop on Learning to Learn
Sarah Bechtle, Todor Davchev, Yevgen Chebotar, Timothy Hospedales, Franziska Meier
Workshop
Fri 7:00 Workshop on Weakly Supervised Learning
Benjamin Roth, Barbara Plank, Alex Ratner, Katharina Kann, Dietrich Klakow, Michael Hedderich
Workshop
Fri 7:00 Generalization beyond the training distribution in brains and machines
Christina Funke, Judith Borowski, Drew Linsley, Xavier Boix
Workshop
Fri 7:00 Neural Conversational AI: Bridging the Gap Between Research and Real World (NeuCAIR)
Ahmad Beirami, Asli Celikyilmaz, Yun-Nung Chen, Paul Crook, Orianna DeMasi, Stephen Roller, Chinnadhurai Sankar, Joao Sedoc, Zhou Yu
Workshop
Fri 7:00 Interpretable Recommender System With Heterogeneous Information: A Geometric Deep Learning Perspective
Yan Leng
Workshop
Fri 7:00 Synthetic Data Generation: Quality, Privacy, Bias
Sergul Aydore, Krishnaram Kenthapadi, Haipeng Chen, Edward Choi, Jamie Hayes, Mario Fritz, Rachel Cummings, Krishnaram Kenthapadi
Workshop
Fri 7:10 Invited Speaker Dan Roth - Natural Language Understanding with Incidental Supervision
Dan Roth
Workshop
Fri 7:10 "Can Machine Learning Revolutionize Healthcare? Synthetic Data may be the Answer" by Mihaela van der Schaar, UCLA
Mihaela van der Schaar
Workshop
Fri 7:55 ICLR 2021 Workshop on Embodied Multimodal Learning (EML)
Ruohan Gao, Andrew Owens, Dinesh Jayaraman, Yuke Zhu, Jiajun Wu, Kristen Grauman
Workshop
Fri 8:00 Robust and reliable machine learning in the real world
Di Jin, Eric Wong, Yonatan Belinkov, Kai-Wei Chang, Zhijing Jin, Yanjun Qi, Aditi Raghunathan, Tristan Naumann, Mohit Bansal
Workshop
Fri 8:30 Workshop on Distributed and Private Machine Learning
Fatemeh Mireshghallah, Praneeth Vepakomma, Ayush Chopra, Vivek Sharma, Abhishek Singh, Adam Smith, Ramesh Raskar, Gautam Kamath, Reza Shokri
Workshop
Fri 8:45 Machine Learning for Preventing and Combating Pandemics
Pengtao Xie, Xiaodan Liang, Jure Leskovec, Judy Wawira, Jeremy Weiss, Manuel Gomez Rodriguez, Madalina Fiterau, Yueyu Jiang, Leo Celi, Eric P Xing
Workshop
Fri 8:45 Opening Remarks
Workshop
Fri 8:45 Security and Safety in Machine Learning Systems
Xinyun Chen, Cihang Xie, Ali Shafahi, Bo Li, Ding Zhao, Tom Goldstein, Dawn Song
Workshop
Fri 8:50 PROBLEM AND SOLUTION DOCUMENTATION TEMPLATE FOR MACHINE LEARNING COMPETITIONS TO ENHANCE EXPLAINABILITY, REPRODUCIBILITY, AND COLLABORATION BETWEEN STAKEHOLDERS
Olubayo Hamzat
Fri 9:00 Starting/Transitioning your career in ML during a pandemic
Workshop
Fri 9:01 "Differentially Private Synthetic Data Generations Using Generative Adversarial Networks" by Jinsung Yoon, Google Cloud AI
Jinsung Yoon
Workshop
Fri 9:11 Bambara Language Dataset for Sentiment Analysis
chayma fourati
Workshop
Fri 9:30 Break & Poster session 2
Workshop
Fri 9:40 Inference Risks for Machine Learning
David Evans
Workshop
Fri 9:51 "Towards Financial Synthetic Data" by Manuela M. Veloso, J.P.Morgan, CMU
Manuela Veloso
Workshop
Fri 10:05 Q&A for Inference Risks for Machine Learning
Workshop
Fri 10:54 TenSEAL: A Library for Encrypted Tensor Operations Using Homomorphic Encryption
Ayoub Benaissa
Workshop
Fri 11:06 Smoothness Matrices Beat Smoothness Constants: Better Communication Compression Techniques for Distributed Optimization
Mher Safaryan, Filip Hanzely, Peter Richtarik
Workshop
Fri 11:45 Spotlight 9: George Zhang et al., Universal Rate-Distortion-Perception Representations for Lossy Compression
Workshop
Fri 11:52 DeepSMOTE: Deep Learning for Imbalanced Data
Bartosz Krawczyk
Workshop
Fri 12:15 A Better Bound Gives a Hundred Rounds: Enhanced Privacy Guarantees via f-Divergences
Lalitha Sankar
Workshop
Fri 12:51 "Ethical Considerations of Generative AI" by Emily Denton, Google’s Ethical AI team
Emily Denton
Workshop
Fri 13:07 Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks
Curtis G Northcutt
Workshop
Fri 14:01 Contributed Talk 3 - Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks
Curtis G Northcutt
Workshop
Fri 15:25 Invited Speaker Paroma Varma - Snorkel: Programmatically Labeling Training Data
Paroma Varma
Workshop
MPCLeague: Robust 4-party Computation for Privacy-Preserving Machine Learning
Nishat Koti, Arpita Patra, Ajith Suresh
Workshop
Smoothness Matrices Beat Smoothness Constants: Better Communication Compression Techniques for Distributed Optimization
Mher Safaryan, Filip Hanzely, Peter Richtarik
Workshop
Talk Less, Smile More: Reducing Communication with Distributed Auto-Differentiation
Bradley Baker, Vince Calhoun, Barak Pearlmutter, Sergey Plis
Workshop
Does Differential Privacy Defeat Data Poisoning?
Matthew Jagielski, Alina Oprea
Workshop
Differentially Private Multi-Task Learning
Shengyuan Hu, Steven Wu, Virginia Smith
Workshop
Prior-Free Auctions for the Demand Side of Federated Learning
Andreas Haupt, Vaikkunth Mugunthan
Workshop
SWIFT: Super-fast and Robust Privacy-Preserving Machine Learning
Nishat Koti, Mahak Pancholi, Arpita Patra, Ajith Suresh
Workshop
Privacy and Integrity Preserving Training Using Trusted Hardware
Seyedeh Hanieh Hashemi, Yongqin Wang, Murali Annavaram
Workshop
Membership Inference Attack on Graph Neural Networks
Iyiola Emmanuel Olatunji, Wolfgang Nejdl, Megha Khosla
Workshop
TenSEAL: A Library for Encrypted Tensor Operations Using Homomorphic Encryption
Ayoub Benaissa