Learning the Pareto front with hypernetworks

A Navon, A Shamsian, G Chechik, E Fetaya - arXiv preprint arXiv …, 2020 - arxiv.org
… PFL implemented using HyperNetworks, which we term Pareto HyperNetworks (PHNs).
PHN learns the entire Pareto front simultaneously using a single hypernetwork, which receives …
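
The snippet cuts off before naming the hypernetwork's input, which in the paper is a preference vector over the objectives. A minimal PyTorch sketch of that conditioning pattern follows; the layer sizes and names are illustrative, not taken from the paper:

```python
import torch
import torch.nn as nn

class ParetoHyperNet(nn.Module):
    """Sketch: map a preference vector over m objectives to the flat
    weights of a small target network (all sizes are illustrative)."""
    def __init__(self, n_objectives=2, target_in=10, target_hidden=32, target_out=1):
        super().__init__()
        self.shapes = [(target_hidden, target_in), (target_hidden,),
                       (target_out, target_hidden), (target_out,)]
        n_params = sum(torch.Size(s).numel() for s in self.shapes)
        self.body = nn.Sequential(
            nn.Linear(n_objectives, 64), nn.ReLU(), nn.Linear(64, n_params))

    def forward(self, preference, x):
        flat = self.body(preference)          # one weight vector per preference
        params, i = [], 0
        for shape in self.shapes:
            n = torch.Size(shape).numel()
            params.append(flat[i:i + n].view(shape))
            i += n
        w1, b1, w2, b2 = params
        h = torch.relu(x @ w1.t() + b1)       # run the generated target net
        return h @ w2.t() + b2

phn = ParetoHyperNet()
pref = torch.tensor([0.3, 0.7])               # a point on the preference simplex
y = phn(pref, torch.randn(4, 10))             # different pref -> different weights
```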

Continual learning with hypernetworks

J Von Oswald, C Henning, BF Grewe… - arXiv preprint arXiv …, 2019 - arxiv.org
… based on task-conditioned hypernetworks, i.e., networks that … data, task-conditioned
hypernetworks only require rehearsing task-… that task-conditioned hypernetworks display a very …
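
A rough sketch of the task-conditioned pattern the snippet describes: only a small learned embedding per task is stored, and an L2 penalty on the hypernetwork's output for old tasks stands in for rehearsing their data. All names and sizes below are hypothetical:

```python
import torch
import torch.nn as nn

class TaskConditionedHyperNet(nn.Module):
    """Sketch: one learned embedding per task conditions a shared
    hypernetwork; per-task target weights are never stored."""
    def __init__(self, n_tasks=5, emb_dim=8, n_target_params=330):
        super().__init__()
        self.task_emb = nn.Embedding(n_tasks, emb_dim)   # cheap per-task state
        self.hnet = nn.Sequential(
            nn.Linear(emb_dim, 64), nn.ReLU(), nn.Linear(64, n_target_params))

    def forward(self, task_id):
        return self.hnet(self.task_emb(task_id))         # weights for that task

hnet = TaskConditionedHyperNet()
w_old = hnet(torch.tensor(0)).detach()      # snapshot of task 0's generated weights
w_new = hnet(torch.tensor(0))
regularizer = ((w_new - w_old) ** 2).mean() # keep old tasks' weights from drifting
```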

Neural architecture search with reinforcement learning

B Zoph, QV Le - arXiv preprint arXiv:1611.01578, 2016 - arxiv.org
Neural networks are powerful and flexible models that work well for many difficult learning
tasks in image, speech and natural language understanding. Despite their success, neural …

Graph hypernetworks for neural architecture search

C Zhang, M Ren, R Urtasun - arXiv preprint arXiv:1810.05749, 2018 - arxiv.org
… We propose Graph HyperNetwork that predicts the parameters of unseen neural networks
by directly operating on their computational graph representations. 2. Our approach achieves …
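
As a loose illustration of that idea, the toy module below runs two message-passing steps over a candidate network's adjacency matrix and decodes each node embedding into that node's parameters. The propagation scheme and sizes are assumptions, not the paper's architecture:

```python
import torch
import torch.nn as nn

class GraphHyperNetSketch(nn.Module):
    """Sketch: propagate node features along a computational graph
    (adjacency A), then decode each node embedding into that node's
    weight tensor. The 2-step propagation and sizes are illustrative."""
    def __init__(self, node_feat=16, hidden=32, params_per_node=64):
        super().__init__()
        self.msg = nn.Linear(node_feat, hidden)
        self.upd = nn.GRUCell(hidden, node_feat)
        self.decode = nn.Linear(node_feat, params_per_node)

    def forward(self, node_ops, adjacency):
        h = node_ops                                   # one feature row per op
        for _ in range(2):                             # a couple of message rounds
            m = adjacency @ torch.relu(self.msg(h))    # aggregate from neighbors
            h = self.upd(m, h)
        return self.decode(h)                          # per-node generated weights

n = 5                                                  # ops in the candidate graph
gh = GraphHyperNetSketch()
weights = gh(torch.randn(n, 16), torch.eye(n))         # (n, params_per_node)
```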

Neuroevolution of self-interpretable agents

Y Tang, D Nguyen, D Ha - Proceedings of the 2020 genetic and …, 2020 - dl.acm.org
Hypernetworks [36] suggested making the phenotype directly dependent on the inputs, thus
tailoring the weights of the phenotype to the specific inputs of the network. By incorporating …
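
A minimal sketch of such input-conditioned weight generation, with illustrative dimensions:

```python
import torch
import torch.nn as nn

class InputConditionedLayer(nn.Module):
    """Sketch: the layer's weight matrix is generated from the current
    input, so the 'phenotype' changes with what the agent observes."""
    def __init__(self, d_in=8, d_out=4):
        super().__init__()
        self.gen = nn.Linear(d_in, d_in * d_out)   # hypernetwork head
        self.d_in, self.d_out = d_in, d_out

    def forward(self, x):                          # x: (batch, d_in)
        w = self.gen(x).view(-1, self.d_out, self.d_in)
        return torch.einsum('boi,bi->bo', w, x)    # per-example weights

layer = InputConditionedLayer()
y = layer(torch.randn(3, 8))                       # (3, 4)
```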

Efficient neural architecture search via parameters sharing

H Pham, M Guan, B Zoph, Q Le… - … conference on machine …, 2018 - proceedings.mlr.press
… work (Ha et al., 2017) to generate its weight. Such usage of the hypernetwork in SMASH …
This is because the hypernetwork generates weights for SMASH’s child models via tensor …

Simple and efficient architecture search for convolutional neural networks

T Elsken, JH Metzen, F Hutter - arXiv preprint arXiv:1711.04528, 2017 - arxiv.org
… (2017) used hypernetworks (Ha et al., 2017) to generate the weights for a randomly sampled
network architecture with the goal of eliminating the costly process of training a vast amount …
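
A rough sketch of that idea, ranking randomly sampled architectures by the loss of hypernetwork-generated weights instead of training each one. The flat-vector generator below is a hypothetical stand-in for SMASH's actual scheme:

```python
import torch
import torch.nn as nn

# Hypothetical: map a 6-dim architecture encoding to 352 flat weights
hypernet = nn.Sequential(nn.Linear(6, 64), nn.ReLU(), nn.Linear(64, 352))

def proxy_score(arch_encoding, val_x, val_y):
    """Score a sampled architecture by the loss of its *generated*
    weights, skipping per-architecture training (sizes illustrative)."""
    flat = hypernet(arch_encoding)
    w = flat[:320].view(32, 10)
    b = flat[320:]
    pred = val_x @ w.t() + b            # run the generated one-layer model
    return nn.functional.mse_loss(pred, val_y)

archs = [torch.rand(6) for _ in range(8)]          # random architecture codes
val_x, val_y = torch.randn(16, 10), torch.randn(16, 32)
best = min(archs, key=lambda a: proxy_score(a, val_x, val_y).item())
```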

Zoneout: Regularizing RNNs by randomly preserving hidden activations

D Krueger, T Maharaj, J Kramár, M Pezeshki… - arXiv preprint arXiv …, 2016 - arxiv.org
We propose zoneout, a novel method for regularizing RNNs. At each timestep, zoneout
stochastically forces some hidden units to maintain their previous values. Like dropout, zoneout …
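
The snippet states the rule completely, so a single-timestep sketch is direct (the zoneout probability is illustrative):

```python
import torch

def zoneout_step(h_prev, h_new, p=0.15, training=True):
    """Zoneout per the snippet: each hidden unit keeps its previous
    value with probability p, otherwise takes the new value.
    At test time, use the expectation, as in dropout."""
    if training:
        keep = torch.bernoulli(torch.full_like(h_prev, p))
        return keep * h_prev + (1 - keep) * h_new
    return p * h_prev + (1 - p) * h_new

h_prev, h_new = torch.zeros(2, 5), torch.randn(2, 5)
h = zoneout_step(h_prev, h_new)
```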

Multiplicative interactions and where to find them

SM Jayakumar, WM Czarnecki, J Menick, J Schwarz… - 2020 - openreview.net
… “projected” context by the hypernetwork; or (c) the … hypernetwork that generates a weight
matrix for a matrix multiplication. Similarly, a diagonal 3D tensor is equivalent to a hypernetwork …
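
That equivalence is easy to verify numerically: contracting the context vector into a 3D tensor yields a context-generated weight matrix, and a diagonal tensor degenerates into elementwise gating. A small sketch with illustrative shapes:

```python
import torch

b, dz, dx, dy = 2, 3, 4, 4
W = torch.randn(dz, dy, dx)                   # full 3D interaction tensor
z, x = torch.randn(b, dz), torch.randn(b, dx)

# Bilinear view: y_k = sum_{i,j} W[i,k,j] z_i x_j
y_bilinear = torch.einsum('ikj,bi,bj->bk', W, z, x)

# Hypernetwork view: z generates a weight matrix W(z), applied to x
Wz = torch.einsum('ikj,bi->bkj', W, z)
y_hyper = torch.einsum('bkj,bj->bk', Wz, x)
assert torch.allclose(y_bilinear, y_hyper, atol=1e-5)

# Diagonal 3D tensor: W(z) is diagonal, i.e. a Hadamard gating of x
D = torch.randn(dz, dx)                       # one diagonal per context unit
y_gated = (z @ D) * x                         # generated diagonal times input
```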

Hierarchical multiscale recurrent neural networks

J Chung, S Ahn, Y Bengio - arXiv preprint arXiv:1609.01704, 2016 - arxiv.org
… JC would also like to thank Guillaume Alain, Kyle Kastner and David Ha for providing us …