[T12] Studying Large Language Model Generalization with Influence Functions
Roger Grosse*, Juhan Bae*, Cem Anil*, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamilė Lukošiūtė, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, Samuel R. Bowman
arXiv 2023
[T11] Benchmarking Neural Network Training Algorithms
George E. Dahl*, Frank Schneider*, Zachary Nado*, Naman Agarwal*, Chandramouli Sastry,
Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer and 13 more authors
arXiv 2023
[C10] Efficient Parametric Approximations of Neural Network Function Space Distance
Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse
ICML 2023
[C9] Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Juhan Bae, Michael R. Zhang, Michael Ruan, Eric Wang, So Hasegawa, Jimmy Ba, Roger Grosse
ICLR 2023 (Oral Presentation)
[C8] If Influence Functions are the Answer, Then What is the Question?
Juhan Bae, Nathan Ng, Alston Lo, Marzyeh Ghassemi, Roger Grosse
NeurIPS 2022
[C7] Amortized Proximal Optimization
Juhan Bae*, Paul Vicol*, Jeff Z. HaoChen, Roger Grosse
NeurIPS 2022
[C6] Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
James Lucas, Juhan Bae, Michael R. Zhang, Stanislav Fort, Richard Zemel, Roger Grosse
ICML 2021
[C5] Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians
Juhan Bae and Roger Grosse
NeurIPS 2020
[W4] On Monotonic Linear Interpolation of Neural Network Parameters
James Lucas, Juhan Bae, Michael R. Zhang, Richard Zemel, Jimmy Ba, Roger Grosse
NeurIPS 2020, Optimization for Machine Learning Workshop
[W3] Eigenvalue Corrected Noisy Natural Gradient
Juhan Bae, Guodong Zhang, Roger Grosse
NeurIPS 2019, Bayesian Deep Learning Workshop
[C2] Fast 6DOF Pose Estimation with Synthetic Textureless CAD Model for Mobile Applications
Bowen Chen, Juhan Bae, Dibyendu Mukherjee
ICIP 2019
[W1] Learnable Pooling Methods for Video Classification
Sebastian Kmiec, Juhan Bae, Ruijian An
ECCV 2018, Workshop on YouTube-8M Large-Scale Video Understanding (Oral Presentation)