Post-doc at Princeton University
Here are some blog posts I've written, typically about papers or machine learning techniques:
A step-by-step procedure for choosing a learning rate and other optimization parameters.
Attention Is All You Need (aka the Transformer network).
The Lottery Ticket Hypothesis.
Insights on representational similarity in neural networks with canonical correlation.