(2021), still very interesting. The "post-overfitting" training strategy in particular is unexpected.
By xg15 11 hours ago
I vaguely remember this being observed when training GPT-3 (probably?) as well: the model was just trained on and on, and the error went up and then came down again. Like a phase transition in the model.
By luckystarr 8 hours ago
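A minimal sketch of the kind of experiment this describes, a "grokking"-style run where you keep optimizing long after training error hits zero and the validation error eventually drops much later. The task (modular addition), model, and hyperparameters here are illustrative assumptions, not the paper's exact setup:

```python
# Train far past the point of overfitting on a small algorithmic task and
# log train/val accuracy; validation typically stays near chance for a long
# time after training saturates, then rises sharply ("grokking").
import torch
import torch.nn as nn

P = 97  # modular addition: predict (a + b) mod P
pairs = torch.cartesian_prod(torch.arange(P), torch.arange(P))
labels = (pairs[:, 0] + pairs[:, 1]) % P
perm = torch.randperm(len(pairs))
split = len(pairs) // 2  # train on half the table, validate on the rest
train_x, val_x = pairs[perm[:split]], pairs[perm[split:]]
train_y, val_y = labels[perm[:split]], labels[perm[split:]]

model = nn.Sequential(
    nn.Embedding(P, 128),        # shared embedding for both operands
    nn.Flatten(start_dim=1),     # (N, 2, 128) -> (N, 256)
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, P),
)
# Weight decay is one of the regularizers associated with grokking.
opt = torch.optim.AdamW(model.parameters(), lr=1e-3, weight_decay=1.0)
loss_fn = nn.CrossEntropyLoss()

for step in range(100_000):  # deliberately far past overfitting
    opt.zero_grad()
    loss = loss_fn(model(train_x), train_y)
    loss.backward()
    opt.step()
    if step % 1000 == 0:
        with torch.no_grad():
            train_acc = (model(train_x).argmax(-1) == train_y).float().mean()
            val_acc = (model(val_x).argmax(-1) == val_y).float().mean()
        print(f"{step:6d}  train={train_acc:.3f}  val={val_acc:.3f}")
```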
The low sample efficiency of RL is well explained.
By esafak 7 hours ago