The results for CoAtNet seem to imply that adding a locality property to self-attention helps a lot.
Naomi Subtlety Saphra
+ ConViT too
For the frozen-layers experiment: why do we look at the accuracy on task 2, since forgetting happens with task 1?
I cannot unmute myself, sorry.
Okay that makes sense. Thank you for the talk!
Related: an empirical study on whether all examples get forgotten: https://arxiv.org/abs/1812.05159