https://openreview.net/forum?id=C33p2CNOQ8&referrer=%5Bthe%20profile%20of%20Vighnesh%20Subramaniam%5D(%2Fprofile%3Fid%3D~Vighnesh_Subramaniam1)
We demonstrate that architectures which traditionally are considered to be ill-suited for a task can be trained using inductive biases from another...
inductive biastrainingintroducingviarepresentational
https://openreview.net/forum?id=GGItImF9oG5&referrer=%5Bthe%20profile%20of%20Hyung%20Won%20Chung%5D(%2Fprofile%3Fid%3D~Hyung_Won_Chung1)
Your model is pretty cool, but does it scale? Let's find out.
scaling lawshow doesinductive biasvsmodel