You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello teacher, I would like to ask about the bert-large model on the RTE task, the accuracy of the approximate distillation is only 0.47, I see that this problem has also been mentioned in the previous question, how is this problem finally solved?