''' exponent = torch.exp(-.5 * torch.sum((x - mu) * (x - mu) / var, 2, keepdim=True)) RuntimeError: expected device cpu and dtype Float but got device cuda:0 and dtype Float ''' How could this program working rightly with GPU?