This isn't necessary for a toy example, but as a note: we should create an example that adds the proper lazy approach, without the GPU sync that the tuning guide discourages (https://pytorch.org/tutorials/recipes/recipes/tuning_guide.html#avoid-unnecessary-cpu-gpu-synchronization).
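To make the suggestion concrete, here is a rough sketch of what such an example could look like. It assumes "lazy" means deferring the host-side read of a value until it is actually needed; the loop, the model, and the name `run_steps` are all hypothetical and not taken from the code under review. The running loss stays on the device, and `.item()` is called once after the loop instead of on every step.

```python
from typing import Optional

import torch


def run_steps(num_steps: int = 5, device: Optional[str] = None) -> float:
    """Hypothetical toy training loop that reads the loss on the host only once."""
    dev = torch.device(device or ("cuda" if torch.cuda.is_available() else "cpu"))
    torch.manual_seed(0)

    model = torch.nn.Linear(4, 1).to(dev)
    opt = torch.optim.SGD(model.parameters(), lr=0.01)

    # Accumulator lives on the same device as the model, so updating it
    # does not require a device-to-host copy.
    running_loss = torch.zeros((), device=dev)

    for _ in range(num_steps):
        x = torch.randn(8, 4, device=dev)
        y = torch.randn(8, 1, device=dev)

        loss = torch.nn.functional.mse_loss(model(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()

        # Calling loss.item() here would force a CPU-GPU sync on every step.
        # detach() keeps the value on the device and out of the autograd graph.
        running_loss += loss.detach()

    # Single host read after the loop: the only explicit sync point.
    return (running_loss / num_steps).item()


if __name__ == "__main__":
    print(f"mean loss: {run_steps():.4f}")
```

The same pattern applies to other values that are usually logged per step: keep them as device tensors and convert them in one place, at whatever interval the logging actually needs.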