Right now, the pytorch models can accept either bytes of compressed images, or tensors. but we only provide bytes in the context of serving the API. we should move the conversion logic outside of the model into the Ray wrappers to make it more clear what's going on when we do a forward pass against the network.