Can we fully describe GPUs as a thermodynamical system in terms of its information processing capability? Can we backcalculate optimal model architecture designs bsaed on this?