Rich, you are absolutely right.
It is maths, or specifically geometry and common sense that the entertainment industry has been historically slow to adopt.
It can be argued that the resolution loss resulting from correcting the key-stoning is larger than that from HIT. But it is difficult to correct the asymetric geometric distortions in left and right eye as the lenses rarely render the field flat and undistorted. And, more importantly, when shooting converged, the image planes are no longer parallel, so that the focus on each side of the screen falls at a different distance on left and right eye. This is not assisting the brain to fuse such images together and can not be "corrected" or compensated for in post.
Convergence (toe-in) gained popularity in live 3D broadcast as a "cheap", or rather gimmicky, way of adjusting the depth at which the image appears to the viewer. Complex systems, like CPG Fusion 3D, have been developed and marketed convergence linked to focus as a desireable feature, for example. Hype over science. It happens all the time.
On a side but related note, I know of some clowns advocating convergence when filming underwater through a flat port without understanding that it works like a variable chromatic aberration and asymetric distortion control allowing the user to change them from "severe" to "ridiculous" and anything in between. Welcome to the entertainment industry! :)
