I suspect this would be feasible if filming were accompanied by a depth-sensing camera like the Microsoft Kinect. In post-production, you could then tell roughly how far each pixel is from the camera, which could aid in reconstructing a 3D scene.
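To make the per-pixel depth idea concrete, here is a rough sketch of turning a depth image into 3D points with a simple pinhole camera model. The function name `backproject` and the intrinsics `fx`, `fy`, `cx`, `cy` (focal lengths and principal point, in pixels) are my own assumptions for illustration, not anything the Kinect SDK defines; a real sensor would also need calibration and lens distortion correction.

```python
def backproject(depth, fx, fy, cx, cy):
    """Convert a depth image into a list of (x, y, z) camera-space points.

    depth: list of rows, each value the distance along the optical axis
           in metres; 0 or None means the sensor had no reading there.
    fx, fy, cx, cy: hypothetical pinhole intrinsics, in pixels.
    """
    points = []
    for v, row in enumerate(depth):        # v = pixel row
        for u, z in enumerate(row):        # u = pixel column
            if not z:                      # skip missing readings
                continue
            x = (u - cx) * z / fx          # pinhole model, horizontal
            y = (v - cy) * z / fy          # pinhole model, vertical
            points.append((x, y, z))
    return points


# Tiny 2x2 depth image with one missing reading.
cloud = backproject([[1.0, 2.0], [0.0, 4.0]], fx=2.0, fy=2.0, cx=0.5, cy=0.5)
```

Each frame would give one such point cloud, and the hard part of a real reconstruction would be aligning and merging the clouds across frames as the camera moves.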
Maybe this could even be done with just the aperture and focal distance, which most modern cinema cameras record as they film, since how blurred a pixel looks depends on how far it is from the plane of focus.
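As a sanity check on the aperture idea, here is a small sketch using the standard thin-lens blur circle formula. I am assuming "focal distance" means the distance the lens is focused at, and that the focal length is also known; the function names and the example numbers are made up for illustration. Real lenses are not thin lenses, and you would still have to measure the blur at each pixel somehow.

```python
def blur_diameter(focal_length, f_number, focus_dist, subject_dist):
    """Thin-lens blur circle diameter on the sensor (all lengths in metres)."""
    aperture = focal_length / f_number                   # aperture diameter
    k = aperture * focal_length / (focus_dist - focal_length)
    return k * abs(subject_dist - focus_dist) / subject_dist


def depth_candidates(focal_length, f_number, focus_dist, blur):
    """Invert the formula above: return (near, far) candidate distances.

    Blur alone cannot tell whether the point is in front of or behind the
    plane of focus, so there are up to two answers. `far` is None when the
    blur is too large for any point behind the plane of focus.
    """
    aperture = focal_length / f_number
    k = aperture * focal_length / (focus_dist - focal_length)
    ratio = blur / k
    near = focus_dist / (1.0 + ratio)
    far = focus_dist / (1.0 - ratio) if ratio < 1.0 else None
    return near, far


# 50 mm lens at f/2 focused at 2 m, subject actually at 4 m.
c = blur_diameter(0.05, 2.0, 2.0, 4.0)
near, far = depth_candidates(0.05, 2.0, 2.0, c)
```

Here the same blur is consistent with a point at 4 m or at roughly 1.33 m, so the lens metadata narrows the depth down but does not pin it to a single value on its own.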