If some speakers are good at imaging, then it shouldn't take much effort to imagine "seeing" instruments and notes in the air of the room, since they sound so real and so "present".

Isn't high fidelity, whether audio or video, supposed to be a representation of reality?
I think so, and if that representation is good enough, then who needs anything other than one's senses and imagination?