Electrical phase and physical alignment. If the electrical phase is correct then you adjust the driver distances such that the impulse of each driver reaches the ear at the same time. A step response tells the whole story of what is and what is not time coherent.
Its can only be time coherent physically in the Nearfield at a specific predetermine distance , itscwhy so many are not affected by poor step responses , electrical its different and may not be noticeable too, eg when a tweeter has reverse phase, very few can hear this , also if drivers are ran in reverse phase to compensate acoustically then you have to examine its acoustic properties / phase rotation.
Transfer function is a big part but like everything its the complete package making the difference and not all are sensitive to it , hence why so many ridiculous designs
Regards