Now they do say in the paper "It is worth stressing that this measurement does not rely on the difference between a start (t0) and a stopsignal but on the comparison of two event time distributions."
A system with poor transient response tends to blur these sounds over time, due to the speaker's inability to stop and start quickly enough to react to the signal accurately.