I believe I understand the role of the IF bandwidth (IFBW) in a VNA measurement:
The signal is mixed down to the intermediate frequency (IF) and the receiver detects the signal in a frequency band around that frequency whose width is the IFBW. Selecting a narrow IFBW increases the frequency selectivity of the measurement and thus reduces noise.
What I do not understand why this also decreases the measurement speed.
Does the receiver measure the time it takes for a certain amount of energy to be deposited? If that were so, then increasing the power of the probe signal should reduce the measurement time, which it does not.
What exactly is the reason that recording one data point takes the VNA longer when employing a smaller IFBW?