I'll try
.
Phase detection works by comparing the position (phase) of separate virtual images which are created from different portions of the light... basically, if you think of "the V" of light in a typical diagram, where the base is the light being in focus; there is a complete/separate (virtual) image in both legs of the V.
For on sensor PDAF to function a certain number of pixels have to be dedicated to sensing/comparing these separate images... this is a dedicated data stream/function so those pixels are lost from the image (re-mapped the way hot pixels are mapped out). The more AF points there are, the more that is lost from the image stream (but even 1000 pixels lost is minor in a 40MP system). The more significant issues are that because the sensor is using (essentially) two images composed from all of the light at the point where they (should) converge, there's not much movement to detect (phase shift). Additionally, because it is using all of the light (or 1/2 each) the image sharpness and DOF of the lens/lens aperture will affect the accuracy of on sensor PDAF. As will the amount of light transmitted by the lens.
The PDAF module uses a bunch of lenses and sensors to farther isolate multiple images to compare with greater accuracy/speed. Because of how the system works it is not dependent on lens aperture to control the amount of light (only that the aperture is not smaller than a particular sensor point is designed for). And it is not dependent on the lens aperture for DOF/image sharpness. The lenses/system has their own aperture/DOF which is somewhere around f/7 in terms of light transmission, and over f/22 in terms of DOF (aperture numbers are relative to main lens FL/aperture function). And because the virtual images are taken from smaller portions of the total light they have greater potential offsets/phase shift for greater accuracy. I.e. an f/2.8 AF sensor point uses images from the f/2.8 area of the objective lens, which has greater separation and accuracy than the images used by an f/8 AF sensor point.