Researchers criticize A-B testing, warning it yields unreliable ad performance conclusions due to divergent delivery issues.