Has anyone else noticed that GPT-4o gets more things wrong than GPT-4?