question about TD3

In the original implementation of TD3, when updating q functions, you use the target policy for the TD target. However, when updating the policy, you use q function rather than the target q function. Why is that?