Does student engagement in self-assessment calibrate their judgement over time?