Inaccurate penicillin allergy labels can be removed (delabelling) following evaluation. The intervention in this study was an email‐based notification system that communicated whether penicillin allergy evaluation, with a view to delabelling, was appropriate, as identified by a deep learning artificial intelligence algorithm. In the intervention group (n = 59), three individuals (5.1%) had their penicillin allergies delabelled, a significantly higher proportion than in the control group (0%; P = 0.002). Further research to optimise such approaches is required.