Shared-task on Hallucinations and Related Observable Overgeneration Mistakes. The
participants were asked to perform binary classification to identify cases of fluent
overgeneration hallucinations. Our experiments included fine-tuning a pre-trained model
on hallucination detection and a Natural Language Inference (NLI) model. The most
successful strategy involved creating an ensemble of these models, resulting in accuracy …