A Survey on Out-of-Distribution Evaluation of Neural NLP Models

A Survey on Out-of-Distribution Evaluation of Neural NLP Models

Xinzhe Li, Ming Liu, Shang Gao, Wray Buntine

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Survey Track. Pages 6683-6691. https://doi.org/10.24963/ijcai.2023/749

Adversarial robustness, domain generalization and dataset biases are three active lines of research contributing to out-of-distribution (OOD) evaluation on neural NLP models. However, a comprehensive, integrated discussion of the three research lines is still lacking in the literature. This survey will 1) compare the three lines of research under a unifying definition; 2) summarize their data-generating processes and evaluation protocols for each line of research; and 3) emphasize the challenges and opportunities for future work.
Keywords:
Survey: Natural Language Processing
Survey: Machine Learning
Survey: Uncertainty in AI
Survey: AI Ethics, Trust, Fairness