GigaPevt: Multimodal Medical Assistant

GigaPevt: Multimodal Medical Assistant

Pavel Blinov, Konstantin Egorov, Ivan Sviridov, Nikolay Ivanov, Stepan Botman, Evgeniy Tagin, Stepan Kudin, Galina Zubkova, Andrey V. Savchenko

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Demo Track. Pages 8614-8618. https://doi.org/10.24963/ijcai.2024/992

Building an intelligent and efficient medical assistant is still a challenging AI problem. The major limitation comes from the data modality scarceness, which reduces comprehensive patient perception. This demo paper presents GigaPevt, the first multimodal medical assistant that combines the dialog capabilities of large language models with specialized medical models. Such an approach shows immediate advantages in dialog quality and metric performance, with a 1.18% accuracy improvement in the question-answering task.
Keywords:
Multidisciplinary Topics and Applications: MDA: Health and medicine
Natural Language Processing: NLP: Dialogue and interactive systems
Computer Vision: CV: Vision and languageĀ 
Computer Vision: CV: Applications