GigaPevt: Multimodal Medical Assistant
GigaPevt: Multimodal Medical Assistant
Pavel Blinov, Konstantin Egorov, Ivan Sviridov, Nikolay Ivanov, Stepan Botman, Evgeniy Tagin, Stepan Kudin, Galina Zubkova, Andrey V. Savchenko
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Demo Track. Pages 8614-8618.
https://doi.org/10.24963/ijcai.2024/992
Building an intelligent and efficient medical assistant is still a challenging AI problem. The major limitation comes from the data modality scarceness, which reduces comprehensive patient perception. This demo paper presents GigaPevt, the first multimodal medical assistant that combines the dialog capabilities of large language models with specialized medical models. Such an approach shows immediate advantages in dialog quality and metric performance, with a 1.18% accuracy improvement in the question-answering task.
Keywords:
Multidisciplinary Topics and Applications: MDA: Health and medicine
Natural Language Processing: NLP: Dialogue and interactive systems
Computer Vision: CV: Vision and languageĀ
Computer Vision: CV: Applications