SaGol: Using MiniGPT-4 to Generate Alt Text for Improving Image Accessibility
SaGol: Using MiniGPT-4 to Generate Alt Text for Improving Image Accessibility
Yunseo Moon, Hyunmin Lee, SeungYoung Oh, Hyunggu Jung
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Demo Track. Pages 8745-8748.
https://doi.org/10.24963/ijcai.2024/1023
SaGol is an AI-powered application to improve image accessibility for people with visual impairments (PVI) users. Alternative (alt) text, a general method of web accessibility for PVI users, is text or phrases that describe images on a website in an understandable way. SaGol generates alt text with the images on the user's smartphone using a vision large language model called MiniGPT-4. SaGol searches for similar images based on the generated alt text. We evaluated the length of alt text and the search accuracy. This paper shows a potential opportunity to improve image accessibility for PVI users.
Keywords:
Humans and AI: HAI: Human-computer interaction
Humans and AI: HAI: Applications
Humans and AI: HAI: Intelligent user interfaces