Event Details
Implementing Voice Assistant for Visually Impaired Using LLMs and Vision Language Models
Presenter: Jinke Jiang
Supervisor:
Date: Fri, October 11, 2024
Time: 14:00
Place: Zoom, link below.
ABSTRACT
As a result of population aging, the number of visually impaired people is growing. Unfortunately, there are few accessibility measures that help improve their quality of life. Recent developments in Artificial Intelligence, especially Large Language Models (LLMs), should offer effective and efficient solutions. Recognizing the limitations of existing products, we design and implement a user-friendly and privacy-safe voice assistant for visually impaired people. Using LLMs and Vision Language Models, the assistant can recognize and identify objects through low-latency speech-to-speech interactions. The assistant can be deployed on offline edge computing devices equipped with a camera, microphone, and speaker, and its functionality is easily extensible. In this report, we present the design, the adopted technologies, and the adjustments that we applied to arrive at the final implementation.
Topic: Zoom Meeting
Time: Oct 11, 2024 02:00 PM Vancouver
Join Zoom Meeting
Meeting ID: 894 5708 9273
Password: 195461
One tap mobile
+17789072071,,89457089273# Canada
+16475580588,,89457089273# Canada
Dial by your location
+1 778 907 2071 Canada
+1 647 558 0588 Canada