details - 番茄社区

Skip to global menu.
Skip to primary navigation.
Skip to secondary navigation.
Skip to page content.

Return to global menu.
Skip to primary navigation.
Skip to secondary navigation.
Skip to page content.

Event Details

Implementing Voice Assistant for Visually Impaired Using LLMs and Vision Language Models

Presenter: Jinke Jiang
Supervisor:

Date: Fri, October 11, 2024
Time: 14:00:00 - 00:00:00
Place: Zoom, link below.

ABSTRACT

Abstract: As a result of population aging, the number of visually impaired people is growing. Unfortunately, there is limited accessibility measures to help improve the quality of life of these people. The recent technological development in Artificial Intelligence, especially Large Language Models (LLMs), should offer effective and efficient solutions. Recognizing the limitation of existing products, we design and implement a user-friendly and privacy-safe voice assistant for visually impaired people. Using LLMs and Vision Language Models, the assistant can recognize and identify objects through low-latency speech-to-speech interactions. The assistant can be deployed on offline edge computing devices with camera/microphone/speaker, with easily extendable functionalities. In this report, we present the design, adopted technologies, and adjustment that we applied to arrive at the final implementation

Topic: Zoom Meeting

Time: Oct 11, 2024 02:00 PM Vancouver

Join Zoom Meeting

Meeting ID: 894 5708 9273

Password: 195461

One tap mobile

+17789072071,,89457089273# Canada

+16475580588,,89457089273# Canada

Dial by your location

+1 778 907 2071 Canada

+1 647 558 0588 Canada

Meeting ID: 894 5708 9273

Return to global menu.
Return to primary navigation.
Return to secondary navigation.
Return to page content.