Project
# | Title | Team Members | TA | Documents | Sponsor |
---|---|---|---|---|---|
33 | # AMADEUS - Augmented Modular AI Dialogue and Exchange User System |
Chengyuan Peng Ryan Fu Wesley Pang |
Jason Zhang | design_document1.pdf proposal1.pdf |
|
# AMADEUS - Augmented Modular AI Dialogue and Exchange User System

# Team members:

· Ryan Fu (ryfu2)
· Qiran Pang (qpang2)
· Chengyuan Peng (cpeng14)

# Problem

For many years, people have dreamed of having natural, everyday conversations with robots to fulfill their emotional and lifestyle needs. However, current interactive AI systems are often bulky, and even the most portable solutions still rely on smartphone interactions. Regarding emotional needs, we don’t want to talk to a cold, lifeless screen. Instead, we hope for a more tangible medium, like a child chatting with a SpongeBob toy embedded with AI. Thus, the needs are clear: we require a more compact AI platform that can easily integrate into various devices. On top of that, it should be as affordable as possible to make it widely accessible.

# Solution

We are designing an AI-based interactive audio interface. The baseline feature of the project is a low-cost PCB interface that receives audio from the user and sends it over Wi-Fi to a model on a computer. The AI model processes the audio and replies with audio, which is sent back to the board and played. We will use an ESP32 microcontroller with Wi-Fi and audio input/output capability to achieve this.

Additional features would include indoor and outdoor modes: in outdoor mode, the user presses a button to speak and the input is denoised. Another additional feature could be integrating the board with headphones or Bluetooth earbuds. Moreover, a text display interface can be embedded on the PCB to display the converted audio as text.

Please view our block diagram via the Google link: https://docs.google.com/document/d/1Uv_b5SzeoN7boqyMyB3Kkgl7XGVAnuv50S6DZ1e3PhY/edit

# Solution Components

# Subsystem 1: AI Web Client

Our language model will be hosted on a cloud-based server. The local MCU will transmit audio to the server via a Wi-Fi module.
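As a rough illustration of the audio uplink described above, the sketch below shows one way captured audio could be split into small numbered frames before being sent over Wi-Fi, and reassembled on the receiving side. The sample rate, bit depth, chunk duration, header layout, and the names `frame_audio` and `unframe_audio` are all assumptions made for illustration; none of them come from the project's design. It is written in Python so it can be tried on a desktop; the firmware itself would do the equivalent framing on the ESP32.

```python
import struct

# All of the following values are assumptions for illustration only.
SAMPLE_RATE_HZ = 16_000   # assumed sample rate
BYTES_PER_SAMPLE = 2      # assumed 16-bit mono PCM
CHUNK_MS = 20             # assumed duration of one frame

# Bytes of raw audio carried by one frame: 16000 * 2 * 20 / 1000 = 640
CHUNK_BYTES = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHUNK_MS // 1000

# Hypothetical frame header: sequence number (uint32) + payload length (uint16),
# little-endian, no padding (6 bytes total).
HEADER = struct.Struct("<IH")


def frame_audio(pcm: bytes, chunk_bytes: int = CHUNK_BYTES) -> list[bytes]:
    """Split raw PCM into frames, each prefixed with a sequence number and length."""
    frames = []
    for seq, start in enumerate(range(0, len(pcm), chunk_bytes)):
        payload = pcm[start:start + chunk_bytes]
        frames.append(HEADER.pack(seq, len(payload)) + payload)
    return frames


def unframe_audio(stream: bytes) -> bytes:
    """Reassemble PCM from concatenated frames, checking order and completeness."""
    out = bytearray()
    offset = 0
    expected_seq = 0
    while offset < len(stream):
        seq, length = HEADER.unpack_from(stream, offset)
        if seq != expected_seq:
            raise ValueError(f"expected frame {expected_seq}, got {seq}")
        offset += HEADER.size
        payload = stream[offset:offset + length]
        if len(payload) != length:
            raise ValueError("truncated frame")
        out += payload
        offset += length
        expected_seq += 1
    return bytes(out)


def raw_bitrate_kbps() -> float:
    """Uncompressed audio bitrate in kbit/s, excluding frame headers."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * 8 / 1000
```

Under these assumed settings the raw audio stream needs 256 kbit/s before header overhead, which gives a first estimate of the bandwidth the Wi-Fi link would have to sustain in each direction.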
We are collaborating with a local start-up that will provide the AI model and handle the audio training. However, we also have the option to train our own AI model to create additional characters using their interface.

# Subsystem 2: ESP32 with Wi-Fi Capability

We will use an ESP32 as the processor for signal processing. Before use, it will receive the Wi-Fi password from the user’s device over Bluetooth so that it can connect to the network. It will receive an audio signal from the ADC and send it to the PC as input for the AI Web Client. After receiving the output audio signal from the PC, the ESP32 will send it to the audio codec for audio output.

# Subsystem 3: Power System

The system can be powered through either a USB connection or a 5V battery. The 5V supply directly powers the I/O devices and the programming module. To provide 3.3V power for the microcontroller and audio processing module, a 5V to 3.3V LDO voltage regulator is used to step down the voltage.

# Subsystem 4: Bluetooth Communication

A Bluetooth transceiver module will be connected to the ESP32 processor to receive user input for configuring the internet connection. The user will transmit the Wi-Fi password to the Bluetooth transceiver, which will then relay it to the microcontroller to establish the connection.

# Subsystem 5: Audio I/O & Processing

The microphone on the board will capture the audio input, which will be processed by an audio codec module. Once the audio output has been fetched from the internet by the MCU, it will be passed through the audio codec and played through a speaker.

# Subsystem 6: Text Display

An additional feature of our project will be a text display. An LCD screen attached to the microcontroller will display the text once the ESP32 module has converted the audio input/output into text.

# Subsystem 7: Debug Module

A serial port will be temporarily integrated into the PCB for debugging the output from the ESP32 processor.
Additionally, a programmer will be connected to the MCU for programming purposes. |
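As a rough check on the power system in Subsystem 3, the heat dissipated in a linear regulator is approximately the voltage drop across it times the load current. The 5V input and 3.3V output come from the description above; the 300 mA load current is an assumed figure for illustration only, and quiescent current is ignored.

```latex
\begin{align*}
P_{\text{LDO}} &= (V_{\text{in}} - V_{\text{out}})\, I_{\text{load}}
               = (5\,\text{V} - 3.3\,\text{V}) \times 0.3\,\text{A}
               = 0.51\,\text{W} \\
\eta &\approx \frac{V_{\text{out}}}{V_{\text{in}}}
      = \frac{3.3\,\text{V}}{5\,\text{V}}
      = 66\%
\end{align*}
```

Under that assumption, roughly a third of the power drawn through the regulator is lost as heat, which is worth keeping in mind when choosing the regulator package and estimating battery life.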