Quantcast
Channel: Raspberry Pi Forums
Viewing all articles
Browse latest Browse all 5286

Beginners • Suggestions on Workflow, Hardware, and Software for Voice Chatbot with Raspberry Pi 5

$
0
0
Hi, I am Roberto, a PHP and JavaScript software developer. I have been exploring the world of Raspberry for a few weeks as I aim to create a voice chatbot by integrating it with my already functional text-based chatbot web application (created with node.js API's chatGPT) currently running on Heroku, to assist the sales staff of a company. I am now planning to extend this chatbot to include a voice-based version, utilizing a Raspberry Pi 5 as the input/output device while delegating all processing operations to the cloud server.

Project Objective

Workflow:

Voice Input: The sales staff speaks a command (e.g., "What are the orders for customer Rossi?").
Role of the Raspberry Pi: The Raspberry Pi records the audio through a microphone, converts it to text, and sends the textual input to the cloud application on Heroku via an API call.

Cloud Processing:

The Heroku server, running on Node.js, processes the command, queries a database, and generates a textual response.
All complex operations, including data processing, are managed by the server, with the Raspberry Pi serving as an intermediary.
Raspberry Pi Output:
The Raspberry Pi receives the response from the cloud, converts it to speech using a text-to-speech library, and plays it through a speaker.
During the voice playback, an LED lights up to indicate that the chatbot is responding.
Hardware:

The system must fit within a 20x20x20 cm case.

Questions:

Is the Raspberry Pi 5 suitable for this type of application?
Which USB microphone and compact speaker would you recommend to ensure good recording and audio output quality in a small space?

Software:

The Raspberry Pi will act solely as an intermediary between the user and the cloud server.

I plan to use Node.js on the Raspberry Pi to:

Send textual input to the Heroku server via API calls.
Receive the processed textual response and convert it to audio using text-to-speech libraries.
I would like confirmation that the Raspberry Pi 5 supports Node.js and recommendations for libraries to use for:

- Speech-to-Text: Converting voice commands into text.
- Text-to-Speech: Generating audio from the received text.

Additional Specifications:

The chatbot responses are brief and generated quickly by the cloud server, so the Raspberry Pi only needs to manage voice input/output and LED control.

I aim to minimize latency and power consumption.

Specific Questions:

Is the Raspberry Pi 5 the best choice for a project where the Pi is only an intermediary, with all processing delegated to the cloud server on Heroku?

Which USB microphones and compact speakers (suitable for a 20x20x20 cm case) would you recommend for achieving good audio quality?

What libraries or tools would you suggest for:

Speech-to-Text: Converting audio into text?

Text-to-Speech: Producing natural-sounding voice output?

What software configuration (OS, libraries, Node.js versions) would you recommend to ensure optimal compatibility and performance on Raspberry Pi 5?

I would like to start with this kit:

https://www.kubii.com/fr/kits-nano-ordi ... #/ram-4_gb

What do you think about it?

Thank you very much for your support!

Any suggestions or advice, whether for hardware or software, will be greatly appreciated.

Thank you 8-)
Roberto

Statistics: Posted by DeveloPress — Wed Dec 25, 2024 5:09 pm — Replies 0 — Views 0



Viewing all articles
Browse latest Browse all 5286

Trending Articles