A bilingual voice-powered AI companion, built using XiaoZhi’s intelligent assistant platform.
English | 中文 | 日本語 | Русский | Français | Español | Deutsch | हिन्दी
Wally is a fun, reliable voice assistant in the shape of a penguin. It is designed to mainly help users with reminders, but can also assist in anything from productivity, to daily conversations, to emotional support.
Note
This will soon be available for purchase here. Give it a star to stay tuned for updates!
I just keep forgetting stuff. A lot of people around me have the same issue as well, I’ve noticed. It’s annoying to the point where it drives people crazy. As a competitive swimmer, after the nth time of forgetting to bring my electrolyte drink or swim gear to practice, I decided to build a tool to remind myself.
Wally is designed to have all the normal functionalities of an AI assistant, while also being intentionally interesting in appearance. This is what separates it from other reminder assistants out there. I find that a cute, fun-to-use device creates a better impression on users, meaning you’re more likely to remember to ask it for help and actually listen to its reminders.
After four months, I completed my first prototype and gave a few to friends at school to test out. It turned out that all of them absolutely loved using it and urged me to make it public. So here it is.
These are some ways my family and friends used Wally:
| Scenario | Usage |
|---|---|
| School | Quick essentials check before school day |
| Practice | Confirm practice gear before/after training |
| Travel | Final packing check during trips |
-
Voice Wake-Up: Wake word: "Hi, Wally" or "Hi Wall-E" triggers Wally to start listening and responding
-
Bilingual Voice Chat: Fluent interaction in both English and Mandarin (although it's Mandarin accent might sound strange at the moment)
-
Real-Time Streaming Responses: Instant replies from XiaoZhi-powered Qwen LLM backend
-
OLED/Display Feedback: On-screen visualization of detected user speech said and system replies.
-
Small and Portable: Simple, miniature form that is easy to place or carry around.
-
Adorable and Fun To Use: Intentionally cute penguin design, child-like voice and various on-screen emojis for interaction.
Watch on Youtube or download it here.
-
Powered by: XiaoZhi cloud + ESP32 firmware
-
Backend: Uses XiaoZhi’s LLM (Qwen, Deepseek, etc) response system
-
Frontend: Device firmware via XiaoZhi flashing tools and preset images
-
Wake Word Engine: ESP-SR based offline recognition
-
Voice Synthesis: TTS via Volcano Engine or CosyVoice
-
Outer Shell: Designed in Tinkercad and 3D printed via Bambu Lab P1
-
Turn on Wally: Flip the switch on Wally’s back to power it on.
-
Enter Configuration Mode: Remove Wally’s head to access the internal board. On the back of the board, locate the round button next to the white battery stand. Press that button and the one labeled “B” at the same time. Wally will report: “Entering network configuration mode.”
-
Connect to Wally’s Setup Network: On your phone or laptop, connect to the Wi-Fi network named
Xiaozhi. A configuration webpage should automatically open. If it doesn’t, open your browser and go to the address displayed on Wally’s screen. -
Enter Your Wi-Fi Details: On the setup page, input your Wi-Fi network’s name (SSID) and password. (You can choose to remember this network for future use.)
-
You're All Set!: Once configuration is complete, follow the next set of instructions to start using Wally.
Important
Only follow the below instructions once network configuration has been completed.
- Say "Hi, Wally" to wake Wally up.
- Start chatting with Wally.
- Say any form of goodbye you like to shut Wally down.
Tip
For a more personalized experience, it may be helpful to list details about your belongings you want Wally to remember. The information will be stored in Wally's memory and can be cleared any time simply by asking it to.
Special thanks to:
- Xiao Xia and his amazing project XiaoZhi
- The Xiao Zhi Platform
Contributions are welcome. Feel free to open issues and pull requests.
This project is licensed under the MIT License.