new
1.8.0
Features
- Remote clients (experimental): You can finally connect to your Wingman AI host from other devices on your network.Use your iPad or Laptop as a second screen!While you cannot record your voice using external clients, the TTS output will be played on both your host and external clients. You can also send text from external clients to the host and most user interface interactions will work on external clients, too. Remember that Wingman AI is still running on your PC, so changes are applied there. To connect, simply start Wingman AI on your main PC and then navigate tohttp://<host-ip>:5173in the browser on your external device(s). You might have to allow your firewall on the host to listen to this port. You can find out the local host IP by typingipconfigin a shell or by inspecting your network adapter in Windows.
- Wingman Pro: Weimproved the performanceof our existing regions and added anew Asia region. You have to update to 1.8.0 to get the full benefits, especially if you're in the US or in Asia.
- We added a new Shut up keywhich defaults toShift+y(can be rebound in Settings) and cancels any current TTS output via shortcut
- Improved Skills UIthat tells you better what Skills actually do. Our in-client tutorial videos are now played in a modal dialog tode-clutter the interface
- We now support OpenAI's new TTS API with our new “OpenAI-compatible TTS" providerand you can use it to connect tolocal TTS systems like CoquiTTS (xtts2)or others. This is huge because now you can go "full local" for the first time ever and even have voice cloning. If you want to dive into local TTS, we highly recommend @teddybear082's Wingman AI version of CoquiTTS. He has Wingman-specific instructions in this repository and can help you to set it up.
- Our Google provider now supports function calling, so that you can use the gemini-models to their full capabilities. They’re really fast and have quite the “personality”, so try them out! You need an API key but the free tiers are very generous.
- Added Hume AIas new TTS provider. It’s a service similar to Elevenlabs but more affordable.
- Added speedandmodelparameters andballadvoice for OpenAI TTS
- updated Perplexity models
- we added a {SKIP-TTS}phrase. If you prompt the LLM to add this string to its reply (or write a Skill that does it), the reply will appear in the Wingman UI but not be sent to the TTS provider. This is useful for “silent replies”.
- Skills now have a new on_play_to_userhook that is called just before we send the text to the TTS provider. You can modify the text here to remove smileys from the response, add the{SKIP-TTS}phrase etc.
- You can set default hotwords for FasterWhisper using Settings > Edit default configurationand these are now separate from the Wingman-specific hotwords. In other words: Wingmen (and Skills likeUEXCorp) can now add additional hotwords on top of the default ones properly.
- Added new custom property type audio_devicefor Skills that will render our audio device picker in the UI
- We are now tracking some anonymous usage data in our client to understand better which features you're (not) using. We do not log your backstories or any user-generated content, it's things like "changed LLM provider to x" or "opened Sound Effects Library". We cannot identify you with this data and we won't sell it. You can opt-out in the Settings view.
New Skills
- Audio Device Changer: Lets you change the desired output device per Wingman. This is useful for VTubers who need each Wingman on a dedicated virtual sound device to display avatars etc.
Skill changes
- Vision AI: improved Skill prompt
- Timer: added silent mode
- American/Euro Truck Simulator: fixed location data
- UEXCorp: minor adjustments and optimizations
Bugfixes
- FasterWhisper no longer crashes when you remove a language set previously
- Fixed Elevenlabs provider not loading models (thanks @lugia19)
- Wingman Pro request errors are now properly sent to the client so that we can display them properly in the UI
- fixed missing translations in French and Spanish
- The initial Setup Wizard now asks for your NVIDIA RTX series and sets FasterWhisper’s computeTypetofloat16if it’s a 5000 card, preventing a crash
- Clicking on the LLM model dropdown and leaving without selecting a new one no longer crashes Wingman AI Core
- models are not (re)loaded properly after entering a secret
Breaking changes & migration:
- Wingman AI Core now needs a client to be connected to it. We had issues before where Skills were requesting data using the Wingman Pro API before the user was authenticated, so Skill are now initialized after the user has logged in.
- Unfortunately, we have to reset your FasterWhisper hotwords during migration. So you'll have to enter them again. You can still access your old list of hotwords by looking at your old 1_7_0 configs in your APPDATA directory.
- Custom Skills have to be migrated manually as usual. This time we only changed the structure of the default_config.yaml to display Skills better in the UI: category: strwas replaced bytags: list[str]. Added new propertiesdisplay_name(may contain whitespaces) andauthor. Simplifiedexamplesto only contain questions now (so a list ofde/enstrings).
Known issues:
- Hume currently doesn't list custom voices which we already reported to them. You can still set one by editing the config manually. Let us know if you need help with that!
- Elevenlabs lowered their request limit significantly. If it fetches your voices and you then open the wrench to fetch the models too quickly, the second request might fail. The workaround is to wait a couple of seconds after the voices have been loaded and then click the wrench. You can toggle the wrench to send the request again. If it fails, the model field will just disappear.