new

1.8.0

Features

Remote clients (experimental)
: You can finally connect to your Wingman AI host from other devices on your network.
Use your iPad or Laptop as a second screen!
While you cannot record your voice using external clients, the TTS output will be played on both your host and external clients. You can also send text from external clients to the host and most user interface interactions will work on external clients, too. Remember that Wingman AI is still running on your PC, so changes are applied there. To connect, simply start Wingman AI on your main PC and then navigate to
http://<host-ip>:5173
in the browser on your external device(s). You might have to allow your firewall on the host to listen to this port. You can find out the local host IP by typing
ipconfig
in a shell or by inspecting your network adapter in Windows.
Wingman Pro
: We
improved the performance
of our existing regions and added a
new Asia region
. You have to update to 1.8.0 to get the full benefits, especially if you're in the US or in Asia.
We added a new
Shut up key
which defaults to
Shift+y
(can be rebound in Settings) and cancels any current TTS output via shortcut
Improved Skills UI
that tells you better what Skills actually do. Our in-client tutorial videos are now played in a modal dialog to
de-clutter the interface
We now support OpenAI's new TTS API with our new
“OpenAI-compatible TTS" provider
and you can use it to connect to
local TTS systems like CoquiTTS (xtts2)
or others. This is huge because now you can go "full local" for the first time ever and even have voice cloning. If you want to dive into local TTS, we highly recommend @teddybear082's Wingman AI version of CoquiTTS. He has Wingman-specific instructions in this repository and can help you to set it up.
Our
Google provider now supports function calling
, so that you can use the gemini-models to their full capabilities. They’re really fast and have quite the “personality”, so try them out! You need an API key but the free tiers are very generous.
Added
Hume AI
as new TTS provider. It’s a service similar to Elevenlabs but more affordable.
Added
speed
and
model
parameters and
ballad
voice for OpenAI TTS
updated Perplexity models
we added a
{SKIP-TTS}
phrase. If you prompt the LLM to add this string to its reply (or write a Skill that does it), the reply will appear in the Wingman UI but not be sent to the TTS provider. This is useful for “silent replies”.
Skills now have a new
on_play_to_user
hook that is called just before we send the text to the TTS provider. You can modify the text here to remove smileys from the response, add the
{SKIP-TTS}
phrase etc.
You can set default hotwords for FasterWhisper using
Settings > Edit default configuration
and these are now separate from the Wingman-specific hotwords. In other words: Wingmen (and Skills like
UEXCorp
) can now add additional hotwords on top of the default ones properly.
Added new custom property type
audio_device
for Skills that will render our audio device picker in the UI
We are now tracking some anonymous usage data in our client to understand better which features you're (not) using. We do not log your backstories or any user-generated content, it's things like "changed LLM provider to x" or "opened Sound Effects Library". We cannot identify you with this data and we won't sell it. You can opt-out in the Settings view.

New Skills

Audio Device Changer
: Lets you change the desired output device per Wingman. This is useful for VTubers who need each Wingman on a dedicated virtual sound device to display avatars etc.

Skill changes

Vision AI
: improved Skill prompt
Timer
: added silent mode
American/Euro Truck Simulator
: fixed location data
UEXCorp
: minor adjustments and optimizations

Bugfixes

FasterWhisper no longer crashes when you remove a language set previously
Fixed Elevenlabs provider not loading models (thanks @lugia19)
Wingman Pro request errors are now properly sent to the client so that we can display them properly in the UI
fixed missing translations in French and Spanish
The initial Setup Wizard now asks for your NVIDIA RTX series and sets FasterWhisper’s
computeType
to
float16
if it’s a 5000 card, preventing a crash
Clicking on the LLM model dropdown and leaving without selecting a new one no longer crashes Wingman AI Core
models are not (re)loaded properly after entering a secret

Breaking changes & migration:

Wingman AI Core now needs a client to be connected to it. We had issues before where Skills were requesting data using the Wingman Pro API before the user was authenticated, so Skill are now initialized after the user has logged in.
Unfortunately, we have to reset your FasterWhisper hotwords during migration. So you'll have to enter them again. You can still access your old list of hotwords by looking at your old 1_7_0 configs in your APPDATA directory.
Custom Skills have to be migrated manually as usual. This time we only changed the structure of the default_config.yaml to display Skills better in the UI:
category: str
was replaced by
tags: list[str]
. Added new properties
display_name
(may contain whitespaces) and
author
. Simplified
examples
to only contain questions now (so a list of
de
/
en
strings).

Known issues:

Hume currently doesn't list custom voices which we already reported to them. You can still set one by editing the config manually. Let us know if you need help with that!
Elevenlabs lowered their request limit significantly. If it fetches your voices and you then open the wrench to fetch the models too quickly, the second request might fail. The workaround is to wait a couple of seconds after the voices have been loaded and then click the wrench. You can toggle the wrench to send the request again. If it fails, the model field will just disappear.