Changelog

Follow up on the latest improvements and updates.

RSS

Skill Changes
  • ImageGeneration
    can now save to file and passes the file path to chained function calls
Bugfixes
  • Closing Wingman AI client now properly terminates whispercpp and xvysynth processes (if it launched them)
  • fixed config directories being recreated when the default config dir was changed (on Windows)
  • fixed "white screen error" when changing the default Wingman in the active config using the client
  • Wingman Pro: hide "Subscribe" buttons if you already have a subscription
  • fixed some minor client issues
Features
  • We were forced to implement a
    new payment provider
    for Wingman Pro. See our announcements on various channels for more information.
  • Audio Library
    : There is now a new
    audio_library
    directory in your config directory and you can put any number of
    .mp3
    and/or
    .wav
    files in there. You can also structure them im subdirectories. These files are imported into Wingman AI and you can use them as new action type in Commands or in Skills. You can now create commands that press hotkeys and then play a sound file - or the other way around. If you select multiple files, a random one will be chosen to play. Sounds familiar, HCS Voice Packs/VA?
  • AI Sound effect generation
    : You don’t have an audio library yet or need a new sound for a very specific use case? Use
    Elevenlabs
    to generate mp3 files on the fly using a simple prompt!
  • We
    redesigned Command Configuration
    . It’s no longer modal but a dedicated view giving it more screen space. We also optimised the UX and made all the options for instant activation and responses much easier to understand. You can now also tell the AI not to say anything after executing command which is useful if you just want to play a sound file, for example.
  • Wingman Client now has a new Audio Library drawer that lets you see your current sound effects and Elevenlabs subscription details (including characters left)
  • Added new LLM providers & models:
  • Cerebras
    : A very(!) fast provider similar to Groq. They offer access to
    llama-3.1
    models and are currently free-to-use. New models will be added automatically to Wingman AI.
  • Perplexity
    : Trusted real-time data powered by
    llama-3.1
    . Does not support function calling in Wingman AI but there is a Skill to solve that (see below). It’s a paid service, you will need a funded account with an active API key
  • OpenAI
    models are now fetched dynamically so that you can use all available models with your own API key
  • Config migrations are now sequential, meaning that you will always be able to migrate from any(!) version since 1.4.0 to the new version, even if you skipped some in between. Note that custom (non-official) Skills currently can’t be migrated and Wingmen that are using these, will fail to migrate. Just delete the Skills from your Wingman configs manually, then copy over your custom Skill code to 1.6.0 and run the migration again. Finally re-attach your custom Skills to your now migrated Wingmen in the client.
Skill changes
  • FileManager
    can now read PDF files
New Skills
  • Image Generation
    : Let your Wingman create images on the fly using Dall-E 3 (free & unlimited with Wingman Pro)
  • MSFS 2020 Control
    : Control and retrieve data from Microsoft Flight Simulator 2020 using SimConnect
  • ThinkingSound
    : Use files from the new AudioLibrary to play sounds while the AI is generating responses
  • AskPerplexity
    : Use the Perplexity API to get up-to-date information on a wide range of topics. The Skill allows you to run a model with function calling capabilities (like Wingman Pro/gpt-4o) as main driver for your Wingman while still providing access to recent data from Perplexity.
Bugfixes
  • Fixed secret prompts immediately disappearing before you could enter anything. You also no longer have to restart Wingman AI after entering certain secrets for providers or Skills
  • Fixed a bug where trying to remove a voice from the VoiceSelection list (used in RadioChatter and VoiceChanger) removed the voice before that
  • Better error handling and performance improvements for Elevenlabs
  • The “ToC” modal no longer auto-closes before you can click it
  • Closing Wingman Client now properly unloads Skills and ends the XVASynth task (instead of accidentally launching it)
  • XVASynth: added missing “disabled” label
  • Fixed Discord invite links (because our old one expired) & updated some docs
Features
  • Autosave
    : You complained in our survey that the Wingman AI client was unresponsive and sometimes felt sluggish and as your favourite UX-driven development company, we listened. From now on, almost any action and config change you can make happens ad-hoc and no longer requires reloading the config you’re in. This brings a couple of benefits: increased performance as loading a config is expensive, way less conversation history resets, no more “Save” buttons, more real estate in the UI as we don't need the ActionBar anymore and you can't forget to save.
  • Config Migrations
    . Finally. Wingman AI 1.5.0 (and future versions) will now take your configs from previous versions and migrate them to the current format, meaning that you’ll keep almost all of your old secrets, settings, configs and Wingmen. You’ll find a new
    .migration
    file in your 1.5.0 config directory after the first start. This file is a log and as long it’s there, Wingman AI will not attempt to migrate again on start. If you want to re-migrate your 1.4.0 configs for some reason, delete this
    .migration
    file (and the 1.5.0 configs), then restart Wingman AI. We only support migrating from 1.4.0 to 1.5.0 but future versions might also be able to migrate step-by-step starting from version 1.4.0.
    If you are a developer and have built your own skills in older versions
    , you have to copy the files from
    /[old_version]/skills/[your-skill]
    to
    /[new_version]/skills/[your-skill]
    manually because we cannot migrate unknown skills reliably. Restart Wingman AI afterwards.
  • Our beloved TTS provider
    whispercpp is now bundled (and auto-started) with Wingman AI
    and is the new default for everything so that everyone can use it. Check out the new section the Settings view to set it up. If you have a NVIDIA RTX GPU, make sure to check the “use CUDA” option. If you’re using custom models (other than
    ggml-base.bin
    ), just copy them into the new
    whispercpp-models
    directory in your Wingman AI installation directory and the UI will find them.
  • Edit default config:
    In the Settings, you’ll find a new button and view that lets you change the defaults config (
    defaults.yaml
    ). This includes the highest level
    system_prompt
    that Wingman AI uses. Be careful if you modify this and please only do so if you know what you’re doing. Yes, editing the default config also has autosave.
  • Added
    volume slider
    to Wingman configuration. If it’s set to zero, TTS processing will be skipped entirely.
  • added
    OpenAI
    ’s much cheaper and faster
    gpt-4o-mini
    model and made it the new default. The larger
    gpt-4o
    model is still available in case you need it.
  • added
    Google Gemini
    as LLM provider. Function calling doesn’t work (yet) but you can use it for chit-chat and roleplaying. It’s very fast and pretty good - try it! Unfortunately, we can’t provide it with Wingman Pro, so you’ll need your own API key. The smaller model is currently free.
  • Groq
    and
    Elevenlabs
    models are now fetched using their API so that new models will be available in Wingman AI immediately and without an update. Check out Groq’s new
    llama3.1
    models with function calls and Elevenlabs’ brand-new
    eleven-turbo-2.5
    model finally supporting multiple languages.
  • We removed the
    summarize_provider
    so that
    tool calls can now be chained
    , meaning that the response to a function call can call another function and so on. This doesn’t sound like much but is actually huge. You (and skills) can now basically give commands like “Use the XYZ skill and then do ABC with the result”.
  • We split settings and configuration for
    XVASynth
    and improved its UI support meaning that it has nice dropdown for all your downloaded models and voices now. If you want to use XVASynth, install it via Steam, then
    enable and configure it in the Wingman AI Settings
    view. After that, it will appear in the list of TTS providers. We cannot migrate your old XVA settings, so please do that once, even if you were already using XVASynth.
  • Wingman AI Core can now
    open directories and files
    using Windows Explorer or OSX Finder. We added a new useful buttons like “open logs” or “open config directory” to the client.
  • We added
    new custom property types
    "VoiceSelection" and "Slider" so that skills can display our fancy UI components for them. No more writing JSON into text boxes to select voices...
  • We
    removed DirectSound
    drivers for audio devices because it caused more problems than it solved. The only available driver is MME now. Therefore, we have to reset your audio device settings during the migration, sorry. Make sure you check the “Settings” view after upgrading.
Bugfixes
  • Links in LLM responses now have
    target="_blank"
    set so that they’ll open in a new browser Window
  • Skills are now unloaded correctly when removed. No more infinite zombie timers. We also added some new hooks for developers.
  • fixed an issue with our AudioPlayer that sometimes resulted in a crash when using the RADIO_LOW or RADIO_MEDIUM sound effects
  • fixed an issue with mic selection while using VA
  • fixed an error occurring when using a default audio device
  • fixed renamed configs being recreated on startup (again)
  • fixed not being able to un-default renamed Wingmen
  • fixed Wingmen being disabled when a secret from their configuration was missing on startup
  • fixed Elevenlabs
    get_available_voices
    API endpoint changing for no apparent reason and without warning us ;) Thanks again to @lugia19 who fixed this very quickly in his great
    elevenlabslib
  • Request a feature is now opened in a new browser window because integrating it could cause a “blank screen” error.
  • If Wingman AI Core fails to start, we now show better trouble-shooting infos pointing to our Discord #support channel in the client.
New Skills
  • APIRequest
    : This one is a powerhouse. Point it to a docs page or (OpenAPI) spec for an API of your choice and it will understand the endpoints and then be able to call them - all on-thy-fly! We already gave it the Wingman AI Core API spec and well.. it was able to “send text” to another Wingman immediately without us ever implementing that. Wingmanception! You can also feed it any public online API like Pokedex or whatever. Try it out!
  • RadioChatter
    plays customizable random AI-generated chatter over time. You can even answer to these messages if you want to. It also uses our new custom property types now (in case you were already using the Discord versions).
  • Auto-Screenshot
    : Takes a screenshot if you request it or in moments where your voice input suggests interesting, scary or funny moments. Example: “Aww look, a spaceship!” => takes a screenshot
  • ATSTelemetry
    : Retrieve game state information from American Truck Simulator or Euro Truck Simulator 2
  • NMSAssistant
    : fetch information about No Man's Sky items, elements, crafting, cooking, expeditions, community missions, game news, and patch notes. Powered by NMS Assistant API.
Skill changes
  • ControlWindows and FileManager
    got some new capabilities like new supported file formats, text-to-clipboard and appending text to existing files
  • Timer
    skill can now loop and works more efficiently
  • VoiceChanger
    now makes use of the new custom property types and offers a nice voice selection in the UI
  • Vision
    now outputs the direct LLM responses in
    debug_mode
Skills:
  • added
    Timer
    : Let your Wingman delay and time command executions or responses
  • added
    VoiceChanger
    : Let your Wingman change its voice automatically and customisable
  • added
    WebSearch
    : Searches the web using the DuckDuckGo search engine. You can also specify a specific page/URL that it will crawl.
  • added
    TypingAssistant
    : Let your Wingman type text for you (in any app etc.)
  • added
    QuickCommands
    : Learn command phrases you use regularly and auto-convert them to instant activation commands to execute them faster
  • added
    Vision
    : Use AI to take and analyse a live screenshot, then ask questions about it
  • added
    FileHandler
    : Read and create files and directories
  • UEXCorp
    now supports even more UEX API v2 features and no longer requires an API key
  • ControlWindows
    can now use your clipboard and supports window snapping
Features & Changes
  • added
    Clippy Wingman
    - the famous assistant we all know and hate, now resurrected with AI powers and using a lot of our new skills. He resides in a new config name “General”.
  • added
    AI avatar generation
    for your Wingman using DALL-E
  • improved the
    quality of all of our audio effects
    a lot
  • “ROBOT” sound effect is now called “AI”
  • “RADIO” sound effect now has 3 different variants to simulate a low/medium/high quality radio device
  • new
    Apollo Beep
    sound
  • Your Wingman can now say
    instant filler responses
    (like “Give me a second to…”) for long-running tasks
  • The Wingman AI Client can now render Markdown responses with
    links, images, different text formats
    etc. (like SlickGPT)
  • you can now define
    multiple commands with the same instant activation phrase
    and they’ll all be executed (with a better response) if you say the phrase
  • Wingman AI client no longer saves your entire Wingman configuration but
    only the differences
    to the
    defaults.yaml
    config. This is a huge change and will help us with future config migrations and allow us to finally make the defaults editable in the UI in a later version.
Bugfixes
  • fixed renamed or deleted default Configs/Wingmen being recreated on restart
  • fixed a bug enabling Voice Activation even if it was disabled in the Settings if you (accidentally) pressed the un/mute key (default: Shift+x)
  • fixed a bug causing dependencies defined by Skills not being loaded properly
  • fixed drag&drop to reorder action “rows” in the Commands tab
  • fixed silent error/crash on start when no mic is connected
  • added missing Azure STT language selection for Voice Activation (or PTT with external clients) in Settings
  • fixed creating new configs from templates ("Empty" vs. "Star Citizen")
  • fixed conversation not being cleared if the "clear button" was clicked in "View all" terminal mode
  • fixed mouse button actions not being displayed in the Command list

improved

fixed

1.3.1

  • added the updated UEXCorp skill that now works with the new UEX API v2.0
  • Free users can now fully explore (but not save) all skills in the client
  • StarHead skill is now attached to the Computer Wingman by default so that Free users can use it
  • improved default contexts of our demo Wingmen
  • fixed a bug with un/muting that lead to Voice Activation being enabled when it shouldn't
  • fixed an issue when handling Alt Gr in command bindings
  • fixed renamed default Wingmen being recreated on restart
  • fixed a bug caused by Wingmen without configured activation keys
  • fixed a bug with saving the Basic configuration
  • gpt-4o
    for Wingman Pro/Ultra is now served via Azure (instead of OpenAI)

new

improved

fixed

1.3.0

  • Added
    gpt-4o
    model. The latest & greatest from OpenAI ticks all the boxes for Wingman AI, so we are making it the new default model for... everything. We've even removed the
    gpt-3.5-turbo
    and
    gpt-4-turbo
    options because we're sure we won't need them anymore. General responses should already be significantly faster, and it will get much faster once OpenAI unlocks audio input/output to the API. The model also seems to work well with our new contexts and capabilities. Please let us know (with examples) if we are wrong!
  • StarHead is now a Skill
    , meaning you can attach it to any Wingman.
  • Added new
    Control Windows Skill
    to launch applications, minimise/maximise Windows etc. For example, tell it to “Open Spotify”. Note that the skill currently only looks for applications in your Start Menu directory (
    %APPDATA%/Roaming/Microsoft/Windows/Start Menu/Programs
    ), so if it tells you that it cannot find an application, create a shortcut in that directory.
  • Added new
    Spotify Skill
    to play songs or playlists, "like” songs, control the playback device and more. The skill is very intelligent and honestly takes Spotify to another level because you can ask it things like: “Play the theme song from Bad Boys”. Try it out, it’s really good! You will need to create a developer account for your Spotify Premium account and then connect Wingman to it. The Spotify client has to be running on any one of your clients and must be playing a song. Otherwise, the Spotify Web API won't find any active device for your account. You can find more information and instructions on our Discord server.
  • Added
    Groq and OpenRouter
    (in favour of the old "Llama") as AI providers. You can select any of their models from a list in the UI, but only Mistral and Llama3 have been tested with AI function calls. However, you can use other models for roleplaying etc. and experiment with them. Our demo contexts may need to be tweaked to work properly with other models.
  • Added
    "Local LLM" as an AI provider
    . This makes it easier for you to connect to different models from different providers. Use
    Local LLM
    instead of the old
    OpenAI > Advanced > base_path
    setting to connect to Ollama, LMStudio etc. You can now enter your own custom model name and are no longer limited to the drop down choices in the client.
  • Updated default commands for
    Star Citizen 3.23
    . There have been a lot of changes in the last SC patch and our commands are by no means a complete mapping for everything. Feel free to improve these bindings and post your updates in Discord. We can collect your changes and then release them for everyone to use.
  • Added a handler and websocket allowing you to connect
    esp32
    devices to Wingman AI. This is what we used in our Walkie Talkie for Wingman AI short video. We’ll publish a blog post about this but until then, contact @Timo if you’re interested in connecting external devices to Wingman AI using
    OpenInterpreter
    etc.
  • Added
    Debug Mode
    toggle to global settings. Enable this for more verbose system messages and additional benchmarks-
  • @JayMatthew’s beloved
    UEXCorp
    Wingman has already been converted to be a skill, too. Unfortunately, the UEXCorp API was changed completely yesterday, so he will fix the skill as soon as possible and you can then install it (without another Wingman AI update) when it’s done. You can even use it together with StarHead now! You may need to give special commands to determine which skill you want to prefer when asking for a good trading route. We are experimenting with this, but we need more data and experience from you, so please let us know how it goes.
  • fixed crashes when AI function calling with llama3 models
SlickGPT Pro (For Wingman Ultra users):
  • added
    gpt-4o
    model for both SlickGPT Free and Pro users
  • Free users can now use
    llama3
    (for free via Groq) and
    mistral-large
    with their own API keys. We are waiting for a fix in Azure to serve these two options in our SlickGPT Pro backend.
  • various UI improvements on desktop & mobile viewports
Spotify skill
  • We updated the
    StarHead Wingman
    to support better trading data and added a new function to get detailed ship data. You can ask for ship details or even compare different ships or ship systems. Try something like "What is the difference between a Constellation Taurus and a Constellation Andromeda", for example.
  • We worked around the super slow performance of
    GPT-4-Turbo
    , so you can now use it again. It's still slower than GPT-3.5-Tubo.
  • Remove emotes
    from Audio Playback, if they are in the text result. Sometimes the AI want's to express emotions like this, but it was very annoying in the audio output.

new

improved

1.2.0

  • Mistral AI
    has been added as a top class provider. You can use your own API key to access all their cloud models or use the most powerful one (mistral-large) with Wingman Pro. Please note that if you use Mistral, you will need to adapt your contexts, as it does not seem to handle our examples written for GPT very well. Mistral supports function calls, so start with "You are a helpful assistant", which should work with AI commands, and iterate from there.
  • We have also added
    experimental support for Llama3
    . It's all ready in Wingman AI, but not yet in Azure, so if you try to use it with Wingman Pro, your requests will time out. We are waiting for Microsoft to fix this and will let you know when it works (hopefully with no additional client update). Until then, you can use 3rd party providers like Groq (super fast!) or host it locally and play around with it. Function calls won't work yet because we need to write some more code to parse them, but you can already chat with it and run instant commands as usual.
  • Reworked command UI. You can now add
    mouse and write (text) actions
    . You can also delete individual actions from a command's action list and reorder them using drag and drop.
  • Improved Windows audio device settings. Wingman AI should now handle disconnected devices or changed hardware configurations much better.
  • Improved function call parsing to be more robust when using local LLMs.
  • Added a UI option to make OpenAI the default provider for all Wingmen settings. This makes it easier to use Wingman AI without a subscription. Note that you can still manually change your default provider in the config defaults.yaml in your config directory.
Unfortunately we have to change our payment provider for tax reasons.
We are switching from Stripe to Paddle.
We have already changed the whole process for new users/subscribers. If you have a subscription with Stripe, you can still manage it using the client and our website. We are currently trying to determine if we need to cancel and migrate current Stripe subscriptions.
If you would like to help with the transition, please cancel in Stripe and then re-subscribe with Paddle if your subscription has expired. You don't need to do anything at the moment, but if this changes we'll let you know!
  • fixed error when switching config
  • text input is now disabled if the focused Wingman is disabled

new

improved

fixed

1.1.0

  • You can now
    send text messages
    to your Wingmen! To do so:
  • Select a Wingman in the bar on the left to open a filtered "
    Focus view
    " which only shows the conversation history with this particular Wingman. If you talk to another Wingman while in this mode, the view will automatically switch to the activated Wingman. Select "View all" to see the shared "Terminal" with messages of all your Wingmen (as usual).
  • fixed handling of "extended" keys and added a default "hold time" of 0.1s which should
    greatly improve keypress command executions
    . If you had "Wingman doesn't press the key" issues before, please try again with this version!
  • added
    "extended macro" recordings
    , meaning that key down/up events will be recorded separately. This gives you more flexibility to record complex and staggered macros
  • The client now has a
    button to reset the conversation history
    (in both Focus and Terminal view)
  • reworked "energy threshold" controls for Voice Activation in the client. It's no longer a slider but a numeric input with presets (low/standard/custom) and better hint texts to explain what meaningful values are.
  • fixed Microsoft password policy to be less restrictive
  • fixed Conversation/Summarize model not being saved properly when using the client
  • changed/moved various API endpoint URLs
Load More