# Voice Commands

{% hint style="info" %}
By default, all voice transcriptions happens on device using a local Whisper model. It should work across multiple languages, you can increase quality by changing the underlying model or setting common used words (see [transcription](https://dreetje-1.gitbook.io/inbox-ai/transcription "mention")).
{% endhint %}

In the [advanced-settings](https://dreetje-1.gitbook.io/inbox-ai/advanced-settings "mention"), you can see the configuration for voice commands. Here you can configure the model and whether to send audio data directly to the ai model (this is only supported by OpenAI's `gpt-4o-audio-preview`and should be considered beta). For more information on these settings see [transcription](https://dreetje-1.gitbook.io/inbox-ai/transcription "mention").

<figure><img src="https://2980396867-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fkm2shtnhVydyNEDdxbX1%2Fuploads%2FyNStcrXotui8YZljrzpG%2FCleanShot%202024-10-29%20at%2021.28.57%402x.png?alt=media&#x26;token=63c789c5-e953-480c-859b-367e5c7577ce" alt=""><figcaption><p>Audio settings</p></figcaption></figure>

Once you finish the [quick-start](https://dreetje-1.gitbook.io/inbox-ai/quick-start "mention"), voice commands should work right away. Try holding the hotkey and say: "Write an email to my co-worker about the benefits of coffee" and see what happens.

By default, the "Process Audio" command is used to handle your command. Let's dive into this command and see how it works. Go to "Actions", and select "Process Audio".

<figure><img src="https://2980396867-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fkm2shtnhVydyNEDdxbX1%2Fuploads%2FtrYRkertOeekuQqrlLKo%2FCleanShot%202024-10-15%20at%2009.08.28%402x.png?alt=media&#x26;token=9341b776-3d4c-46d7-9757-5fa13d4ce3d5" alt=""><figcaption><p>The process audio action</p></figcaption></figure>

Process audio is an [ask-ai](https://dreetje-1.gitbook.io/inbox-ai/actions/custom-actions/ask-ai "mention") action, that uses the default AI provider and has a lot of actions it can choose to perform. Each action usually reflects a specific command. By default it has these actions:

* Ask ChatGPT / Claude / Perplexity
* Draft Email
* [add-apple-reminder](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/system-actions/add-apple-reminder "mention")
* Open Browser
* [get-selected-text](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/system-actions/get-selected-text "mention")
* [paste-at-cursor](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/system-actions/paste-at-cursor "mention")
* [send-notification](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/system-actions/send-notification "mention")
* Open Application
* [take-screenshot](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/system-actions/take-screenshot "mention")
* [do-nothing](https://dreetje-1.gitbook.io/inbox-ai/actions/built-in-actions/do-nothing "mention")

You can add or customize these actions as you please. Below the actions, you see the prompt which controls the AIs main behavior, this too can be fully customized.

In the prompt, you can specify the behavior you are looking for. The `{{ value }}` placeholders in the prompt will be replaced by variables before sending them to the AI. In this case the `{{ originalInput }}` is your spoken command, and your name and some time information is also inserted. For more information on templating, see the [templating](https://dreetje-1.gitbook.io/inbox-ai/templating "mention") page.
