mgsgde/whisper-shortcut: Speech-to-text and voice-to-prompt macOS app with Gemini and Whisper support

WhisperShortcut

Speech-to-Text and Voice-to-Prompt shortcut for macOS with Google Gemini and Offline Whisper support

📺 Watch Demo Video

Download & Support

Download for FREE via GitHub Releases

Support the Project ❤️

WhisperShortcut is open source and free to use. If you want to support the development, you can purchase the app on the App Store:

Buy on Mac App Store

This is completely optional. If you like the app, a purchase or a review on the App Store is a fantastic way to say thank you! ⭐

How it works

Transcription Mode (Speech-to-Text)

Choose between cloud (Google Gemini) or offline (Whisper) transcription:

Setup - For cloud transcription, configure your Gemini API key. For offline transcription, download a Whisper model in Settings (no API key needed).
Press Shortcut - Start recording with a keyboard shortcut
Transcribe - Uses your selected model (Gemini or Whisper) for accurate speech-to-text
Copy to Clipboard - Automatically copies transcription to your clipboard

Prompt Mode (Voice-to-Prompt)

Speak instructions that apply to selected clipboard text:

Select Text - Copy text you want to modify to your clipboard
Press Shortcut - Start recording your voice instruction
Process - Gemini processes both your voice instruction and the selected text
Get Result - Modified text is automatically copied to your clipboard
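
The four steps above amount to a read-transform-write cycle on the clipboard. The sketch below illustrates only that data flow and is not WhisperShortcut's actual code: `pbpaste` and `pbcopy` are the standard macOS clipboard commands, while `apply_instruction`, `prompt_mode`, `CLIP_READ`, and `CLIP_WRITE` are hypothetical names, and the Gemini request is replaced by a stand-in that only understands the instruction "uppercase".

```shell
#!/bin/sh
# Sketch of the Prompt Mode data flow (illustrative, not the app's code).

# Clipboard commands default to the macOS tools; override them on other systems.
CLIP_READ=${CLIP_READ:-pbpaste}
CLIP_WRITE=${CLIP_WRITE:-pbcopy}

# Hypothetical stand-in for the Gemini request: $1 is the spoken instruction,
# $2 is the clipboard text. Only "uppercase" is handled; any other
# instruction returns the text unchanged.
apply_instruction() {
  case $1 in
    uppercase) printf '%s' "$2" | tr '[:lower:]' '[:upper:]' ;;
    *)         printf '%s' "$2" ;;
  esac
}

# prompt_mode INSTRUCTION: read the clipboard, transform it, write it back.
prompt_mode() {
  selected=$("$CLIP_READ")
  result=$(apply_instruction "$1" "$selected")
  printf '%s' "$result" | "$CLIP_WRITE"
}
```

On macOS, copying some text and then running `prompt_mode uppercase` would leave the uppercased text on the clipboard. In the real app, the instruction comes from the recording and the transformation is done by Gemini.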

Installation

Recommended: Download App

Download the latest .dmg file from the Releases page.
Open the DMG and drag WhisperShortcut to your Applications folder.

Build from Source

# Clone the repository
git clone https://github.com/mgsgde/whisper-shortcut.git
cd whisper-shortcut

# Install the app
bash install.sh

Features

Speech-to-Text Transcription: Converts audio to text using Google Gemini (cloud) or Whisper (offline)
Voice-to-Prompt Mode: Speak instructions to modify selected clipboard text using Gemini AI
Offline Support (Privacy Mode): Use local Whisper models for completely offline transcription
Smart Clipboard Integration: Automatic copy to clipboard for both modes
Customizable Shortcuts: Configurable keyboard shortcuts for each mode

Development

Prerequisites

macOS 15.5+
Xcode 16.0+
Gemini API key (required only for cloud transcription and prompt mode; not needed for offline Whisper transcription)
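
The version minimums above can be checked from a terminal before opening the project. This is a minimal sketch rather than part of the repository: `version_ge` and `check_prerequisites` are hypothetical helper names, and the check assumes `sw_vers` and `xcodebuild` are on the PATH, as they are on a Mac with Xcode installed.

```shell
#!/bin/sh
# Sketch: verify the documented minimums (macOS 15.5, Xcode 16.0).

# version_ge A B: succeed when dotted version A >= B, comparing each
# dot-separated component numerically (so 15.10 counts as newer than 15.5).
version_ge() {
  a=$1
  b=$2
  while [ -n "$a" ] || [ -n "$b" ]; do
    x=${a%%.*}
    y=${b%%.*}
    # Drop the component just read; a missing component counts as 0.
    case $a in *.*) a=${a#*.} ;; *) a= ;; esac
    case $b in *.*) b=${b#*.} ;; *) b= ;; esac
    [ "${x:-0}" -gt "${y:-0}" ] && return 0
    [ "${x:-0}" -lt "${y:-0}" ] && return 1
  done
  return 0
}

# check_prerequisites: read the installed versions and compare them.
check_prerequisites() {
  macos=$(sw_vers -productVersion)
  # First line of `xcodebuild -version` looks like "Xcode 16.0".
  xcode=$(xcodebuild -version | awk 'NR == 1 { print $2 }')
  version_ge "$macos" 15.5 || { echo "macOS ${macos:-?} is below 15.5" >&2; return 1; }
  version_ge "$xcode" 16.0 || { echo "Xcode ${xcode:-?} is below 16.0" >&2; return 1; }
  echo "OK: macOS $macos, Xcode $xcode"
}
```

Running `check_prerequisites` on the development machine prints a one-line summary or names the tool that is too old.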

Open in Xcode

git clone https://github.com/mgsgde/whisper-shortcut.git
cd whisper-shortcut
open WhisperShortcut.xcodeproj

Release Process

To create a new release:

Update the version number in Xcode (Target > General > Version).

Run the release helper script:

scripts/create-release.sh

This will tag the commit and trigger the GitHub Action to build and release the app.
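
For readers curious what tagging a release involves, here is a rough manual equivalent. It is a sketch under assumptions, not the contents of scripts/create-release.sh: the `v` tag prefix, the helper names `tag_for_version` and `tag_release`, and the idea that the GitHub Action is triggered by the pushed tag are all unconfirmed.

```shell
#!/bin/sh
# Sketch of a manual tag-and-push; the real helper script may work differently.

# tag_for_version VERSION: print "vVERSION" when VERSION looks like 1.2 or 1.2.3.
# The "v" prefix is an assumed convention, not confirmed by the project.
tag_for_version() {
  if printf '%s\n' "$1" | grep -Eq '^[0-9]+\.[0-9]+(\.[0-9]+)?$'; then
    printf 'v%s\n' "$1"
  else
    echo "invalid version: $1" >&2
    return 1
  fi
}

# tag_release VERSION: create an annotated tag on HEAD and push it to origin.
tag_release() {
  tag=$(tag_for_version "$1") || return 1
  git tag -a "$tag" -m "Release $tag"
  git push origin "$tag"
}
```

Calling `tag_release` with the version number set in Xcode would create and push the corresponding tag. Prefer the project's own script for real releases, since it may perform additional steps.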

License

MIT License - see LICENSE file for details.

Made with ❤️ in Karlsruhe, Germany
