WhisperShortcut
Speech-to-Text and Voice-to-Prompt shortcut for macOS with Google Gemini and Offline Whisper support
Download & Support
Download for FREE via GitHub Releases
Support the Project ❤️
WhisperShortcut is open source and free to use. If you want to support development, you can purchase the app on the App Store.
This is completely optional. If you like the app, a purchase or a review on the App Store is a fantastic way to say thank you! ⭐
How it works
Transcription Mode (Speech-to-Text)
Choose between cloud (Google Gemini) or offline (Whisper) transcription:
- Setup - For cloud transcription, configure your Gemini API key. For offline transcription, download a Whisper model in Settings (no API key needed).
- Press Shortcut - Start recording with a keyboard shortcut
- Transcribe - Uses your selected model (Gemini or Whisper) for accurate speech-to-text
- Copy to Clipboard - Automatically copies transcription to your clipboard
Prompt Mode (Voice-to-Prompt)
Speak instructions that apply to selected clipboard text:
- Select Text - Copy text you want to modify to your clipboard
- Press Shortcut - Start recording your voice instruction
- Process - Gemini processes both your voice instruction and the selected text
- Get Result - Modified text is automatically copied to your clipboard
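The flow above can be approximated on the command line. The sketch below is not the app's implementation. It assumes `curl` and `jq` are installed, that `GEMINI_API_KEY` holds your key, and that a typed instruction stands in for the recorded voice instruction. The function names `build_prompt` and `apply_instruction` and the model name are illustrative choices.

```shell
# Sketch of the Prompt Mode data flow using the public Gemini REST API.
# Assumptions: GEMINI_API_KEY is set, curl and jq are installed, and the
# model name is a placeholder. The instruction is plain text here; the app
# records it by voice.

# build_prompt INSTRUCTION TEXT: combine the instruction with the text to edit.
build_prompt() {
  printf 'Instruction: %s\n\nText:\n%s\n' "$1" "$2"
}

# apply_instruction INSTRUCTION: rewrite the clipboard text in place (macOS).
apply_instruction() {
  if [ -z "$GEMINI_API_KEY" ]; then
    echo "GEMINI_API_KEY is not set" >&2
    return 1
  fi
  model="gemini-2.0-flash"   # assumption: use any model your key can access
  text=$(pbpaste)            # read the text to modify from the clipboard
  prompt=$(build_prompt "$1" "$text")

  # jq builds the JSON body so quotes and newlines are escaped correctly.
  jq -n --arg p "$prompt" '{contents: [{parts: [{text: $p}]}]}' |
    curl -sS -X POST \
      -H "x-goog-api-key: $GEMINI_API_KEY" \
      -H "Content-Type: application/json" \
      --data-binary @- \
      "https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent" |
    jq -r '.candidates[0].content.parts[0].text' |
    pbcopy                   # write the result back to the clipboard
}
```

For example, after copying a paragraph, running `apply_instruction "Make this more formal"` would replace the clipboard contents with the model's rewrite.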
Installation
Recommended: Download App
- Download the latest .dmg file from the Releases page.
- Open the DMG and drag WhisperShortcut to your Applications folder.
Build from Source
# Clone the repository
git clone https://github.com/mgsgde/whisper-shortcut.git
cd whisper-shortcut
# Install the app
bash install.sh
Features
- Speech-to-Text Transcription: Audio → Text transcription using Google Gemini (cloud) or Whisper (offline)
- Voice-to-Prompt Mode: Speak instructions to modify selected clipboard text using Gemini AI
- Offline Support (Privacy Mode): Use local Whisper models for completely offline transcription
- Smart Clipboard Integration: Automatic copy to clipboard for both modes
- Customizable Shortcuts: Configurable keyboard shortcuts for each mode
Development
Prerequisites
- macOS 15.5+
- Xcode 16.0+
- Gemini API key (required only for cloud transcription and prompt mode; not needed for offline Whisper transcription)
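The version requirements above can be checked before opening the project. This is a minimal sketch, not a script shipped with the repository: `version_ge` and `check_prereqs` are hypothetical helper names, and the check relies on the standard `sw_vers` and `xcodebuild` tools on macOS.

```shell
# Pre-flight check for the macOS and Xcode versions listed above.
# Hypothetical helper; not part of the WhisperShortcut repository.

# version_ge A B: succeeds when dotted version A is >= B.
# Components are compared numerically; missing ones count as 0.
version_ge() {
  a=$1 b=$2
  while [ -n "$a" ] || [ -n "$b" ]; do
    x=${a%%.*} y=${b%%.*}
    if [ "${x:-0}" -gt "${y:-0}" ]; then return 0; fi
    if [ "${x:-0}" -lt "${y:-0}" ]; then return 1; fi
    # Drop the component just compared, or empty the string if it was the last.
    case $a in *.*) a=${a#*.} ;; *) a= ;; esac
    case $b in *.*) b=${b#*.} ;; *) b= ;; esac
  done
  return 0
}

check_prereqs() {
  if ! command -v sw_vers >/dev/null 2>&1; then
    echo "sw_vers not found: this does not look like macOS" >&2
    return 1
  fi
  os=$(sw_vers -productVersion)
  if ! version_ge "$os" 15.5; then
    echo "macOS $os is older than 15.5" >&2
    return 1
  fi
  # xcodebuild -version prints "Xcode <version>" on its first line.
  xcode=$(xcodebuild -version 2>/dev/null | awk 'NR == 1 { print $2 }')
  if [ -z "$xcode" ] || ! version_ge "$xcode" 16.0; then
    echo "Xcode 16.0 or newer is required (found: ${xcode:-none})" >&2
    return 1
  fi
  echo "macOS $os and Xcode $xcode meet the prerequisites"
}
```

After sourcing the snippet, run `check_prereqs`. Comparing components numerically matters because a plain string comparison would rank 15.10 below 15.5.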
Open in Xcode
git clone https://github.com/mgsgde/whisper-shortcut.git
cd whisper-shortcut
open WhisperShortcut.xcodeproj
Release Process
To create a new release:
1. Update the version number in Xcode (Target > General > Version).
2. Run the release helper script:
scripts/create-release.sh
This will tag the commit and trigger the GitHub Action to build and release the app.
License
MIT License - see LICENSE file for details.
Made with ❤️ in Karlsruhe, Germany