Add speech input support #1456

Open
wants to merge 1 commit into
base: main


NullLabTests

This PR adds speech input support to the Goose CLI using Vosk for offline speech recognition.

Features:

  • Toggle voice input mode with the /voice command
  • Start/stop recording with Ctrl+V
  • Uses Vosk for offline speech recognition
  • Cross-platform audio capture with cpal
  • Seamless integration with existing input system
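
For reviewers, the interaction between the /voice toggle and the Ctrl+V shortcut can be pictured as a small state machine. The sketch below is an illustration only: the type and function names (`VoiceState`, `VoiceEvent`, `next_state`) are hypothetical and are not taken from this PR's code, and it assumes that Ctrl+V does nothing while voice mode is off and that turning voice mode off during a recording discards it.

```rust
/// Hypothetical states for the voice-input feature (names are illustrative,
/// not taken from the PR).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum VoiceState {
    /// Voice mode is off; Ctrl+V is ignored (assumption).
    Disabled,
    /// Voice mode is on, but nothing is being recorded.
    Idle,
    /// Audio is being captured for recognition.
    Recording,
}

/// Hypothetical user actions that affect voice input.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum VoiceEvent {
    /// The user entered the /voice command.
    ToggleCommand,
    /// The user pressed Ctrl+V.
    CtrlV,
}

/// Returns the state that follows `state` after `event`.
pub fn next_state(state: VoiceState, event: VoiceEvent) -> VoiceState {
    use VoiceEvent::*;
    use VoiceState::*;
    match (state, event) {
        // /voice turns the mode on from Disabled, and off from anywhere else.
        (Disabled, ToggleCommand) => Idle,
        (Idle, ToggleCommand) | (Recording, ToggleCommand) => Disabled,
        // Ctrl+V only starts or stops a recording while voice mode is on.
        (Disabled, CtrlV) => Disabled,
        (Idle, CtrlV) => Recording,
        (Recording, CtrlV) => Idle,
    }
}
```

Keeping the transitions in one pure function like this would make the shortcut behavior easy to unit test without a microphone or a recognition model.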

Testing:

  • Tested on Linux
  • Voice commands are processed as regular text input
  • Works with all existing commands
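
One way to read "processed as regular text input" is that a finished transcript goes through the same entry point as a typed line. The following is a minimal sketch under that assumption; `Action`, `handle_line`, and `handle_transcript` are hypothetical names, not the functions used in this PR or in Goose.

```rust
/// Hypothetical outcome of handling one line of input.
#[derive(Debug, PartialEq, Eq)]
pub enum Action {
    /// A slash command such as /voice, stored without the leading slash.
    SlashCommand(String),
    /// Ordinary text to send on as a message.
    Message(String),
    /// Empty or whitespace-only input.
    Ignore,
}

/// Single entry point for text input, whether typed or transcribed.
pub fn handle_line(line: &str) -> Action {
    let trimmed = line.trim();
    if trimmed.is_empty() {
        Action::Ignore
    } else if let Some(cmd) = trimmed.strip_prefix('/') {
        Action::SlashCommand(cmd.to_string())
    } else {
        Action::Message(trimmed.to_string())
    }
}

/// A transcript gets no special path: it is handed to `handle_line`
/// exactly as if the user had typed it.
pub fn handle_transcript(transcript: &str) -> Action {
    handle_line(transcript)
}
```

With this shape, any behavior that works for typed text applies to spoken text too, since both reach the same function.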

Dependencies added:

  • cpal = "0.15.2"
  • whisper-rs = "0.8.0"
  • vosk = "0.2.0"

Note: This feature was developed and tested on Linux only. Other platforms may need additional installation instructions for the speech recognition dependencies.

- Add voice input mode with /voice command
- Implement Ctrl+V shortcut for recording
- Use Vosk for offline speech recognition
- Add cpal for cross-platform audio capture
- Update help text with voice commands
@michaelneale
Collaborator

cool! would it be possible to have this on desktop as well? (in some ways perhaps more common to be used there?)

2 participants