What are Speech Recognition Providers?
Hedy supports multiple speech recognition options, giving you flexibility to choose between complete privacy with local processing or cloud-based alternatives. You can switch providers anytime based on your current needs - use local for offline sessions and cloud services when you prefer their specific features.
Getting Started
Open the Hedy app
Navigate to Settings (tap your profile icon)
Scroll to "Speech Recognition Options"
Select your preferred provider from the dropdown menu
Configure provider-specific settings if needed
Your selection takes effect in the next recording session
Available Providers
Hedy offers three speech recognition options, each with unique characteristics:
Local Speech Recognition: Default option - 100% private, works offline, no usage costs. Your audio never leaves your device.
Deepgram: Cloud-based service with real-time streaming and smart formatting features. Requires your own API key.
OpenAI: Cloud transcription with Voice Activity Detection and automatic language detection. Requires your own API key.
Configuring Local Speech Recognition
When using Local Speech Recognition, you can optimize for your device and needs:
For macOS Users:
Small Model: Fastest processing, recommended for Intel Macs
Regular Model: Balanced speed and accuracy for most users
Large Model: Enhanced capabilities for non-English languages (requires 1.5GB download)
For iOS/Android Users:
Standard Model: Default option suitable for most devices
Large Model: Alternative model option (iPhone 12+ or 2024+ Android recommended)
Transcript Speed Settings:
Slower: Waits for complete sentences before displaying
Normal: Balanced speed and display timing
Faster: Near real-time display with more frequent updates
Setting Up Cloud Providers
Deepgram Setup:
Create an account at console.deepgram.com
Generate an API key from your dashboard
In Hedy Settings, select Deepgram from the dropdown
Paste your API key and tap "Test" to verify
Choose your model and language preferences
Set maximum session duration to control costs
OpenAI Setup:
Get your API key from platform.openai.com/api-keys
In Hedy Settings, select OpenAI from the dropdown
Enter your API key and test the connection
Choose your preferred model
Optionally enable Voice Activity Detection with adjustable sensitivity
Set maximum session duration for cost control
Choosing the Right Provider
Select based on your priorities and use case:
Privacy First: Use Local Speech Recognition - audio never leaves your device
Offline Use: Local Speech Recognition works without internet
Cloud Features: Deepgram and OpenAI offer cloud-based processing
Voice Detection: OpenAI includes Voice Activity Detection features
Smart Formatting: Deepgram offers automatic formatting options
No Usage Costs: Local Speech Recognition has no per-minute charges
Cost Considerations
Understanding the cost implications of each provider:
Local Speech Recognition: Free - no usage charges
Deepgram: Pay-per-minute pricing (check current rates on their dashboard)
OpenAI: Usage-based pricing (check current rates on their platform)
The maximum session duration setting helps prevent accidental overnight recordings and manage API costs.
Best Practices
Start with Local Speech Recognition to familiarize yourself with the feature
Test cloud providers with short recordings before important sessions
Monitor your API usage on provider dashboards to track costs
Use different providers for different scenarios based on your needs
Switch to local when traveling or in areas with limited internet
Set appropriate maximum session durations (60-120 minutes for typical meetings)
Troubleshooting
API Key Not Working
Ensure you copied the complete key without spaces
Verify your account has available credits
Check the API key has necessary permissions
Try regenerating the key from provider dashboard
Connection Test Failed
Check your internet connection stability
Verify firewall isn't blocking WebSocket connections
Ensure API key is active with sufficient quota
Wait a moment and try again (temporary service issues)
Transcription Issues
For Local: Try a different model size
For Cloud: Check internet connection stability
Ensure microphone is properly configured
Minimize background noise during recording
Settings Not Saving
Wait for the "Saved" indicator to appear
Don't switch screens while saving
Restart the app if issues persist
Ensure you have a stable internet connection
Your API keys are stored securely in your device's encrypted keychain and never transmitted to Hedy's servers. For maximum privacy with sensitive conversations, always use Local Speech Recognition.
