Frequently Asked Questions

Everything you need to know about winWhisper's transcription capabilities, features, and functionality. Find answers to common questions about our speech-to-text service.

Q1
Why are there two transcription models to choose from, and what is the difference?

We offer two transcription models to balance speed, accuracy, and functionality:

gpt-4o-transcribe: Current premium model with superior accuracy and speed. It provides the best-quality transcription with built-in language understanding.
Whisper-1: OpenAI's classic Whisper model, fast and cost-effective. For advanced modes (Clean, Rewrite, User), it uses a two-step process with secondary AI processing for enhanced results.

Whisper-1 supports many more languages (70+) and can be used natively for translation. Both models deliver professional-grade accuracy.

Q2
Why is there sometimes additional use of the secondary LLM?

Secondary LLM processing is applied automatically for certain enhanced transcription modes (Rewrite mode or User mode).

This two-step process ensures you get both accurate transcription AND intelligent post-processing tailored to your needs.

Q3
Is any of my data stored on the servers (voice or transcriptions)?

No, your audio and transcriptions are never permanently stored on our servers. Your privacy is our priority:

  • Audio files are processed in real time and immediately discarded
  • Transcribed text is only returned to your device
  • We only retain minimal metadata (timestamps, usage statistics) for service operation
  • All data processing complies with our strict privacy policy

Q4
What is the maximum length of a recording?

The current maximum recording length is 5 minutes per transcription. This limit ensures optimal processing speed and quality.

For longer content, we recommend breaking it into smaller segments for the best results.
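As a rough illustration of splitting a longer recording, the sketch below plans segment boundaries that each stay within the 5-minute limit. The function name `plan_segments` and the equal-length strategy are assumptions made for this example and are not part of winWhisper. In practice you may prefer to cut at natural pauses in speech.

```python
import math

MAX_SEGMENT_SECONDS = 5 * 60  # the 5-minute limit stated above


def plan_segments(total_seconds: float,
                  max_seconds: float = MAX_SEGMENT_SECONDS) -> list[tuple[float, float]]:
    """Return (start, end) offsets in seconds, each span no longer than max_seconds.

    Hypothetical helper: it only computes boundaries and does not touch audio.
    """
    if total_seconds <= 0:
        return []
    # Smallest number of segments that keeps every segment within the limit.
    count = math.ceil(total_seconds / max_seconds)
    # Equal-length segments avoid ending with one very short piece.
    length = total_seconds / count
    return [(i * length, min((i + 1) * length, total_seconds)) for i in range(count)]


# A 12.5-minute recording (750 seconds) is planned as three segments.
print(plan_segments(750))
```

Each planned segment can then be recorded or exported separately and transcribed on its own.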

Q5
What is the difference between the modes?

Choose the perfect mode for your needs:

Raw: Natural speech transcription with minimal processing, capturing exactly what was said
Clean: Removes filler words, fixes grammar, improves readability while maintaining your natural voice
Rewrite: Transforms casual speech into polished, professional text suitable for documents
User: Applies custom instructions (e.g., "rewrite in the language of Shakespeare," "translate to Spanish," "technical writing style")

Q6
How can I use winWhisper for translation?

Two easy ways to translate your audio:

Using the Whisper-1 model: The selected language is the language of the transcription. For example, you can select Spanish and speak in English.
Using User mode: Add custom instructions such as "translate this text to Spanish".

Q7
How is the time used to deduct credits calculated?

Credits are deducted based on your actual audio duration in seconds, converted to minutes. For example:

  • 30 seconds = 0.5 minutes of credits
  • 2 minutes 15 seconds = 2.25 minutes of credits

Processing time doesn't affect credit usage. Only your audio length matters.
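The conversion above can be sketched in a few lines. The function name `credits_in_minutes` is hypothetical, and the sketch assumes no rounding is applied, since the examples above show fractional minutes and no rounding rule is stated.

```python
def credits_in_minutes(audio_seconds: float) -> float:
    """Convert audio duration in seconds to minutes of credits.

    Assumption: credits are charged as exact fractional minutes, with no rounding.
    """
    if audio_seconds < 0:
        raise ValueError("audio duration cannot be negative")
    return audio_seconds / 60


print(credits_in_minutes(30))           # 30 seconds
print(credits_in_minutes(2 * 60 + 15))  # 2 minutes 15 seconds
```

The two calls correspond to the 30-second and 2-minute-15-second examples listed above.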

Q8
Do I need an internet connection to use winWhisper?

Yes, winWhisper requires an internet connection for transcription processing. The app connects to the cloud-based AI models to provide fast, accurate transcriptions.

Your desktop app handles recording locally, but sends audio securely to our servers for processing.

Q9
How accurate is the transcription?

The transcription accuracy is industry-leading, typically 95%+ for clear audio.

Accuracy depends on audio quality, background noise, accent, and speaking clarity. Both models perform exceptionally well for professional and personal use.

Still Have Questions?

Can't find the answer you're looking for? We're here to help.

Contact our support team for additional assistance