Show HN: SpeechOS – Wispr Flow-inspired voice input for any web app
Hi Hacker News! I’m David Huie. I’m launching SpeechOS, a drop-in voice input SDK for web apps.
I was inspired by Wispr Flow and wanted the same workflow inside business apps (CRMs, docs, forms, support tools), not just a standalone tool. It’s saved me a massive amount of time vs typing.
How it works: add a couple lines of JS plus an API key, and SpeechOS shows a small mic widget on every text field.
Live demo: https://www.speechos.ai/
(Click a text box and the mic widget appears; click the gear icon to see settings, custom vocabulary, and snippet configuration.)
Users can:
- Dictate: speak naturally, real-time voice to polished text (punctuation, no filler/typos)
- Edit: say “make it shorter”, “fix grammar”, “translate...”
- Command: define Siri-style app actions (for example, “submit form”, “mark complete”), and we match intent to your commands
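To make the "match intent to your commands" idea concrete, here is a minimal sketch. It is not the SpeechOS implementation or its SDK API: the `VoiceCommand` shape, `matchCommand`, and the 0.5 threshold are all hypothetical, and plain word overlap (Jaccard similarity) stands in for whatever intent model the real service uses.

```typescript
// Hypothetical shapes for illustration only; not the SpeechOS SDK's API.
interface VoiceCommand {
  name: string;
  phrases: string[]; // example utterances that should trigger the command
  run: () => string; // returns a label so the effect is easy to observe
}

// Lowercase, replace punctuation with spaces, and collect the distinct words.
function tokenize(text: string): Set<string> {
  const words = text
    .toLowerCase()
    .replace(/[^a-z0-9\s]/g, " ")
    .split(/\s+/)
    .filter((w) => w.length > 0);
  return new Set(words);
}

// Jaccard similarity: shared words divided by total distinct words.
function similarity(a: Set<string>, b: Set<string>): number {
  if (a.size === 0 || b.size === 0) return 0;
  let shared = 0;
  a.forEach((w) => {
    if (b.has(w)) shared++;
  });
  return shared / (a.size + b.size - shared);
}

// Return the best-scoring command, or null if nothing clears the threshold.
function matchCommand(
  transcript: string,
  commands: VoiceCommand[],
  threshold = 0.5 // assumed cutoff; would need tuning on real utterances
): VoiceCommand | null {
  const spoken = tokenize(transcript);
  let best: VoiceCommand | null = null;
  let bestScore = 0;
  for (const cmd of commands) {
    for (const phrase of cmd.phrases) {
      const score = similarity(spoken, tokenize(phrase));
      if (score > bestScore) {
        best = cmd;
        bestScore = score;
      }
    }
  }
  return bestScore >= threshold ? best : null;
}

const commands: VoiceCommand[] = [
  { name: "submitForm", phrases: ["submit form", "send the form"], run: () => "submitted" },
  { name: "markComplete", phrases: ["mark complete", "mark as done"], run: () => "completed" },
];
```

With these definitions, `matchCommand("Please submit the form.", commands)` resolves to the `submitForm` command, while an unrelated utterance returns `null` so the text can fall through to normal dictation.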
It also supports:
- Custom vocabulary: domain terms and names (product names, acronyms, jargon) so they transcribe correctly
- Text snippets: saved reusable blocks of text you can insert by voice (for example, “my signature”, “disclaimer”, “my address”)
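As a rough illustration of how custom vocabulary and snippets could be applied, here is a sketch that post-processes a finished transcript. Everything in it is an assumption: `DictationConfig`, `postProcess`, and the sample entries are hypothetical, and a real system would more likely bias the recognizer itself rather than rewrite its output afterwards.

```typescript
// Hypothetical config shape for illustration; not taken from the SpeechOS SDK.
interface DictationConfig {
  vocabulary: Record<string, string>; // likely mis-hearing -> canonical spelling
  snippets: Record<string, string>; // spoken trigger -> text to insert
}

// Escape characters that have special meaning inside a regular expression.
function escapeRegExp(s: string): string {
  return s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Replace every whole-phrase, case-insensitive occurrence of each key.
function replacePhrases(text: string, table: Record<string, string>): string {
  let out = text;
  // Longer keys first, so a short key cannot clobber part of a longer one.
  const keys = Object.keys(table).sort((a, b) => b.length - a.length);
  for (const key of keys) {
    const pattern = new RegExp("\\b" + escapeRegExp(key) + "\\b", "gi");
    // A function replacement keeps "$" in the value from being interpreted.
    out = out.replace(pattern, () => table[key]);
  }
  return out;
}

// Apply vocabulary fixes first, then expand snippet triggers.
function postProcess(transcript: string, config: DictationConfig): string {
  const corrected = replacePhrases(transcript, config.vocabulary);
  return replacePhrases(corrected, config.snippets);
}

const config: DictationConfig = {
  vocabulary: { "speech o s": "SpeechOS", "whisper flow": "Wispr Flow" },
  snippets: { "my signature": "Best,\nDavid" },
};
```

Because the replacements run sequentially, text inserted by one rule can be matched again by a later rule; a production version would want a single pass over the transcript.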
Why: text entry speed and accuracy still matter for productivity tools. A large-scale text entry dataset with 37,370 participants showed an average typing speed of 36.2 WPM and ~2.3% uncorrected errors. A Stanford study found that speech recognition was about 3× faster than keyboard input and had a ~20.4% lower error rate for English text entry. (https://hci.stanford.edu/research/speech/)
SpeechOS is currently in beta and free for now. Sign up at https://app.speechos.ai/accounts/signup/ and enter this HN-only beta code: HN-JFc74cVC (please don’t share outside HN).
Links:
SDK repo: https://github.com/speechos-org/speechos-client
Demo: https://www.speechos.ai/
Signup (code: HN-JFc74cVC): https://app.speechos.ai/accounts/signup/
I’d love feedback on:
1) Where this is most valuable in your stack (notes? docs? CRM entry? support macros?)
2) What you’d want from voice commands/snippets configuration
3) What would make you comfortable shipping this (privacy/security, latency, pricing)
If you’re building anything in voice AI/dictation, I’d be happy to swap notes (david@speechos.ai).
This feels less about typing speed and more about workflow compression. You just dictate your thoughts, edit, and then it takes action for you?
Pretty neat idea!
That's exactly right. The aim of this product is to let businesses offer that service to their own users. Right now, users only get intelligent dictation if they already bring their own app, like Wispr Flow.
You want companies to pay you to process their customers' voices and private data, but I don't see a privacy policy or even what jurisdiction you operate in. Your Terms and Conditions page is literally a placeholder. That's a huge red flag.
I completely understand. This product is in beta testing with no paid customers. We'll definitely have that in there at launch.
If you're processing people's data, it doesn't matter whether it's free or not; you need to understand your legal obligations. You can't have a blank Terms of Service. You should really talk to a legal professional.