Alternatives to VoxCommando

Compare VoxCommando alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to VoxCommando in 2024. Compare features, ratings, user reviews, pricing, and more from VoxCommando competitors and alternatives in order to make an informed decision for your business.

  • 1
    Twilio Voice
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources, like our Voice SDK. Then, add on features like Interactive Voice Response (IVR), recording transcriptions, and speech recognition to create an experience that your customers will appreciate. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice. Find docs, code samples, helper libraries, and developer tools such as Twilio Runtime and our visual workflow builder, Studio.
    Compare vs. VoxCommando View Software
    Visit Website
  • 2
    Speechmatics

    Speechmatics

    Speechmatics

    Speechmatics is the most accurate and inclusive speech-to-text API ever released. Speechmatics is the world’s leading expert in Speech Intelligence, combining the latest breakthroughs in AI and ML to unlock the business value in human speech. Businesses use Speechmatics worldwide to accurately understand and transcribe human-level speech into text regardless of demographic, age, gender, accent, dialect, or location in real-time and on recorded media. Combining these transcripts with the latest AI-driven speech capabilities, businesses build products that utilize summarization, topic and chapter detection, sentiment analysis, translation, and more. Speechmatics processes over 500 years of transcription worldwide every month in 50 languages and can translate 69 language pairs. Having pioneered machine learning in speech recognition, its neural networks consider acoustics, languages, dialects, multiple speakers, punctuation, capitalization, context, and implicit meanings.
    Partner badge
    Compare vs. VoxCommando View Software
    Visit Website
  • 3
    Google Cloud Speech-to-Text
    Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device.
    Compare vs. VoxCommando View Software
    Visit Website
  • 4
    LumenVox

    LumenVox

    LumenVox

    Transforming customer engagement with AI-driven speech recognition and voice authentication technology. We’ve spent the last 20 years empowering our partners’ success through collaboration. Our curiosity keeps us innovating for the next 20. Our flexible speech-enabling technology enables you to build a solution that fulfills all your customers’ demands, affordably and reliably. We do one thing, and we do it well. And that's speech-enabling your applications. Finally, deliver great voice automation and interactions. Whether short and simple commands, or conversational questions, LumenVox ASR and TTS is accurate and affordable, helping you improve efficiencies on both sides of the phone line. You’ll never repeat yourself again. We provide you with the utmost flexibility from a capabilities, deployment and monetization perspective. If you can think it, you can build it with LumenVox. Shorten your development to deployment time with our easy, intuitive technology and toolsets.
  • 5
    Rev

    Rev

    Rev

    Rev provides premium on-demand, manual and automated transcription, closed caption, and foreign subtitling services. With 170,000+ customers, Rev's clients span from global enterprises to freelance journalists. Rev processes more audio and video than any other provider and has the ability to scale to fit any customer's needs. Pricing is simple starting at just $0.25 per audio/video minute for automated speech-to-text services and $1.25/min for manual with 99% accuracy. Rev also offers Rev.ai which is a speech recognition engine that's available to companies that want it.
    Starting Price: $1.25 per minute
  • 6
    Voice Finger

    Voice Finger

    Voice Finger

    Enables zero computer contact, no need for keyboards and mouses. Rest your hands and use your voice to command the computer. A definitive solution for people with disabilities and/or computer injuries. Some speech recognition software assumes you can type and click for some tasks. Voice Finger was made to do everything by voice. Also for hardcore gamers. For competitive gamers, Voice Finger can hit keys and buttons while the gamer moves and shoots, acting like a third hand. Voice Finger allows complete control of the keyboard, with short commands to navigate the cursor, type, hold and hit keys and buttons. Windows default speech recognition has a lot of lengthy commands like "Press 1", "Press A" and "Press down 30 times". Voice Finger cuts down all commands to a minimum length, like "1", "A" and "Down 30", and you are still able to use the mouse buttons with commands like "click left", "click right" and others, and at the same time hold keys like Control, Shift and Alt.
    Starting Price: $9.99 one-time payment
  • 7
    Braina

    Braina

    Brainasoft

    Braina (Brain Artificial) is an intelligent personal assistant, human language interface, automation and voice recognition software for Windows PC. Braina is a multi-functional AI software that allows you to interact with your computer using voice commands in most of the languages of the world. Braina also allows you to accurately convert speech to text in over 100 different languages of the world. Braina's artificial intelligence makes it possible for you to control your computer using natural language commands and makes your life easier. Braina is not a Siri or Cortana clone for PC but rather a powerful personal and office productivity software. It isn't just like a chat-bot; its priority is to be super functional and to help you in doing tasks. Braina helps you do things you do everyday. It is a multi-functional artificial intelligence software that provides a single window environment to control your computer and perform wide range of tasks using voice commands.
    Starting Price: $29 per year
  • 8
    Work by Speech

    Work by Speech

    Mikołaj Magowski

    Work by Speech is the first program in the world that allows efficient work on a computer by speech without needing a keyboard and mouse. Work by Speech Features: - Efficient work on a computer by speech alone - Quiet speaking support - Application switching and opening by speech - Built-in voice commands for the most common actions - Custom voice commands management - Macro recording and editing - Separate dictation mode - Fast and repeatable mouse control by speech with support for all mouse actions - Customizable mousegrid that can be moved by speech - Automatic mousegrid optimization for every used application - Very low processor and memory usage - Works with any microphone under Windows 10 and 11 - Available for the English language only - Free updates
    Starting Price: Free
  • 9
    Knovvu Speech Recognition
    Automate customer processes, evaluate agent performances objectively and ensure your operations are 100% efficient. In our connected world, many consumers are interacting with everyday connected appliances in new ways. With a trend in connected devices that often lack a screen, speech is emerging as a natural, intuitive interface for human-machine interaction. Speech recognition is the driving technology behind this development, revolutionizing the way people interact with their devices. With Knovvu Speech Recognition from Sestek, machines and applications can understand user commands in spoken language. With the ability to listen to and interpret spoken demands, users may interact with these devices by speaking aloud rather than inputting buttons and keystrokes. Our automatic speech recognition software has full application. Many organizations use technology to power intuitive and straightforward self-service solutions.
  • 10
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 11
    tazti

    tazti

    Voice Tech Group

    Welcome to the tazti website! tazti is state of the art Speech Recognition & Voice Recognition software. You can easily mash up tazti to files, folders, programs, videos and songs on your PC, to open them by voice control. Play PC Games, control applications, programs, and robots by voice command! Over 300,000 people have now tried tazti and it's many features. tazti is super fun, especially if you are tired of pounding your keyboard or want an easy to use assistive technology. Great as well for people with Arthritis, Carpal Tunnel, Tendonitis, Fibromyalgia or other hand, finger or wrist pain.
    Starting Price: $39.99
  • 12
    Dragon Home

    Dragon Home

    Nuance Communications

    With a next-generation speech engine leveraging Deep Learning technology, Dragon adapts to your voice or environmental variations—even while dictating. Dragon intelligently transcribes your spoken words into text 3x faster than typing with up to 99% recognition accuracy. And with a streamlined user interface and no training required, getting started is as easy as launch and dictate! With a new playback feature, you can select a block of text and “play that back” for easy proofreading and editing as you listen to what you dictated. Dragon works with today’s popular touchscreen PCs and tablets, so you can enjoy the versatility of interacting with your favorite applications—at home or school.
    Starting Price: $200 one-time payment
  • 13
    Yandex SpeechKit
    Speech technologies based on machine learning to create voice assistants, automate call centers, monitor service quality, and perform other tasks. Leverage the advanced technology behind the wildly successful Alice voice assistant, now ready for use in your business. In a fraction of a second, SpeechKit accurately recognizes speech, allowing our clients' voice assistants to communicate quickly and easily. Choose the right version for you, the full version creates a smart voice assistant while the adaptive version gives your brand a unique voice in just a month. A solution for the most demanding customers who need to control speech processing and synthesis within their own infrastructure. SpeechKit’s ML models can now be deployed to your infrastructure. We offer both hybrid options and 100% on-premise deployments for sensitive traffic. The service can recognize audio in MP3, LPCM, and OggOpus formats.
    Starting Price: $0.000020 per unit
  • 14
    Rev.ai

    Rev.ai

    Rev.ai

    Rev.ai was built by leading speech recognition experts from millions of hours of accurate human-transcribed content. We began in 2011 with Rev.com, providing human transcription services. We are now the world's largest transcription vendor, with over 35,000 contractors who transcribe millions of minutes of audio each month. In 2017 we launched Temi, an automated speech-to-text transcription and editing service. Temi has already transcribed 20 million minutes of content and was named the best transcription service by Wirecutter. Today our best-in-class speech engine is available to everyone as Rev.ai. We're helping companies get the most out of their audio and video content by making it searchable and accessible.
  • 15
    Dragon Professional Group

    Dragon Professional Group

    Nuance Communications

    Empower employees to dictate documents 3 times faster than typing with up to 99% recognition accuracy, right from the first use. Since documents are created in a fraction of the time it would typically take typing by hand, they spend less time on paperwork, and more time on more profitable tasks. With a next‑generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse workgroups and settings. Dragon makes it easy to automate tasks or short‑cut repetitive steps. Create custom voice commands to insert standard boilerplate text or signatures into documents. Or create time‑saving macros to automate multi‑step workflows by voice. Once created, share these customizations across the Dragon user community for efficiency gains.
  • 16
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 17
    GoVivace

    GoVivace

    GoVivace

    Our automatic speech recognition engine supports several English accents and can be localized to any language. Also, the ASR engine supports standard telephony as well as web and mobile applications. Being capable of actioning voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, the GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. This automatic speech recognition engine compares the spoken input with a number of pre-specified possibilities and convert speech to text. The entire set of pre-specified possibilities constitute the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only very simple grammar for its processing. It can also support very large grammars for complex tasks.
  • 18
    TrulyNatural
    Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state of-the-art cloud performance. Unlike voice recognition software often used with smartphones and mobile devices, such as with a voice assistant mobile app, as well as with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a wifi connection. Many applications don’t need or want to rely on cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. The market concerns regarding privacy, performance and bandwidth are driving more processing to the edge.
  • 19
    Azure AI Speech
    Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.
  • 20
    Phonexia Speech Platform
    Phonexia offers a comprehensive portfolio of cutting-edge speech recognition and voice biometrics technologies ready to meet any commercial and governmental scenarios. Powered by the latest advancements in artificial intelligence, acoustics, phonetics, and voice biometrics science, Phonexia products are extremely accurate, fast, and scalable. Phonexia’s AI-powered solutions let you build voicebots, verify a speaker’s identity based on voice biometrics, transcribe speech to text, and search for speakers and context in large amounts of audio. Secure access to your clients’ data conveniently with voice biometric authentication and detect fraud attempts natively. Phonexia offers a comprehensive portfolio of cutting-edge speech recognition and voice biometrics technologies ready to meet any commercial and governmental scenarios. Powered by the latest advancements in artificial intelligence, acoustics, phonetics, and voice biometrics science.
  • 21
    SpeechMotion
    Document a patient encounter with full or partial dictation, voice recognition, or on-the-go with a customized solution tailored to your unique environment. Solving common documentation issues, like lowering costs and integrating workflows, begins with choosing a solution designed to meet your evolving needs. Improve workflow efficiencies and physician adoption for a rapid return on investment with a partner committed to your long-term success. A leading, national provider of US-based transcription, speech recognition, voice capture and advanced documentation technologies, SpeechMotion partners with healthcare facilities and the organizations supporting them to create a customized documentation solution tailored to support both long and short-term goals. SpeechMotion provides the flexible options healthcare facilities need to quickly and efficiently document a complete patient story, all under one product and service umbrella.
  • 22
    Scribe

    Scribe

    Scribe Technology Solutions

    “The Future is NOW!” – with the addition of ScribeNow! Speech Recognition to our flagship product, ScribeMobile, the future of medical documentation is here in the palm of your hand. ScribeNow! enhances ScribeMobile’s already robust set of documentation services – traditional dictation, charting, and live scribing. With ScribeNow! Speech Recognition, providers quickly and easily document encounters in real-time. This gives providers the flexibility they need to improve their productivity, profitability, and patient care with one easy to use solution, with a wide range of integration capabilities available. Scribe TeleCare is an innovative solution that is providing opportunities for healthcare providers to continue to service their clients AND have completed documentation to support the care of their patients and facilitate reimbursement with one easy to use tool. No more trying to use an app that is not healthcare focused to connect remotely to your patients.
    Starting Price: $59.95/month/user
  • 23
    Dragon Legal Individual

    Dragon Legal Individual

    Nuance Communications

    Legal professionals in practices of all sizes face documentation overload, resulting in document backlogs, high transcription costs, and less time for billable work. Use Dragon Legal Individual speech recognition to create and manage legal documentation—quickly and accurately—by voice. Built with a specialized legal vocabulary to deliver optimal recognition accuracy—right out of the gate—when you dictate legal terms. Quickly dictate and edit case files, contracts, and briefs by voice; even format legal citations automatically. Add custom words specific to your practice or create custom commands to quickly insert standardized content and shortcut repetitive tasks by voice. Record legal notes using a digital recorder for later transcription by you or your staff; streamlined setup lets you transcribe audio files with speed and ease.
    Starting Price: $500 one-time payment
  • 24
    Azure Speaker Recognition
    A Speech service feature that verifies and identifies speakers. Enable frictionless, secure customer experiences: Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Unlock value from scenarios with multiple speakers: Determine a speaker’s identity from within a group of enrolled speakers. Speaker identification enables you to attribute speech to individual speakers, support multiuser voice recognition for personalized interactions, and more.
  • 25
    Acusis

    Acusis

    Acusis

    Acusis’ approach to Revenue Cycle Management (RCM) is full circle that provides finest experience to their clients. Acusis has a tenured team consisting of proven RCM experts and consultants on billing, coding, CDI, risk adjustment, HCC, account receivables and denials management. Clinical documentation management is simple and cost-effective with Acusis’ unique approach of combining cutting-edge technology and professional documentation services. While eCareNotes speech recognition platform helps Physicians save time and focus on delivering care, Acusis professional services team focuses on making life easy for HIM by offering superior editing services. From dictation capture to cutting-edge voice recognition, Acusis offers a wide array of cloud-based products for simplifying MTSO transcription workflow management. eCareNotes, the flagship technology platform helps MTSOs as well as in-house transcription teams of hospitals to reduce documentation costs and stay compliant.
  • 26
    Yactraq

    Yactraq

    Yactraq

    Yactraq is the industry value leader in speech analytics software. Our customers typically realize benefits across two broad functional areas. Marketing teams looking to extend their Voice-of-the-Customer (VoC) capabilities beyond the feedback form and social media now want to mine sales and customer service phone calls as part of their omni-channel capability. Contact Center Quality Management teams typically use speech analytics / audio mining as a way of leveraging AI / Machine Learning to evaluate the performance of their call agents. Yactraq offers customized free trials based on a clients own data so they can experience the value of our software before deciding to buy. Our products are cost-effectively priced to suit the needs of end customers as well as partners in the Business Process Outsourcing (BPO), Contact Center as a Service (CCAS), Voice-of-the-Customer (VoC), CRM Software and Network Service Provider businesses.
  • 27
    Ctalk

    Ctalk

    Ctalk

    Realize the benefits of contact center, IVR, speech recognition, call recording, unified communications, outbound dialing without replacing your existing telephony platform. The Ctalk contact centre system 'wraps around' your existing PBX seamlessly adding features or more capacity. That's why you don't have to rip and replace. Effectively handle more calls and contacts with the same or reduced resources. Significantly reduce your support costs and dependency on I.T. by empowering multiple administrators with on the fly call management. Dramatically Increase first contact resolution. Know who is calling and why, then route to the right agent every time. 24/7 automated services blend seamlessly with proactive outbound calling.
  • 28
    LumenVox Automatic Speech Recognition (ASR)
    Transforming customer engagement with AI-powered voice recognition and voice authentication technology. Our flexible voice-enabled technology allows you to create a solution that meets all of your customers' demands, affordably and reliably. We do one thing, and we do it well. And that's voice enablement for your apps. Finally, deliver great voice automation and interactions. Whether it's short, simple commands or conversational questions, LumenVox ASR and TTS are accurate and affordable, helping you improve efficiency on both sides of the phone line. You will never repeat yourself. Recognize multiple dialects from a single global language model to serve all your customers. We give you maximum flexibility from a capabilities, implementation and monetization perspective. If you can think it, you can build it with LumenVox
  • 29
    INVOX Medical
    The most intuitive voice dictation program on the market. Convenient and instant audio-to-text transcription. The program has a clear and simple design, which guarantees a comfortable, fast and precise operation. INVOX Medical has specific dictionaries and is adapted to many medical specialties. INVOX Medical accurately recognizes a wide variety of medical terminology. INVOX Medical is the voice recognition software already trusted by thousands of medical professionals around the world. It's accurate, easy, and incredibly intuitive. In a few minutes you will be dictating your medical reports with complete accuracy. And in addition, it has an unbeatable price. INVOX Medical uses the latest technology in the use of artificial intelligence to help you dictate your medical reports with maximum precision, allowing you to work up to three times faster. The system allows you to add terms to the dictionary, replace words and modify their pronunciation at any time.
    Starting Price: $35 per month
  • 30
    e-Speaking

    e-Speaking

    e-Speaking

    An easy software solution to enable you to control your computer, dictate emails and letters, and have the computer read documents back to you. Command and control your Window's computer through your voice. Operate your computer using a minimum of keystrokes or mouse clicks. If you want to move the cursor down one line, simply say: Down One. Want to check your emails? Simply say: Open Email. Add commands to open and control any Window's document or program. People have been speaking to each other for tens of thousands of years. Our brains have evolved to perform a fantastic and complex set of analyses of auditory input. Our brains convert the sounds we hear into conceptual ideas and thoughts which in turn form the basis of instructions, commands, information, and entertainment.
    Starting Price: $14 one-time payment
  • 31
    VoiceMe

    VoiceMe

    VoiceMe

    In an always more contactless world, arises the necessity of a new model of digital trust. VoiceMe enables people, companies, and objects to interact with each other through a simple interface and in an ultra-secured way opening the door to a new generation of services. Access restricted physical areas guaranteeing users' identity. Sign with legal validation documents and contracts. Our algorithms pre-identify the user based on behaviors, using also biometric parameters obtained from the upper face and voice. All customer-related data remains exclusively at the user's disposal, offering maximum privacy and respect for GDPR regulation. Each data set is encrypted, divided in pieces, and spread on a network of nodes, making it impossible for an external unauthorized source to extract. At each authorized data usage the inverse process is done to recompose the data set. API or SDK for third-party allows easy integration in already existing systems.
  • 32
    Fusion Speech
    Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments.
  • 33
    Alibaba Cloud Intelligent Speech Interaction
    Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.
    Starting Price: $1.40 per hour
  • 34
    SpeechPulse
    SpeechPulse uses your computer’s microphone for real-time speech recognition. It can type into your favorite apps, including text editors, web browsers, and office applications. SpeechPulse works fully offline and doesn’t require any internet connectivity. It supports speech recognition in multiple languages, including English, French, Spanish, Italian, German, Japanese, Chinese, and Russian (a total of 100 languages). SpeechPulse supports both auto punctuation and manual punctuation for the English language. It supports auto punctuation for all other languages. SpeechPulse can also generate subtitles for your audio and video files with accurate timestamps. It supports SRT and VTT subtitle formats. You can also customize the width of a subtitle line to include only a limited number of characters. SpeechPulse has a one-time payment. You can pay for the product once and use it forever.
    Starting Price: $59.95/one-time payment
  • 35
    PowerSpeak
    PowerSpeak from Saince is a versatile and powerful front end medical speech recognition software. We have included over 30 medical language dictionaries in the solution allowing you to take advantage of this technology irrespective of your specialization or care setting. It is an ideal clinical documentation and reporting solution not just for radiologists, but also for physicians of all specialties and in all care settings – acute care hospitals, imaging centers, labs, physician offices, behavioral health hospitals, long term care hospitals, nursing homes etc. Unlike other speech recognition solutions in the market that tie you down to a single device to use them, PowerSpeak Medical speech recognition software gives you the flexibility to install on five devices on a single license. PowerSpeak’s powerful and advanced speech recognition algorithms ensure that you enjoy 99% accuracy of the transcribed text every time. Less time spent correcting errors translates into more productivity.
  • 36
    Dragon Speech Recognition
    Putting words to work with AI‑powered speech recognition. Empower your employees to create high‑quality documentation. Save your organization time and money with Dragon Professional Anywhere, AI‑powered speech recognition that integrates into enterprise workflows. Empower attorneys to create high‑quality documentation and save time and money with Dragon Legal Anywhere, cloud‑hosted speech recognition that integrates directly into legal workflows. Enable officers to safely and efficiently meet reporting and documentation demands with this customized solution. Drive productivity at work and create and transcribe documents, short-cut repetitive steps—by voice. Seamlessly create, edit and transcribe legal documents by voice for improved efficiency, costs. Complete documents wherever work takes you with the cloud‑based, professional‑grade mobile dictation solution.
    Starting Price: $199.99 one-time fee per user
  • 37
    Vocola 3

    Vocola 3

    Vocola 3

    Dictation with Windows Speech Recognition (WSR) works well for "WSR-friendly" applications like MS Word, Outlook, and PowerPoint. Dictated text is inserted directly into document text, and commands like "Delete hedgehog" can refer to specific document text. But WSR dictation works less well for "WSR-unfriendly" applications like MS Excel, Gmail, and most programming environments. Dictation is not inserted directly into document text, and commands cannot refer to document text. Vocola improves this situation by supporting direct dictation for WSR-unfriendly applications, and by allowing correction and modification of the just-dictated phrase. Vocola and WSR use the same underlying speech profile, so any improvements you make via training, correction, or the speech dictionary benefit WSR dictation and Vocola dictation equally. Dictation to WSR-unfriendly applications is essentially unusable in Vista, as every utterance raises the correction panel.
  • 38
    Voice Pro

    Voice Pro

    LinguaTec

    Voice Pro Enterprise has been developed especially for use in enterprises. The recognition is done on the company server and can be accessed from any device (PC, Mac, smartphone, tablet). This ensures that all in-house information remains within the company. No more time-consuming speaker training is necessary, thanks to the speaker-independent recognition technology: Just speak into your device and you will see the transcribed text immediately. Companies finally have a sophisticated and secure speech recognition solution at their disposal. Regardless of whether you need to create a document at your work station, write an email on the move or dictate a sales report on site: Voice Pro Enterprise saves time and helps to make employees more productive. Voice Pro Enterprise results in a noticeable increase in employee efficiency. With Voice Pro Enterprise you dictate on average three times faster than you type. The high recognition accuracy minimizes post-processing.
    Starting Price: €149 one-time payment
  • 39
    Augnito

    Augnito

    Augnito

    Augnito combines the power of Speech Recognition AI with ease of mobility. You can edit, format, and complete reports at the speed of human speech, with best-in-class accuracy. Now use your personal templates and short forms from any workstation whether you are in the office, or at home or in the journey in between. Best suited for clinical specialties producing detailed reports such as Radiology, Histopathology and Surgical Notes, you can now dictate your reports from anywhere in the world. Augnito understands diverse accents and pronunciations out-of-the-box with no profile training. Built with the latest deep learning technology, it has the entire language of medicine which covers 50+ specialties and sub-specialties combined with all popular generic and drug names.
  • 40
    AppTek

    AppTek

    AppTek

    AppTek is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), and natural language understanding (NLU). The AppTek platform delivers industry-leading, real-time streaming and batch technology solutions in the cloud or on-premise for organizations across a breadth of worldwide markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages, dialects, and channels. AppTek utilizes deep neural networks to transcribe and understand speech and text data, delivering more accurate and efficient tools.
  • 41
    Dragon Law Enforcement

    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy—Zall by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 42
    WebsiteVoice

    WebsiteVoice

    WebsiteVoice

    Turn all your website articles into high-quality audio in less than 5 minutes and for free. Let your visitors listen to the content of your website in the background while they do other things with our text-to-speech technology and increase the time spent on your website. Accessibility is sometimes forgotten. Empower visitors with visual impairment and reading disabilities to still completely consume your content without the complications of reading. Listening to podcasts and audiobooks has become a growing trend and behavior for people to consume content. Capture a wider audience that would prefer tuning in instead of reading. Thanks to our Automatic Content Recognition technology, you can just drop our snippet on your site and forget about it. We will automatically enable text-to-speech voice for the relevant content. We use Artificial Intelligence and Machine Learning to constantly improve our voice algorithms to make your website text-to-speech as realistic as possible.
    Starting Price: $9 per month
  • 43
    VoxSci

    VoxSci

    VoxSciences

    Listening to voice messages can be terribly inefficient and laborious. VoxSciences™ provides a paradigm shift by transcribing voice messages into text messages. This gives voice messages a quantum leap to join email, SMS and IM on an equal basis with all the inherent advantages such as textural search. Our VERBS (Virtual Engine for Recognition of Basic Speech) engine converts voice messages into text messages and delivers them either as an email, SMS or via an API interface. Voicemail to text (SMS) is ideal for personal or corporate voicemail systems. Our XML API is typically used when a particularly high volumes of voice message transcription is required often by larger companies for Voice of The Customer analysis, comment lines, network or PABX operators and affiliates. Voice of the Customer is a market research technique that produces a detailed set of customer wants and needs. It involves the analysis of feedback from various sources such as email, web and IVR surveys.
  • 44
    Txtplay

    Txtplay

    Txtplay

    Txtplay not only makes your video and audio accessible for everyone it also extracts hidden powers in your media: searchable metadata. This means archiving, SEO, compliance become much easier to manage. Upload your media and select your language. Our speech recognition engine will take care of the job and notify you when it's done. You can continue working while our AI is doing the magic. We connect your media to the transcript in our online text editor where you can update, highlight, detect speakers and search through your text, and scroll in your audio or video. We support over 20 formats including: SRT, VTT,.docx. You can fine-tune the export with details like Timecode, Atlas format, speakers, etc. We also have developer-friendly options.
    Starting Price: €0.25 per min
  • 45
    SpeechWrite

    SpeechWrite

    SpeechWrite

    SpeechWrite specializes in a range of cloud dictation and voice recognition agile workflow solutions designed to meet the flexible working needs of the modern-day professional. Scalable and future-proofed solutions to suit all types of organizations. Our industry-leading range of digital dictation and transcription solutions link authors and transcribers facilitating efficient communication. Individual and organizational workflow settings enhance flexibility to ensure you receive your written dictations quickly and efficiently when in the office or on the move. Use your most powerful tool, your voice, and put it to work. Our practical technology, sophisticated yet simple, allows you to enhance your working environment and simply work smarter. We listen, learn and collaborate to support you through every stage of the process while also offering professional guidance and support along the way.
  • 46
    Dragon Professional Anywhere

    Dragon Professional Anywhere

    Nuance Communications

    Nuance Dragon Professional Anywhere empowers busy professionals, including remote workers, to use their voice naturally to create more detailed and accurate documentation quickly and easily. Mission critical documentation should be dictated by knowledge workers and field professionals, not technology limitations. Conversational AI empowers private and public sector professionals to document more naturally. Enables professionals to quickly and easily document the details of client meetings using speech recognition that is 3x faster than typing and up to 99% accurate. Most people speak at over 120 wpm but type at less than 40 wpm. Speak freely and as much as you like with no per-user limits. Business professionals can stay productive anywhere and focus on their clients and business rather than the technology.
  • 47
    SpokenData

    SpokenData

    ReplayWell

    Let the automatic speech-to-text technology transcribe your data. Or transcribe your data yourself or buy professional transcript. Use our on-line time synchonous editor to surf your data and transcripts. Download transcripts in many formats. Manage your team of transcribers using tags and categories. Help them with transcription by automatic voice-to-text technology. Integrate SpokenData into your application via our REST API. We adapt the voice-to-text on your data domain to maximize the transcript accuracy and lower your labor costs. Enable speech technologies in your applications through integrating SpokenData using our REST API. We are ready to process huge amounts of your data. You get API fitting your needs. Just contact our support team. We customize the voice-to-text on your data and purpose to maximize the transcript accuracy. Suitable for: web/mobile app developers, media monitoring agencies, audio/video archive business.
  • 48
    Voci

    Voci

    Medallia

    Companies engage with customers by phone more than any other channel, and these interactions represent a gold mine of untapped information. Listening to every customer call is costly and time-consuming and not physically practical. As a result, only a fraction of randomly selected calls is typically reviewed. These voice interactions reveal the true voice of your customers and enable you to get to the heart of their concerns. With our highly accurate, automated speech-to-text transcription, you can transform your unstructured voice data into transcripts that can be integrated into your analytics platforms. Voci enables you to improve agent quality monitoring, enhance the customer experience, extract competitive intelligence and ensure compliance.
  • 49
    IDVoice

    IDVoice

    ID R&D

    Voice biometrics is the science of using a person’s voice as a uniquely identifying characteristic for the purpose of authentication and/or personalizing the user experience. The technology is referred to in a variety of ways including voice verification, speaker verification, speaker identification and speaker recognition. There are two ways we put voice biometrics into practice. The first is Text Independent Voice Verification. This approach does not depend on the person speaking a particular passphrase. The other is Text Dependent Voice Verification. in which the user enrolls using a specific phrase but unlike a password, this phrase is not secret. IDVoice enables both options depending on your use case and in some scenarios they may be used together.
  • 50
    wolkvox

    wolkvox

    Microsyslabs

    wolkvox is a cloud-based call center management software that helps businesses streamline communications across numerous web chat applications and social media channels such as Telegram, WhatsApp, Line, Twitter, Facebook, and Instagram. Organizations can manage interactions using video calls, landline, mobile devices, SMS, email and more. wolkvox enables enterprises to create and monitor multiple customer categories, record and analyze client interactions and generate reports to track the performance of campaigns and agents. It offers a variety of features including a drag-and-drop interface, simultaneous calling, Artificial Intelligence (AI)-enabled speech analytics, gamification, and more. Additionally, administrators can use the predictive dialer to establish custom rules for virtual agents, call routing and messages and design templates for email and SMS campaigns. wolkvox supports integration with various third-party ERP, business intelligence, CRM, and information systems.