Alternatives to Yandex SpeechKit

Compare Yandex SpeechKit alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Yandex SpeechKit in 2024. Compare features, ratings, user reviews, pricing, and more from Yandex SpeechKit competitors and alternatives in order to make an informed decision for your business.

  • 1
    IBM watsonx Assistant
    IBM watsonx Assistant (Formerly Watson Assistant) is a market-leading enterprise conversational AI platform that allows you to build intelligent virtual and voice assistants that can provide customers with fast, consistent and accurate answers across any messaging platform, application, device or channel. Using artificial intelligence and large language models, watsonx Assistant learns from customer conversations, improving its ability to resolve issues the first time while removing the frustration of long wait times, tedious searches and unhelpful chatbots. Most chatbots try to mimic human interactions, frustrating customers when a misunderstanding arises. IBM watsonx Assistant is more than a chatbot. It knows when to search for an answer from a knowledge base, when to ask for clarity and when to direct users to a human agent for more assistance. And since it can be deployed in any cloud or on-premises environment – smarter AI is finally available wherever you need it.
    Compare vs. Yandex SpeechKit View Software
    Visit Website
  • 2
    Dialogflow

    Dialogflow

    Google

    Dialogflow from Google Cloud is a natural language understanding platform that makes it easy to design and integrate a conversational user interface into your mobile app, web application, device, bot, interactive voice response system, and so on. Using Dialogflow, you can provide new and engaging ways for users to interact with your product. Dialogflow can analyze multiple types of input from your customers, including text or audio inputs (like from a phone or voice recording). It can also respond to your customers in a couple of ways, either through text or with synthetic speech. Dialogflow CX and ES provide virtual agent services for chatbots and contact centers. If you have a contact center that employs human agents, you can use Agent Assist to help your human agents. Agent Assist provides real-time suggestions for human agents while they are in conversations with end-user customers.
    Compare vs. Yandex SpeechKit View Software
    Visit Website
  • 3
    Google Cloud Speech-to-Text
    Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device.
    Compare vs. Yandex SpeechKit View Software
    Visit Website
  • 4
    LumenVox

    LumenVox

    LumenVox

    Transforming customer engagement with AI-driven speech recognition and voice authentication technology. We’ve spent the last 20 years empowering our partners’ success through collaboration. Our curiosity keeps us innovating for the next 20. Our flexible speech-enabling technology enables you to build a solution that fulfills all your customers’ demands, affordably and reliably. We do one thing, and we do it well. And that's speech-enabling your applications. Finally, deliver great voice automation and interactions. Whether short and simple commands, or conversational questions, LumenVox ASR and TTS is accurate and affordable, helping you improve efficiencies on both sides of the phone line. You’ll never repeat yourself again. We provide you with the utmost flexibility from a capabilities, deployment and monetization perspective. If you can think it, you can build it with LumenVox. Shorten your development to deployment time with our easy, intuitive technology and toolsets.
  • 5
    Amazon Lex

    Amazon Lex

    Amazon

    Amazon Lex is a service for building conversational interfaces into any application using voice and text. Amazon Lex provides the advanced deep learning functionalities of automatic speech recognition (ASR) for converting speech to text, and natural language understanding (NLU) to recognize the intent of the text, to enable you to build applications with highly engaging user experiences and lifelike conversational interactions. With Amazon Lex, the same deep learning technologies that power Amazon Alexa are now available to any developer, enabling you to quickly and easily build sophisticated, natural language, conversational bots (“chatbots”). With Amazon Lex, you can build bots to increase contact center productivity, automate simple tasks, and drive operational efficiencies across the enterprise. As a fully managed service, Amazon Lex scales automatically, so you don’t need to worry about managing infrastructure.
  • 6
    SoundHound

    SoundHound

    SoundHound

    We believe every brand should have a voice and every person should be able to interact naturally with the products around them, by simply talking. At SoundHound Inc., we’re working together with our strategic partners to build a more accessible and connected world. We build custom voice assistants for companies wanting to keep their brand, users, and data. Built on the foundation of proprietary Speech-to-Meaning® and Deep Meaning Understanding® technologies, the Houndify platform provides conversational intelligence unmatched by others in the industry. Houndify everything! Voice-enable the world with conversational intelligence. Create a voice AI platform that exceeds human capabilities and brings value and delight via an ecosystem of billions of products enhanced by innovation and monetization opportunities. Headquartered in the heart of Silicon Valley, we are a global company with 9 offices in key markets and teams in 16 countries.
  • 7
    Vozy

    Vozy

    Vozy

    Vozy transforms the way companies interact with customers through voice assistants and conversational artificial intelligence to boost customer-centric enterprises with an automation that really works. With personalized solutions designed to meet the growing omnichannel customer care demand, Vozy is delivering significant cost savings and unprecedented customer experiences for companies in Latin America. That’s why powerhouses like SURA, Bancolombia, Protección, and Emtelco trust Vozy.
  • 8
    Azure AI Speech

    Azure AI Speech

    Microsoft

    Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.
  • 9
    TrulyNatural

    TrulyNatural

    Sensory

    Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state of-the-art cloud performance. Unlike voice recognition software often used with smartphones and mobile devices, such as with a voice assistant mobile app, as well as with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a wifi connection. Many applications don’t need or want to rely on cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. The market concerns regarding privacy, performance and bandwidth are driving more processing to the edge.
  • 10
    Wynyard Voice Frequency Analytics
    There is a lot of unstructured data in various formats such as call records, recorded conversations, unclear voices, etc. To identify the relevant data and recognize the voices, a powerful tool is required. Wynyard Voice Frequency Analytics (VFA) is an analyzing tool that helps in identifying the person behind an unclaimed voice or decoding the speech in a readable format from an unclear voice. It is a web application that recognizes the identity of the speaker. The application is beneficial for the law enforcement and Government bodies to prevent crimes. Wynyard VFA works on the simple concept of matching the suspected voice with the ones available in the database and recognizing the owner of that voice. The advanced and superior technology used in the application ensures accurate results. The application can also be used to identify keywords or phrases from a conversation and convert the speech into readable text.
  • 11
    Alan AI

    Alan AI

    Alan AI

    Alan Studio, a simple but powerful IDE, is tailored to the challenges of voice interface design. Write and test conversational scenarios, maintain dialog versions and publish the results to a sandbox or the production environment. Focus on bigger things and let Alan take care of the rest. Alan captures key data points such as users' utterances, frequency of use and session length to let you see how customers interact with a voice assistant in your app. Leverage this data to understand users' behavior and flows, identify unhandled voice commands and optimize the voice assistant effectiveness. Alan provisions and handles the infrastructure required to scale, plan, and maintain voice deployments. To integrate with Alan, you only need to embed a lightweight client SDK in your app. Build a chatbot for your app to answer frequent user questions, handle common requests or just keep human-like conversations with your customers.
  • 12
    SpokenData

    SpokenData

    ReplayWell

    Let the automatic speech-to-text technology transcribe your data. Or transcribe your data yourself or buy professional transcript. Use our on-line time synchonous editor to surf your data and transcripts. Download transcripts in many formats. Manage your team of transcribers using tags and categories. Help them with transcription by automatic voice-to-text technology. Integrate SpokenData into your application via our REST API. We adapt the voice-to-text on your data domain to maximize the transcript accuracy and lower your labor costs. Enable speech technologies in your applications through integrating SpokenData using our REST API. We are ready to process huge amounts of your data. You get API fitting your needs. Just contact our support team. We customize the voice-to-text on your data and purpose to maximize the transcript accuracy. Suitable for: web/mobile app developers, media monitoring agencies, audio/video archive business.
  • 13
    AppTek

    AppTek

    AppTek

    AppTek is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), and natural language understanding (NLU). The AppTek platform delivers industry-leading, real-time streaming and batch technology solutions in the cloud or on-premise for organizations across a breadth of worldwide markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages, dialects, and channels. AppTek utilizes deep neural networks to transcribe and understand speech and text data, delivering more accurate and efficient tools.
  • 14
    Alibaba Cloud Intelligent Speech Interaction
    Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.
    Starting Price: $1.40 per hour
  • 15
    Fusion Speech
    Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments.
  • 16
    INVOX Medical
    The most intuitive voice dictation program on the market. Convenient and instant audio-to-text transcription. The program has a clear and simple design, which guarantees a comfortable, fast and precise operation. INVOX Medical has specific dictionaries and is adapted to many medical specialties. INVOX Medical accurately recognizes a wide variety of medical terminology. INVOX Medical is the voice recognition software already trusted by thousands of medical professionals around the world. It's accurate, easy, and incredibly intuitive. In a few minutes you will be dictating your medical reports with complete accuracy. And in addition, it has an unbeatable price. INVOX Medical uses the latest technology in the use of artificial intelligence to help you dictate your medical reports with maximum precision, allowing you to work up to three times faster. The system allows you to add terms to the dictionary, replace words and modify their pronunciation at any time.
    Starting Price: $35 per month
  • 17
    Phonexia Speech Platform
    Phonexia offers a comprehensive portfolio of cutting-edge speech recognition and voice biometrics technologies ready to meet any commercial and governmental scenarios. Powered by the latest advancements in artificial intelligence, acoustics, phonetics, and voice biometrics science, Phonexia products are extremely accurate, fast, and scalable. Phonexia’s AI-powered solutions let you build voicebots, verify a speaker’s identity based on voice biometrics, transcribe speech to text, and search for speakers and context in large amounts of audio. Secure access to your clients’ data conveniently with voice biometric authentication and detect fraud attempts natively. Phonexia offers a comprehensive portfolio of cutting-edge speech recognition and voice biometrics technologies ready to meet any commercial and governmental scenarios. Powered by the latest advancements in artificial intelligence, acoustics, phonetics, and voice biometrics science.
  • 18
    Graphlogic GL Platform
    Graphlogic Conversational AI Platform consists on: Robotic Process Automation (RPA) and Conversational AI for enterprises, leveraging state-of-the-art Natural Language Understanding (NLU) technology to create advanced chatbots, voicebots, Automatic Speech Recognition (ASR), Text-to-Speech (TTS) solutions, and Retrieval Augmented Generation (RAG) pipelines with Large Language Models (LLMs). Key components: - Conversational AI Platform - Natural Language understanding - Retrieval augmented generation or RAG pipeline - Speech-to-Text Engine - Text-to-Speech Engine - Channels connectivity - API builder - Visual Flow Builder - Pro-active outreach conversations - Conversational Analytics - Deploy everywhere (SaaS / Private Cloud / On-Premises) - Single-tenancy / multi-tenancy - Multiple language AI
    Starting Price: 75/1250 MAU/month
  • 19
    VoxCommando

    VoxCommando

    VoxCommando

    VoxCommando is a speech recognition and command utility that lets you take control of your multimedia Home Theatre PC (HTPC). VoxCommando can be run locally, without sacrificing privacy to any cloud-based services. Add voice control to your home automation. Use it as an assistive tool to speed up everyday tasks, reduce your reliance on the keyboard and mouse. VoxCommando is different from other speech recognition applications in that it is extremely customizable. It is designed to work with a wide variety of home automation services and multimedia programs, including user favorites like Kodi and MediaMonkey. It is able to achieve accurate speech recognition because it already knows what media is in your library.
  • 20
    tazti

    tazti

    Voice Tech Group

    Welcome to the tazti website! tazti is state of the art Speech Recognition & Voice Recognition software. You can easily mash up tazti to files, folders, programs, videos and songs on your PC, to open them by voice control. Play PC Games, control applications, programs, and robots by voice command! Over 300,000 people have now tried tazti and it's many features. tazti is super fun, especially if you are tired of pounding your keyboard or want an easy to use assistive technology. Great as well for people with Arthritis, Carpal Tunnel, Tendonitis, Fibromyalgia or other hand, finger or wrist pain.
  • 21
    Floatbot

    Floatbot

    Floatbot

    Our AI Chatbot and Voicebot platform helps you increase customer experience. Floatbot is a hybrid platform, one of the very few globally, that allows the development of both Chatbot and Voicebot within hours. Automate 80% of your Call Center operations with hybrid Voicebot and Chatbot. Increase your Customer Experience and CSAT score. Our AI Chatbot and Voicebot platform helps you increase customer experience. Floatbot is a hybrid platform, one of the very few globally, that allows the development of both Chatbot and Voicebot within hours. Their voice, your response. Speech recognition in vernacular language available on Floatbot platform. 10 Language, 100+ Dialects Powered with 100,000 hours of Voice. Design your custom workflows for Chatbot and Voicebot Floatbot’s No-Code platform requires zero coding to build bot. It allows rapid development and deployment with advanced bot builder.
  • 22
    SpeechMotion
    Document a patient encounter with full or partial dictation, voice recognition, or on-the-go with a customized solution tailored to your unique environment. Solving common documentation issues, like lowering costs and integrating workflows, begins with choosing a solution designed to meet your evolving needs. Improve workflow efficiencies and physician adoption for a rapid return on investment with a partner committed to your long-term success. A leading, national provider of US-based transcription, speech recognition, voice capture and advanced documentation technologies, SpeechMotion partners with healthcare facilities and the organizations supporting them to create a customized documentation solution tailored to support both long and short-term goals. SpeechMotion provides the flexible options healthcare facilities need to quickly and efficiently document a complete patient story, all under one product and service umbrella.
  • 23
    Deepgram

    Deepgram

    Deepgram

    Deploy accurate speech recognition at scale while continuously improving model performance by labeling data and training from a single console. We deliver state-of-the-art speech recognition and understanding at scale. We do it by providing cutting-edge model training and data-labeling alongside flexible deployment options. Our platform recognizes multiple languages, accents, and words, dynamically tuning to the needs of your business with every training session. The fastest, most accurate, most reliable, most scalable speech transcription, with understanding — rebuilt just for enterprise. We’ve reinvented ASR with 100% deep learning that allows companies to continuously improve accuracy. Stop waiting for the big tech players to improve their software and forcing your developers to manually boost accuracy with keywords in every API call. Start training your speech model and reaping the benefits in weeks, not months or years.
  • 24
    Yactraq

    Yactraq

    Yactraq

    Yactraq is the industry value leader in speech analytics software. Our customers typically realize benefits across two broad functional areas. Marketing teams looking to extend their Voice-of-the-Customer (VoC) capabilities beyond the feedback form and social media now want to mine sales and customer service phone calls as part of their omni-channel capability. Contact Center Quality Management teams typically use speech analytics / audio mining as a way of leveraging AI / Machine Learning to evaluate the performance of their call agents. Yactraq offers customized free trials based on a clients own data so they can experience the value of our software before deciding to buy. Our products are cost-effectively priced to suit the needs of end customers as well as partners in the Business Process Outsourcing (BPO), Contact Center as a Service (CCAS), Voice-of-the-Customer (VoC), CRM Software and Network Service Provider businesses.
  • 25
    Knovvu Speech Recognition
    Automate customer processes, evaluate agent performances objectively and ensure your operations are 100% efficient. In our connected world, many consumers are interacting with everyday connected appliances in new ways. With a trend in connected devices that often lack a screen, speech is emerging as a natural, intuitive interface for human-machine interaction. Speech recognition is the driving technology behind this development, revolutionizing the way people interact with their devices. With Knovvu Speech Recognition from Sestek, machines and applications can understand user commands in spoken language. With the ability to listen to and interpret spoken demands, users may interact with these devices by speaking aloud rather than inputting buttons and keystrokes. Our automatic speech recognition software has full application. Many organizations use technology to power intuitive and straightforward self-service solutions.
  • 26
    WebsiteVoice

    WebsiteVoice

    WebsiteVoice

    Turn all your website articles into high-quality audio in less than 5 minutes and for free. Let your visitors listen to the content of your website in the background while they do other things with our text-to-speech technology and increase the time spent on your website. Accessibility is sometimes forgotten. Empower visitors with visual impairment and reading disabilities to still completely consume your content without the complications of reading. Listening to podcasts and audiobooks has become a growing trend and behavior for people to consume content. Capture a wider audience that would prefer tuning in instead of reading. Thanks to our Automatic Content Recognition technology, you can just drop our snippet on your site and forget about it. We will automatically enable text-to-speech voice for the relevant content. We use Artificial Intelligence and Machine Learning to constantly improve our voice algorithms to make your website text-to-speech as realistic as possible.
    Starting Price: $9 per month
  • 27
    Rev.ai

    Rev.ai

    Rev.ai

    Rev.ai was built by leading speech recognition experts from millions of hours of accurate human-transcribed content. We began in 2011 with Rev.com, providing human transcription services. We are now the world's largest transcription vendor, with over 35,000 contractors who transcribe millions of minutes of audio each month. In 2017 we launched Temi, an automated speech-to-text transcription and editing service. Temi has already transcribed 20 million minutes of content and was named the best transcription service by Wirecutter. Today our best-in-class speech engine is available to everyone as Rev.ai. We're helping companies get the most out of their audio and video content by making it searchable and accessible.
  • 28
    Yosh.AI

    Yosh.AI

    Yosh.AI

    Yosh.AI is an official global Google Cloud Partner. After numerous international, successful implementations of AI solutions for international enterprises, Google recognized advanced and high quality solutions provided by Yosh.AI with a global partnership. The mission of Yosh.AI is to revolutionize the communication between retailers and the users through the use of AI-powered Virtual Voice Assistants and deliver a more pleasurable, effortless shopping experience. Thanks to innovative AI technology supporting both voice and text conversations, brands for the first time in history are now able to seamlessly communicate with users, allowing for more meaningful, user-driven, personalized one-to-one communications. Our mission is to empower e-commerce with AI solutions and drive user engagement and sales through delivering pleasurable, effortless fashion shopping experience with voice.
  • 29
    Dragon Professional Group

    Dragon Professional Group

    Nuance Communications

    Empower employees to dictate documents 3 times faster than typing with up to 99% recognition accuracy, right from the first use. Since documents are created in a fraction of the time it would typically take typing by hand, they spend less time on paperwork, and more time on more profitable tasks. With a next‑generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse workgroups and settings. Dragon makes it easy to automate tasks or short‑cut repetitive steps. Create custom voice commands to insert standard boilerplate text or signatures into documents. Or create time‑saving macros to automate multi‑step workflows by voice. Once created, share these customizations across the Dragon user community for efficiency gains.
  • 30
    Dragon Home

    Dragon Home

    Nuance Communications

    With a next-generation speech engine leveraging Deep Learning technology, Dragon adapts to your voice or environmental variations—even while dictating. Dragon intelligently transcribes your spoken words into text 3x faster than typing with up to 99% recognition accuracy. And with a streamlined user interface and no training required, getting started is as easy as launch and dictate! With a new playback feature, you can select a block of text and “play that back” for easy proofreading and editing as you listen to what you dictated. Dragon works with today’s popular touchscreen PCs and tablets, so you can enjoy the versatility of interacting with your favorite applications—at home or school.
    Starting Price: $200 one-time payment
  • 31
    Dragon Professional Individual

    Dragon Professional Individual

    Nuance Communications

    As a business professional, you face heavy documentation demands each day. See how Dragon Professional Individual can help you get documents done faster and more accurately, both in and out of the office, so you can focus on revenue-generating tasks. With a next-generation speech engine leveraging Deep Learning technology, Dragon adapts to your voice or environmental variations—even while you’re dictating. Create documents and reports quickly and accurately, and zip through computer tasks in record time—all by voice. Dragon learns the words and phrases you use the most to minimize corrections. Keep up with documentation even on the road or out in the field. Dragon works with popular form factors such as portable touchscreen PCs.
    Starting Price: $500 one-time payment
  • 32
    VoxSigma

    VoxSigma

    Vocapia

    The VoxSigma software suite is offered as a Web service via a REST API over HTTPS, always providing customers access to our latest systems thereby quickly benefiting from regular advances and take advantage of additional features offered by the online environment. Our speech-to-text service is available 24/7/365 with failover servers and geographic redundancy. Automatic on-the-fly adaptation allows the user to provide texts related to the audio document being processed, what can be considered topic/domain adaptation. These accompanying texts serve to increase the lexical coverage of the speech-to-text system and to adapt the language model to the specific domain of the audio document with the aim of improving the transcription accuracy.
  • 33
    Maestra

    Maestra

    Maestra

    Automatic Transcripts, Subtitles and Voiceovers. In just minutes. Highly accurate speech to text software with a built in advanced text editor. Translate in English, French, Spanish, German and 80+ languages. Save time and money with Maestra’s automatic audio to text transcription software. Transcribe audio files to text automatically within seconds. No credit card required for the first 15 minutes. Creating subtitles for video with online automatic subtitling software can save you a considerable amount of time. You'll be able to auto generate subtitles for videos in just a few minutes. You can also translate your subtitles automatically to 80+ languages. With Maestra video dubber you can automatically voiceover your videos aloud to foreign languages using artificial intelligence and computer generated voices.
  • 34
    Acusis

    Acusis

    Acusis

    Acusis’ approach to Revenue Cycle Management (RCM) is full circle that provides finest experience to their clients. Acusis has a tenured team consisting of proven RCM experts and consultants on billing, coding, CDI, risk adjustment, HCC, account receivables and denials management. Clinical documentation management is simple and cost-effective with Acusis’ unique approach of combining cutting-edge technology and professional documentation services. While eCareNotes speech recognition platform helps Physicians save time and focus on delivering care, Acusis professional services team focuses on making life easy for HIM by offering superior editing services. From dictation capture to cutting-edge voice recognition, Acusis offers a wide array of cloud-based products for simplifying MTSO transcription workflow management. eCareNotes, the flagship technology platform helps MTSOs as well as in-house transcription teams of hospitals to reduce documentation costs and stay compliant.
  • 35
    Transkriptor

    Transkriptor

    Transkriptor

    Automatically transcribe audio, and turn your audio or video to text. Upload your file and convert your audio to text with Transkriptor. Transkriptor’s powerful artificial intelligence generates online transcriptions within few minutes. Transkriptor is used by many professionals or students. Transkriptor is the best assistant for interview transcription, lecture transcription and video transcription. Transkriptor creates editable TXT, word or SRT files. You can download your transcriptions within seconds or you can use Transkriptor’s online editor for easy and quick editing. Sign up today and be more productive in school, work, and life. Even though Transkriptor is one of the most powerful artificial intelligence solutions, it is extremely easy to use. Transkriptor is an online speech-to-text converter and no installation required. Simply upload your file and start.
    Starting Price: $9.99 per month
  • 36
    ShoutOUT.AI

    ShoutOUT.AI

    ShoutOUT.AI

    Intelligent assistant that can go beyond predefined conversational flows. Cutting edge Artificial Intelligence technologies decide the next response from the bot. Connecting to the next new channel is easy, even with voice channels. Understand your customer's intent and extract information using NLU. Don’t send your valuable data to third party services. Remain independent, deploy on-prem. Bring chatbot conversations into the real world by integrating ShoutOUT AI with yours and 3rd party APIs. Embrace this new normal of customer service. Transform the boring FAQ questions into an engaging chatbot that understand and answer sophisticated inquiries from your clients. You don’t need to be tech-savvy. Integrate an answer bot to your website in a few steps to seamlessly handle all conversations coming from multiple users at once. Get answers to the commonly asked questions on the spot.
  • 37
    STELLA

    STELLA

    STELLA

    STELLA is an artificial intelligence digital voice assistant that can answer an unlimited number of calls simultaneously, route calls, answer frequently asked questions, and book service appointments. Improve your CSI and lower costs at the same time. With STELLA Automotive, your customers can accomplish their goals without sitting on hold or getting frustrated. STELLA answers your telephone on the first ring. The STELLA team of scientists and engineers developed conversational AI with car dealers and customers in mind. With STELLA, saving time, retaining employees, and delighting customers is easy. STELLA is the smartest AI in automotive, and she keeps getting smarter. Because there are no per-minute fees, STELLA does not add cost as she handles more of your dealership’s calls. STELLA is a non-disruptive technology and is simple to deploy and run in your dealership. Your employees will not need to change any processes or spend time learning how to use or manage STELLA.
  • 38
    Herbie.ai

    Herbie.ai

    Herbie.ai

    Herbie.ai – A multi-national Conversational AI company (Part of SunSmart Global – 15+ years in enterprise solutions) Digital Transformation of Enterprises Unique Voice-Enabled Virtual Assistants to automate business cases. Franchise spread over 6 Countries – expanding to 13+. Instant enables seamless connection with 12+ Social Media platforms. The business provides end-to-end AI Solution that are "Innovative & Intelligent" in addressing the critical business needs of Mid to Large Corporates, built using advanced technology with applications across business verticals and geographies. Herbie.AI is a Microsoft Gold Application Development Partner, and is ISO 9001:2015 Certified. AI, ML & NLP based, Multi-lingual, Omni-channel, Always-on bots with transfer to agent. Improve your customer experience with ai powered conversational chatbot solution. Provide complete customer service with Herbie AI chatbots for customer support. Lower Customer Support Costs.
  • 39
    Chatlayer

    Chatlayer

    Chatlayer

    Personally and effortlessly service your customers with our smart conversational AI solutions. Build your bot in one language and make it understand 100+ languages with just one button click. We are a no-coding platform. Business users can create and manage bots without any coding skills needed. Talk to customers on Facebook messenger, WhatsApp, Google Assistant, Amazon Alexa & more. Our clever AI gives you detailed insights into your conversations and provides tips on how to improve them. Powered by our AI technology, the myBo bot helps customers of Belfius Insurance claims, freeing up time for human agents to handle the more complex cases. Build your own voice or chatbot with our 30-day free trial account. It’s easy, no IT skills needed. But don’t just take our word for it, try it out yourself for free. Chatlayer is not just a conversational platform, it’s also a partnership ecosystem! Get the most out of our tools thanks to our network of trusted partners.
  • 40
    CEDEX Technologies

    CEDEX Technologies

    CEDEX Technologies

    CEDEX Technologies is a specialized chatbot development company from Kerala, India. We are a dedicated team of chatbot and voicebot experts. We develop custom chatbots based on our client's unique business requirements. We design, develop and train high-quality chatbots and voice apps with conversational abilities, context sensitivity, and personality traits. Chatbots are evolving very fast and it is expected that they will eventually replace humans in areas like customer care, e-commerce, entertainment, news, delivery services, corporate information exchange etc. Developing chatbots may seen pretty easy on the surface. But to build a chatbot that meet the exact business requirements needs the expertise of a dedicated chatbot development team. Chatbots can help your customers 24/7. They don't have bad days and they don't get frustrated and thus provide a better customer support. Chatbots can automate tasks which are to be done frequently and at the right time.
  • 41
    Otter.ai

    Otter.ai

    Otter.ai

    Otter is where conversations live Generate rich notes for meetings, interviews, lectures, and other important voice conversations with Otter, your AI-powered assistant. Organizations who have the Otter advantage. Teams big and small trust Otter to transcribe their important conversations. Our shiny new release, Otter 2.0, adds more functionality to improve collaboration and productivity. The Teams plan includes capabilities designed especially for small and medium businesses and teams in larger enterprises. Record and review in real time. Search, play, edit, organize, and share your conversations from any device. Record conversations using Otter on your phone or web browser. Import or sync recordings from other services. Integrate with Zoom. Get real-time streaming transcripts and, within minutes, rich, searchable notes with text, audio, images, speaker ID, and key phrases. Share or export voice notes to inform others and get on the same page.
    Starting Price: $8.33 per month
  • 42
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 43
    SpeechText.AI

    SpeechText.AI

    SpeechText.AI

    Transcribe audio and video into text. Get accurate transcriptions of podcasts with domain-specific speech recognition. SpeechText.AI is a powerful artificial intelligence software for speech to text conversion and audio transcription. Upload audio or video files. AI transcription software supports various file formats and transcribes from speech to text in any language. Select domain. Select industry domain and audio type from predefined categories to improve the recognition accuracy of domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert from audio to text with close to human accuracy. Edit & Export. Search, modify and verify audio transcriptions using interactive editing tools. Export your content in different formats. Why SpeechText.AI? Set of amazing features to help you transcribe audio and video in seconds. Speech recognition. Powerful speech-to-text tech.
    Starting Price: $19 one-time payment
  • 44
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 45
    Speech2Structure
    When treating a patient, doctors spend on average two-thirds of their time documenting the treatment and far less time on examinations or patient interviews. To allow doctors to spend more time with their patients, Averbis is working on Speech2Structure – a software solution where the documentation is recorded live by voice and structured on-the-fly. Speech2Structure can correctly recognize and resolve many linguistic variations such as negations, suspected diagnoses, diagnoses that have taken place, etc. when recognizing diagnoses. Pathological laboratory values or microbiology results are also converted into corresponding diagnoses. The recorded medications can also provide clues to diagnoses.
  • 46
    Azure Speaker Recognition
    A Speech service feature that verifies and identifies speakers. Enable frictionless, secure customer experiences: Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Improve the customer experience by streamlining verification processes. Use voice to verify individuals for secure, frictionless customer engagements in a wide range of solutions, from web applications to call centers. Speaker verification can use either passphrases or free-form voice input. Unlock value from scenarios with multiple speakers: Determine a speaker’s identity from within a group of enrolled speakers. Speaker identification enables you to attribute speech to individual speakers, support multiuser voice recognition for personalized interactions, and more.
  • 47
    Voci

    Voci

    Medallia

    Companies engage with customers by phone more than any other channel, and these interactions represent a gold mine of untapped information. Listening to every customer call is costly and time-consuming and not physically practical. As a result, only a fraction of randomly selected calls is typically reviewed. These voice interactions reveal the true voice of your customers and enable you to get to the heart of their concerns. With our highly accurate, automated speech-to-text transcription, you can transform your unstructured voice data into transcripts that can be integrated into your analytics platforms. Voci enables you to improve agent quality monitoring, enhance the customer experience, extract competitive intelligence and ensure compliance.
  • 48
    EasyVoice

    EasyVoice

    EasyVoice

    Voice assisted applications empower business to stream from the cloud to any Alexa-enabled device. Our Alexa Developer team makes it possible for your business to be accessible through the spoken word. With one simple word, a target audience of millions has instant access to your products and services. Customer engagement with voice assistance by certified alexa developers. Easy Voice develops B2B and B2C voice solutions that interact with Alexa voice services (Alexa apps and skills). We provide a complete alexa developer solution for connecting people through Amazon Echo or other Alexa-enabled devices. The Alexa Skill and Dash Button Platform is the first solution to empower organizations to manage customer engagement with voice on a single solution. Easily integrates with existing front and back office solutions. We develop the world's leading voice assistant applications, skills, and apps.
  • 49
    Transcribe

    Transcribe

    Wreally

    Transcribe saves thousands of hours every month in transcription time for journalists, lawyers, podcasters, students and professional transcriptionists all over the world. Increase your productivity & save mountains of time when converting your interviews, audio notes, lectures, speeches, podcasts and any recorded speech to text. Put on your headphones, load your audio, slow it down and speak out what you hear. It's that simple. Our dictation engine will convert your speech to text on the fly. This is way faster than typing. We support English, Spanish, French, Hindi and almost all other European & Asian languages.
  • 50
    Braina

    Braina

    Brainasoft

    Braina (Brain Artificial) is an intelligent personal assistant, human language interface, automation and voice recognition software for Windows PC. Braina is a multi-functional AI software that allows you to interact with your computer using voice commands in most of the languages of the world. Braina also allows you to accurately convert speech to text in over 100 different languages of the world. Braina's artificial intelligence makes it possible for you to control your computer using natural language commands and makes your life easier. Braina is not a Siri or Cortana clone for PC but rather a powerful personal and office productivity software. It isn't just like a chat-bot; its priority is to be super functional and to help you in doing tasks. Braina helps you do things you do everyday. It is a multi-functional artificial intelligence software that provides a single window environment to control your computer and perform wide range of tasks using voice commands.
    Starting Price: $29 per year