Alternatives to SoapBox

Compare SoapBox alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to SoapBox in 2024. Compare features, ratings, user reviews, pricing, and more from SoapBox competitors and alternatives in order to make an informed decision for your business.

  • 1
    Twilio Voice
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources, like our Voice SDK. Then, add on features like Interactive Voice Response (IVR), recording transcriptions, and speech recognition to create an experience that your customers will appreciate. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice. Find docs, code samples, helper libraries, and developer tools such as Twilio Runtime and our visual workflow builder, Studio.
  • 2
    Speechmatics

    Speechmatics

    Speechmatics is the most accurate and inclusive speech-to-text API ever released. Speechmatics is the world’s leading expert in Speech Intelligence, combining the latest breakthroughs in AI and ML to unlock the business value in human speech. Businesses use Speechmatics worldwide to accurately understand and transcribe human-level speech into text regardless of demographic, age, gender, accent, dialect, or location in real-time and on recorded media. Combining these transcripts with the latest AI-driven speech capabilities, businesses build products that utilize summarization, topic and chapter detection, sentiment analysis, translation, and more. Speechmatics processes over 500 years of transcription worldwide every month in 50 languages and can translate 69 language pairs. Having pioneered machine learning in speech recognition, its neural networks consider acoustics, languages, dialects, multiple speakers, punctuation, capitalization, context, and implicit meanings.
  • 3
    Speech2Structure
    When treating a patient, doctors spend on average two-thirds of their time documenting the treatment and far less time on examinations or patient interviews. To allow doctors to spend more time with their patients, Averbis is working on Speech2Structure – a software solution where the documentation is recorded live by voice and structured on-the-fly. Speech2Structure can correctly recognize and resolve many linguistic variations such as negations, suspected diagnoses, diagnoses that have taken place, etc. when recognizing diagnoses. Pathological laboratory values or microbiology results are also converted into corresponding diagnoses. The recorded medications can also provide clues to diagnoses.
  • 4
    Clarifai

    Clarifai

    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for developing better, faster and stronger AI. We help our customers create innovative solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. The platform comes with the broadest repository of pre-trained, out-of-the-box AI models built with millions of inputs and context. Our models give you a head start when extending your own custom AI models. Clarifai Community builds upon this and offers 1000s of pre-trained models and workflows from Clarifai and other leading AI builders. Users can build and share models with other community members. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been recognized by leading analysts, IDC, Forrester and Gartner, as a leading computer vision AI platform. Visit clarifai.com.
    Starting Price: $0
  • 5
    SoundHound

    SoundHound

    We believe every brand should have a voice and every person should be able to interact naturally with the products around them, by simply talking. At SoundHound Inc., we’re working together with our strategic partners to build a more accessible and connected world. We build custom voice assistants for companies wanting to keep their brand, users, and data. Built on the foundation of proprietary Speech-to-Meaning® and Deep Meaning Understanding® technologies, the Houndify platform provides conversational intelligence unmatched by others in the industry. Houndify everything! Voice-enable the world with conversational intelligence. Create a voice AI platform that exceeds human capabilities and brings value and delight via an ecosystem of billions of products enhanced by innovation and monetization opportunities. Headquartered in the heart of Silicon Valley, we are a global company with 9 offices in key markets and teams in 16 countries.
  • 6
    Alibaba Cloud Intelligent Speech Interaction
    Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French, and Indonesian, with other languages to come. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.
    Starting Price: $1.40 per hour
  • 7
    OTO

    OTO Systems

    OTO allows call centers 100% visibility of what is said during customer calls within 20 hours. Complement your NPS scoring with in-call intonation analytics. Identify call agent engagement and proactively set your WFM plan. Pick calls for QA faster. OTO is language-agnostic and gives you output parameters on various angles. Our API allows companies to start analyzing 100% of in-call conversations within a couple of hours. Sign up for a free trial and start analyzing your call data! Voice is the most valuable touchpoint between you and your customer. We're here to help you truly understand and leverage your voice data at scale. Whether you're building a mobile app or data analytics dashboards, our lightweight DeepTone™ engine gives you access to our powerful voice models on any device, providing you with a rich layer of acoustic labels for nearly every audio format.
    Starting Price: $100 per month
  • 8
    Agara

    Agara

    Agara is the world's leading Real-time Voice AI SaaS platform that processes customer support calls in real-time to eliminate hold time, reduce manual inputs and improve customer experience. Agara significantly improves customer satisfaction (CX) scores while reducing support costs by over 50%.
  • 9
    Vozy

    Vozy

    Vozy transforms the way companies interact with customers through voice assistants and conversational artificial intelligence to boost customer-centric enterprises with an automation that really works. With personalized solutions designed to meet the growing omnichannel customer care demand, Vozy is delivering significant cost savings and unprecedented customer experiences for companies in Latin America. That’s why powerhouses like SURA, Bancolombia, Protección, and Emtelco trust Vozy.
  • 10
    FortressIQ

    Automation Anywhere

    FortressIQ enables enterprises to decode work, transform experiences, and enhance workflows with the industry’s most advanced process intelligence platform. Using innovative computer vision and artificial intelligence, FortressIQ delivers unprecedented process insights, extremely fast, and with detail and accuracy unattainable with traditional methods. The platform autonomously acquires process data at scale even as processes extend across systems, empowering enterprises to understand, monitor, and improve operations, employee and customer experiences, and every business process. FortressIQ was founded in 2017, and is backed by Lightspeed Venture Partners, Boldstart Ventures, Comcast Ventures, Eniac Ventures, M12 and Tiger Global. Pinpoint inefficiencies and process variations continuously and automatically to reveal optimal process paths and reduce time to automation.
  • 11
    ELSA Speak

    ELSA Speak

    ELSA, English Language Speech Assistant, is a fun and engaging app specially designed to help you improve your English pronunciation. ELSA's artificial intelligence technology was developed using voice data of people speaking English with various accents. This allows ELSA to recognize the speech patterns of non-native speakers, setting it apart from most other voice recognition technologies. Strict but caring, the ELSA AI Coach pays close attention to every bit of progress you make along the way, and reminds you when you go off track. You will be rewarded for your hard work. ELSA gets smarter every day! Traditional language learning is transformed by our personalized English teaching technology. Our self-evolving AI analyzes your performance and behavioral data to personalize your daily curriculum. We are the first and best speech recognition app designed to evaluate and give immediate, detailed feedback on pronunciation and fluency.
    Starting Price: Free
  • 12
    Symbl

    Symbl.ai

    Symbl is an API platform for developers and businesses to rapidly deploy conversational intelligence at scale – on any channel of communication. Our comprehensive suite of APIs unlocks proprietary machine learning algorithms that can ingest any form of conversation data to identify actionable insights across domains and channels (voice, email, chat, social) contextually – without the need for any upfront training data, wake words, or custom classifiers. Symbl is democratizing conversational tech to make collaboration effortless at scale. We provide the technology for organizations to deploy our proprietary workplace productivity API at scale so brands can optimize key workflows for knowledge workers or enhance the customer experience. Whether you are a seasoned developer or just starting to explore how to harness employee collaboration to fit your organization’s needs, our API can be customized for your specific applications.
  • 13
    goFLUENT

    goFLUENT

    goFLUENT is the world’s leading blended learning solution provider for acquiring and refining communication skills in strategic business languages such as English, French, German, Italian, Mandarin, Portuguese, and Spanish. Dedicated to diversity & inclusion, talent development, and employee retention, our global mission is to provide all employees with an equal voice to reach their full potential, regardless of their native tongue. We accelerate language training by delivering hyper-personalized solutions that blend technology, content, and human interaction, available globally on any device. Transforming more than 1,000 international corporations’ language training approaches in 150+ countries, goFLUENT speeds up the acquisition of language skills needed to gain confidence, save time, and grow their talent on a global scale.
  • 14
    Earworms

    earworms Learning

    Have you ever had an earworm? Catchy music and lyrics that you just can't get out of your head? Well, utilizing the power of music, the Earworms Musical Brain Trainer puts the words of a foreign language into your head! In recent years there have been a lot of advances in language learning techniques, supported by the findings of neurological science and linguistic pedagogy. Earworms MBT bundles a lot of these findings into a powerful edutainment language learning system, unique in its teaching power. Listening to the melodious tracks puts users into a relaxed state of alertness, ideal for learning. The sound patterns, combined with rhythmic repetitions from a mesmeric male voice speaking English and a female native speaker of the target language, 'worm' their way into the auditory cortex -- the area of the brain from which words can be easily imagined and recalled.
  • 15
    LumenVox Automatic Speech Recognition (ASR)
    Transforming customer engagement with AI-powered voice recognition and voice authentication technology. Our flexible voice-enabled technology allows you to create a solution that meets all of your customers' demands, affordably and reliably. We do one thing, and we do it well. And that's voice enablement for your apps. Finally, deliver great voice automation and interactions. Whether it's short, simple commands or conversational questions, LumenVox ASR and TTS are accurate and affordable, helping you improve efficiency on both sides of the phone line. You will never repeat yourself. Recognize multiple dialects from a single global language model to serve all your customers. We give you maximum flexibility from a capabilities, implementation and monetization perspective. If you can think it, you can build it with LumenVox.
  • 16
    GoVivace

    GoVivace

    Our automatic speech recognition engine supports several English accents and can be localized to any language. The ASR engine also supports standard telephony as well as web and mobile applications. Capable of acting on voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. The engine compares the spoken input with a number of pre-specified possibilities and converts speech to text. The entire set of pre-specified possibilities constitutes the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only a very simple grammar for its processing. It can also support very large grammars for complex tasks.
  • 17
    Voice Pro

    LinguaTec

    Voice Pro Enterprise has been developed especially for use in enterprises. The recognition is done on the company server and can be accessed from any device (PC, Mac, smartphone, tablet). This ensures that all in-house information remains within the company. No more time-consuming speaker training is necessary, thanks to the speaker-independent recognition technology: Just speak into your device and you will see the transcribed text immediately. Companies finally have a sophisticated and secure speech recognition solution at their disposal. Regardless of whether you need to create a document at your work station, write an email on the move or dictate a sales report on site: Voice Pro Enterprise saves time and helps to make employees more productive. Voice Pro Enterprise results in a noticeable increase in employee efficiency. With Voice Pro Enterprise you dictate on average three times faster than you type. The high recognition accuracy minimizes post-processing.
    Starting Price: €149 one-time payment
  • 18
    Classtime

    Classtime Inc.

    Classroom management solution designed for students and teachers with features such as analytics, real-time grading, and libraries. Classtime is a solution for teachers that complements in-class teaching with immediate feedback on students' level of understanding. Create great questions, engage everyone, improve understanding. No registration required to see how it works!
    Starting Price: $9.00/month/teacher
  • 19
    Talkio AI

    Talkio AI

    Talkio AI is built on top of ChatGPT and lets you interact with the AI through voice to train your oral language skills. Talkio AI offers premium voices and supports multiple dialects for the most popular languages. With our advanced language technology, you can immerse yourself in authentic conversations and gain proficiency in the dialects that matter most to you. Ever wondered how it would be to have a personal language tutor available anytime, anywhere? At Talkio AI, we turn this dream into reality. Our AI Tutors are the perfect companions to improve your oral language skills. Powered by advanced AI technology, they mimic human interaction and conversation, offering an immersive language learning experience that is both engaging and effective.
  • 20
    Pronounce

    Pronounce

    Pronounce is an innovative language learning platform focused on enhancing English pronunciation and fluency through AI-driven tools. It offers instant feedback on American or British English accents, making it ideal for anyone looking to improve their spoken English. The platform features AI speech checking, meeting transcription, and AI chats with virtual speaking partners to practice conversational skills. Available with both free and premium plans, Pronounce caters to a broad audience, from language learners to professionals seeking to refine their communication skills in specific environments.
    Starting Price: Free
  • 21
    Memrise

    Memrise

    Specialising in combining cognitive science, powerful tech and entertaining content, Memrise makes language learning genuinely recreational. We offer 200 language combinations across 24 languages on our website, iOS and Android apps. By leveraging lots of brain science and plenty of humour, we’re striving to enrich people’s consciousness and help people achieve confident, real-world language skills in just a few short months. Memrise’s courses have one thing that textbooks don’t: real-life language. Our team of in-house linguists are not only experts but also passionate about teaching you the language they speak themselves in everyday life. To add to the richness, our courses are packed with thousands of video clips of native speakers speaking in their native language, in their hometown. So you can learn to understand authentic voices and accents, as well as taking in the scenery and getting a sense of the culture.
  • 22
    AITalk

    AITalk

    Unlock language mastery with AITalk – your AI-powered companion for fluent conversations anytime, anywhere. Learn to speak naturally by chatting with AI. Pick topics, chat freely, and master any language, one conversation at a time. Boost your IELTS speaking skills and beyond with our all-in-one app: AI-powered conversations, writing assistance, creative naming, and grammar correction at your fingertips. Boost your IELTS Speaking score with our AI app, offering personalized practice and instant feedback for confident communication. Immerse yourself in authentic conversations with lifelike AI partners, each with their own unique voice and personality. This immersive experience enhances your learning and helps you understand different accents and speech patterns more effectively.
  • 23
    Open English

    Open English

    Connect to our classes, which start every 30 minutes, from your computer or our app. Learn with an immersion method in English and native pronunciation experts. Enjoy a portal included in your course that prepares you for these exams at no additional cost. Receive a certificate based on the levels of the Common European Framework of Reference (CEFR). You will have at your fingertips an advanced voice recognition technology tool that will show you how to improve your diction. Master English at work thanks to our content focused on specific professional areas. In our program, you have 24-hour access to live classes with expert teachers in online teaching. Be part of our student group, where you can interact with other students and exchange ideas that will help you in your learning process. Various publications recognize the success of the #1 leading online English course, thanks to real results in real students.
    Starting Price: Free
  • 24
    Pimsleur

    Simon & Schuster

    Combining the ease & interactivity of language learning apps with the convenience and power of the portable Pimsleur Method™ ... learning languages online has never been easier. Expand your horizons. Reconnect with your Heritage. Travel with confidence. Discover new worlds. Experience life-changing adventures. Create unforgettable memories. Easy listening, rewarding results in just 30 minutes a day! Give us 30 minutes a day and we’ll have you speaking your new language in no time. That’s all it takes for you to confidently inquire about prices, order dinner, ask for (or offer) directions – in your new language – with a near-native accent. Just listen, respond and learn to converse in … French while commuting … German while jogging … Spanish while cooking. It’s really that portable and flexible.
  • 25
    italki

    italki

    italki is a global language learning community that connects students and teachers for 1-on-1 online language lessons. At italki, we believe that human interaction and cultural sharing are the best way to become fluent in a foreign language. With over 7 million students and 20,000 high-quality teachers teaching more than 130 languages, italki can help everyone with their personal journey to fluency.
    1-on-1 lessons in more than 150 languages
    * Learn from certified teachers with proven experience
    * Find teachers from all over the world sharing their languages, dialects, and cultures
    * Study at your own pace without worrying about rigid schedules or fixed fees
    Practice for free with the italki community
    * Develop your language skills by building connections with others
    * Receive feedback from native speakers and professional teachers
    * Meet and share experiences with millions of language learners from more than 190 countries
    italki: Become fluent in any language
    Starting Price: $8 / lesson
  • 26
    Fluento

    Fluento

    Speak English fluently with personalized feedback, role-playing, and connection to a global community. Feedback that is better than a real teacher. Get comprehensive feedback and analysis with Fluento that focuses on achieving your goals, measuring progress, and boosting confidence. AI guidance boosts your confidence by providing tips and suggestions during conversations. Plan your practice ahead according to your schedule or connect now. Master language skills with super personalized realistic role-playing scenarios like job interviews or salary negotiation. Algorithms that make sure you connect with motivated, like-minded learners, like you. An algorithm that matches you with similar people who speak a different native language. Join a global community of 590 motivated learners and continue the conversation between practice sessions. Fluento is super customized, we can build tasks for every scenario.
  • 27
    Babbel

    Lesson Nine

    Welcome to Babbel for Business. Prepare your company for the future with our cost-efficient and flexible language learning solution. For more than 10 years, Babbel has been breaking down language barriers and helping people to understand each other better. The new online group classes with Babbel Live enable language learning in small groups with certified teachers. Whether your team is working remotely or from the office, connect your employees through a motivating language learning experience! German, English, Spanish, French, Polish, Dutch, Italian, Portuguese, Danish, Swedish, Norwegian, Turkish, Indonesian, Russian. Babbel courses are suitable for all abilities — from complete beginners to learners who are looking to refresh their existing knowledge. Babbel’s courses have been meticulously crafted by our team of hundreds of language experts, with each lesson tailored specifically to your learners’ native language.
  • 28
    Fluenz

    Fluenz

    Programs for serious language learners. On your phone, laptop, or tablet, we'll take you to fluency with hundreds of video-tutorials and tens of thousands of workouts. Learning a language is hard. Games, fads, or free Apps can't get you there. Fluenz is the real alternative. We're the only digital company that engages its learners face to face at our Immersions all over the world. Every moment of learning has been carefully calibrated to help you master the language. We've invested millions of dollars to find just the right sequence, timing, and explanation you will need. We can't hide behind the screen, and instead deliver results for our participants every single day. From our multi-device App to the global Immersions, we have spent 15 years creating the most comprehensive language journey in the world.
  • 29
    Infosys Nia
    Infosys Nia™ is an enterprise-grade AI platform which simplifies the AI adoption journey for Business & IT. Infosys Nia supports the end-to-end enterprise AI journey, from data management and digitization of documents and images to model development and operationalizing models. Nia’s advanced, modular and scalable capabilities address business needs across Enterprises. Nia Data provides highly effective tools and frameworks for complex data workflows to power further ML experimentation on the Nia AML workbench. The Nia DocAI platform automates the end-to-end document processing lifecycle from ingestion to consumption, using AI capabilities such as InfoExtractor, computer vision, NLP and cognitive search.
  • 30
    NaturalText

    NaturalText

    NaturalText A.I. helps you get more out of your data. Discover relationships, create collections, and unveil hidden insights in documents and other text-based data. NaturalText A.I. uses novel artificial intelligence technology to uncover hidden relationships in data. The software uses various state-of-the-art methods to understand context, analyze patterns, and reveal insights—all in a human-readable way. Reveal insights hidden in your data. Finding everything hidden in your text data is a difficult, if not impossible, task. With traditional search, you can only locate information related to a document. NaturalText A.I., on the other hand, uncovers new information within millions of documents, including scientific papers and patents. Use NaturalText A.I. to reveal insights in the data you are currently missing.
    Starting Price: $5000.00
  • 31
    RapidMiner
    RapidMiner is reinventing enterprise AI so that anyone has the power to positively shape the future. We’re doing this by enabling ‘data loving’ people of all skill levels, across the enterprise, to rapidly create and operate AI solutions to drive immediate business impact. We offer an end-to-end platform that unifies data prep, machine learning, and model operations with a user experience that provides depth for data scientists and simplifies complex tasks for everyone else. Our Center of Excellence methodology and the RapidMiner Academy ensure customers are successful, no matter their experience or resource levels. Simplify operations, no matter how complex models are, or how they were created. Deploy, evaluate, compare, monitor, manage and swap any model. Solve your business issues faster with sharper insights and predictive models; no one understands the business problem like you do.
    Starting Price: Free
  • 32
    Sia

    OneOrigin

    Sia™ revolutionizes higher education by streamlining student lifecycle management from enrollment to retention. This AI-driven tool quickly processes transcripts, aiding in credit transfers and boosting student retention. By analyzing academic histories and interests, Sia™ offers personalized course and career recommendations, enhancing student engagement and academic planning. Its role as a virtual assistant on university websites simplifies information access, reducing staff workload and improving student experience. Sia™'s innovative approach transforms administrative processes, ensuring efficient, personalized support for student success.
  • 33
    Analance
    Combining Data Science, Business Intelligence, and Data Management Capabilities in One Integrated, Self-Serve Platform. Analance is a robust, scalable end-to-end platform that combines Data Science, Advanced Analytics, Business Intelligence, and Data Management into one integrated self-serve platform. It is built to deliver core analytical processing power to ensure data insights are accessible to everyone, performance remains consistent as the system grows, and business objectives are continuously met within a single platform. Analance is focused on turning quality data into accurate predictions, providing both data scientists and citizen data scientists with point-and-click pre-built algorithms and an environment for custom coding. Ducen IT helps Business and IT users of Fortune 1000 companies with advanced analytics, business intelligence and data management through its unique end-to-end data science platform called Analance.
  • 34
    Abacus.AI

    Abacus.AI

    Abacus.AI is the world's first end-to-end autonomous AI platform that enables real-time deep learning at scale for common enterprise use-cases. Apply our innovative neural architecture search techniques to train custom deep learning models and deploy them on our end-to-end DLOps platform. Our AI engine will increase your user engagement by at least 30% with personalized recommendations. We generate recommendations that are truly personalized to individual preferences, which means more user interaction and conversion. Don't waste time dealing with data hassles. We will automatically create your data pipelines and retrain your models. We use generative modeling to produce recommendations, which means that even with very little data about a particular user/item you won't have a cold start.
  • 35
    FirstLanguage

    FirstLanguage

    Our Natural Language Processing (NLP) APIs provide best-in-class accuracy at an affordable rate and cover all aspects of NLP under a single roof. Save weeks of time training and creating language models. Take advantage of our best-in-class APIs to kickstart your app development. We provide the building blocks to create your own apps effectively, like chatbots, sentiment analysis, etc. Text classification on multiple domains and 100+ languages. Perform effective sentiment analysis. We grow when your business does. So we have put together simple pricing that allows you to easily scale your business when it needs to evolve. Perfect for individual developers who are creating apps or building proofs of concept. Head to the Dashboard and get your API Key. Place this in the header of all your API calls. Use our SDK in your preferred language to start coding. Or you can refer to the auto-generated code blocks provided in 18 programming languages.
    Starting Price: $150 per month
  • 36
    DeepScribe

    DeepScribe

    DeepScribe’s AI-powered scribe captures the natural conversation between a clinician and patient and automatically writes medical documentation, allowing clinicians to focus on patient care instead of note-taking. Through an easy-to-use mobile app, DeepScribe records the natural clinical encounter and transcribes it in real time. Our proprietary AI then extracts the medical information from the transcript, classifies it into a standard note, and then integrates that note directly into a clinician’s electronic health record system. Unlike traditional scribes, dictation tools, or other solutions, the ambient nature of DeepScribe means it doesn’t intrude on the patient visit or disrupt the clinical workflow. Providers can simply talk to their patient like normal, then review their notes after the visit and sign-off in their EHR. DeepScribe handles documentation, charting, and even populates suggested diagnostic coding based on the information extracted from the visit.
  • 37
    IDVoice

    IDVoice

    ID R&D

    Voice biometrics is the science of using a person’s voice as a uniquely identifying characteristic for the purpose of authentication and/or personalizing the user experience. The technology is referred to in a variety of ways, including voice verification, speaker verification, speaker identification, and speaker recognition. There are two ways we put voice biometrics into practice. The first is Text Independent Voice Verification. This approach does not depend on the person speaking a particular passphrase. The other is Text Dependent Voice Verification, in which the user enrolls using a specific phrase but, unlike a password, this phrase is not secret. IDVoice enables both options depending on your use case, and in some scenarios they may be used together.
  • 38
    Dragon Law Enforcement

    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy, all by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments, making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 39
    Dragon Professional Group

    Dragon Professional Group

    Nuance Communications

    Empower employees to dictate documents 3 times faster than typing with up to 99% recognition accuracy, right from the first use. Since documents are created in a fraction of the time it would typically take typing by hand, they spend less time on paperwork, and more time on more profitable tasks. With a next‑generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse workgroups and settings. Dragon makes it easy to automate tasks or short‑cut repetitive steps. Create custom voice commands to insert standard boilerplate text or signatures into documents. Or create time‑saving macros to automate multi‑step workflows by voice. Once created, share these customizations across the Dragon user community for efficiency gains.
  • 40
    tazti

    tazti

    Voice Tech Group

    Welcome to the tazti website! tazti is state-of-the-art Speech Recognition & Voice Recognition software. You can easily link tazti to files, folders, programs, videos and songs on your PC to open them by voice control. Play PC games and control applications, programs, and robots by voice command! Over 300,000 people have now tried tazti and its many features. tazti is super fun, especially if you are tired of pounding your keyboard or want an easy-to-use assistive technology. It is also great for people with arthritis, carpal tunnel, tendonitis, fibromyalgia or other hand, finger or wrist pain.
    Starting Price: $39.99
  • 41
    Whisper

    Whisper

    OpenAI

    We’ve trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise, and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder.
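The first preprocessing step described above, splitting input audio into 30-second chunks, can be sketched as below. This is a simplified illustration rather than OpenAI's code: the 16 kHz sample rate and the zero-padding of the final chunk are assumptions not stated in the description, and the log-Mel spectrogram and encoder stages are left out.

```python
from typing import List

SAMPLE_RATE = 16_000   # assumed sample rate in Hz (not given in the text above)
CHUNK_SECONDS = 30     # chunk length taken from the description
CHUNK_SAMPLES = SAMPLE_RATE * CHUNK_SECONDS


def split_into_chunks(samples: List[float], chunk_samples: int = CHUNK_SAMPLES) -> List[List[float]]:
    """Split mono audio into fixed-length chunks, zero-padding the last one."""
    chunks = []
    for start in range(0, len(samples), chunk_samples):
        chunk = samples[start:start + chunk_samples]
        # Pad a short final chunk with silence so every chunk has equal length.
        chunk = chunk + [0.0] * (chunk_samples - len(chunk))
        chunks.append(chunk)
    return chunks


# 70 seconds of silence as stand-in audio is covered by three 30-second chunks.
audio = [0.0] * (70 * SAMPLE_RATE)
chunks = split_into_chunks(audio)
```

Fixed-length chunks mean each one can be turned into a spectrogram of identical shape before being passed to the encoder.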
  • 42
    LearningBranch

    LearningBranch

    LearningBranch

    LearningBranch is an automated conversational-based AI assessment and training platform that helps hire and train employees who will perform. Our communication and language assessments focus on real-world people skills. Standardize assessments and rank candidates based on performance free of biases. Our conversational-based assessments are bias-free and data driven. Monitor and up-skill your existing hires and create benchmarks on the skills that drive performance. The platform provides assessment and training tools focusing on the live spoken and written soft skills for customer service, sales and teamwork. LearningBranch enables you to replace interviews, increase speed to hire, and increase speed to proficiency.
  • 43
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 44
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 45
    Verbio

    Verbio

    Verbio

    Increase security and improve user experience in daily interactions with the unique potential of voice. An innovative, language-agnostic, cost-effective and reliable alternative to seamlessly verify and identify users in real time. Voice biometrics makes it possible to automatically recognize any person through the characteristics of their voice, and it can replace traditional authentication methods (cards, passwords, signatures, fingerprints, etc.) in security access control, user verification for digital transactions, or fraud prevention and detection. With an easy and cost-effective solution, authentication through voice biometrics brings an innovative and safe experience to users, with risk-free remote access. Biometric authentication and identification through voice has never been so secure and fast, with different operational utterance models for each type of client and advanced anti-spoofing methodologies.
  • 46
    AppTek

    AppTek

    AppTek

    AppTek is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), and natural language understanding (NLU). The AppTek platform delivers industry-leading, real-time streaming and batch technology solutions in the cloud or on-premise for organizations across a breadth of worldwide markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages, dialects, and channels. AppTek utilizes deep neural networks to transcribe and understand speech and text data, delivering more accurate and efficient tools.
  • 47
    Autogon

    Autogon

    Autogon

    Autogon is a leading AI and machine learning company that simplifies complex technology to empower businesses with accessible, cutting-edge solutions for data-driven decisions and global competitiveness. Discover the empowering potential of Autogon models as they enable industries to leverage the power of AI, fostering innovation and fueling growth across diverse sectors. Experience the future of AI with Autogon Qore, your all-in-one solution for image classification, text generation, visual Q&A, sentiment analysis, voice cloning, and more. Empower your business with cutting-edge AI capabilities and innovation. Make informed decisions, streamline operations, and drive growth without the need for extensive technical expertise. Empower engineers, analysts, and scientists to harness the full potential of artificial intelligence and machine learning for their projects and research. Create custom software using clear APIs and integration SDKs.
  • 48
    Work by Speech

    Work by Speech

    Mikołaj Magowski

    Work by Speech is the first program in the world that allows efficient work on a computer by speech without needing a keyboard and mouse. Work by Speech features:
    - Efficient work on a computer by speech alone
    - Quiet speaking support
    - Application switching and opening by speech
    - Built-in voice commands for the most common actions
    - Custom voice commands management
    - Macro recording and editing
    - Separate dictation mode
    - Fast and repeatable mouse control by speech with support for all mouse actions
    - Customizable mousegrid that can be moved by speech
    - Automatic mousegrid optimization for every used application
    - Very low processor and memory usage
    - Works with any microphone under Windows 10 and 11
    - Available for the English language only
    - Free updates
    Starting Price: Free
  • 49
    TrulyNatural
    Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state-of-the-art cloud performance. Unlike the voice recognition software often used with smartphones and mobile devices, such as voice assistant mobile apps, and with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a Wi-Fi connection. Many applications don’t need or want to rely on a cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. Market concerns regarding privacy, performance and bandwidth are driving more processing to the edge.
  • 50
    VoiceMe

    VoiceMe

    VoiceMe

    In an increasingly contactless world, a new model of digital trust is needed. VoiceMe enables people, companies, and objects to interact with each other through a simple interface and in an ultra-secure way, opening the door to a new generation of services. Access restricted physical areas while guaranteeing users' identity. Sign documents and contracts with legal validity. Our algorithms pre-identify the user based on behaviors, also using biometric parameters obtained from the upper face and voice. All customer-related data remains exclusively at the user's disposal, offering maximum privacy and compliance with GDPR. Each data set is encrypted, divided into pieces, and spread across a network of nodes, making it impossible for an unauthorized external source to extract. At each authorized data usage, the inverse process is performed to recompose the data set. An API or SDK for third parties allows easy integration into existing systems.
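The idea of dividing a data set into pieces, spreading them across nodes, and reversing the process on authorized use can be illustrated with XOR-based secret splitting. This is a generic textbook technique swapped in for illustration, not VoiceMe's actual scheme; the node names and the sample record are hypothetical, and the separate encryption step mentioned above is not modeled.

```python
import secrets
from functools import reduce
from typing import List


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """XOR two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def split_secret(data: bytes, pieces: int) -> List[bytes]:
    """Split data into `pieces` shares; all of them are needed to rebuild it."""
    if pieces < 2:
        raise ValueError("need at least two pieces")
    # All but the last share are pure random bytes.
    random_shares = [secrets.token_bytes(len(data)) for _ in range(pieces - 1)]
    # The last share is the data XORed with every random share.
    final_share = reduce(xor_bytes, random_shares, data)
    return random_shares + [final_share]


def recombine(shares: List[bytes]) -> bytes:
    """Inverse process: XOR all shares together to recover the data."""
    return reduce(xor_bytes, shares)


record = b"voiceprint-template-bytes"       # hypothetical data set
shares = split_secret(record, pieces=4)
nodes = {f"node-{i}": share for i, share in enumerate(shares)}  # hypothetical nodes
restored = recombine(list(nodes.values()))
```

In this sketch any single node holds only random-looking bytes, and the record can be rebuilt only when every share is collected. A production system would more likely use a threshold scheme so that a lost node does not make the data unrecoverable.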