Best Artificial Intelligence Software for Windows - Page 51

Compare the Top Artificial Intelligence Software for Windows as of May 2026 - Page 51

  • 1
    Soroco Scout
    Soroco Scout is a powerful process discovery and task mining tool developed by Soroco. It helps organizations understand how their teams work by analyzing digital interactions across multiple platforms. Scout gathers detailed insights into the workflows and patterns of daily tasks, revealing inefficiencies and bottlenecks in business processes. Using AI and machine learning, it provides data-driven recommendations to optimize operations, streamline workflows, and improve productivity. Soroco Scout empowers companies to make more informed decisions by mapping out how work is truly done, offering a comprehensive view of employee activity and areas for potential automation. This leads to more efficient processes and cost savings across the organization.
  • 2
    Converse Smartly
    Converse Smartly® is a powerful speech to text software which converts audio to text. It enables organizations and individuals to work smarter, faster and with greater accuracy. The application can be used to analyze dialogue or speech from team meetings, interviews, conferences and seminars. We strive to provide the preeminent online speech recognition tool by engaging cutting-edge speech-recognition technology for the most accurate results technology can achieve today, together with incorporating built-in tools to increase users' efficiency, productivity and comfort. Render the most advanced deep-learning neural network algorithms to the audio subject for speech recognition with unparalleled accuracy. Converse Smartly(s) Speech-to-Text accuracy improves over time as the continuous machine learning powered by enhanced algorithms improves the internal speech recognition technology used by multiple products.
  • 3
    Vocola 3

    Vocola 3

    Vocola 3

    Dictation with Windows Speech Recognition (WSR) works well for "WSR-friendly" applications like MS Word, Outlook, and PowerPoint. Dictated text is inserted directly into document text, and commands like "Delete hedgehog" can refer to specific document text. But WSR dictation works less well for "WSR-unfriendly" applications like MS Excel, Gmail, and most programming environments. Dictation is not inserted directly into document text, and commands cannot refer to document text. Vocola improves this situation by supporting direct dictation for WSR-unfriendly applications, and by allowing correction and modification of the just-dictated phrase. Vocola and WSR use the same underlying speech profile, so any improvements you make via training, correction, or the speech dictionary benefit WSR dictation and Vocola dictation equally. Dictation to WSR-unfriendly applications is essentially unusable in Vista, as every utterance raises the correction panel.
  • 4
    Dictation.io

    Dictation.io

    Dictation.io

    Use the magic of speech recognition to write emails and documents in Google Chrome. Dictation accurately transcribes your speech to text in real time. You can add paragraphs, punctuation marks, and even smileys using voice commands. Dictation can recognize and transcribe popular languages including English, Español, Français, Italiano, Português, and many more. You can add new paragraphs, punctuation marks, smileys and other special characters using simple voice commands. For instance, say "New line" to move the cursor to the next list or say "Smiling Face" to insert :-) smiley. Dictation uses Google Speech Recognition to transcribe your spoken words into text. It stores the converted text in your browser locally and no data is uploaded anywhere. Learn more. Dictation lets you write text in any language by voice alone, without needing a keyboard or mouse.
  • 5
    Dragon Professional Anywhere

    Dragon Professional Anywhere

    Nuance Communications

    Nuance Dragon Professional Anywhere empowers busy professionals, including remote workers, to use their voice naturally to create more detailed and accurate documentation quickly and easily. Mission critical documentation should be dictated by knowledge workers and field professionals, not technology limitations. Conversational AI empowers private and public sector professionals to document more naturally. Enables professionals to quickly and easily document the details of client meetings using speech recognition that is 3x faster than typing and up to 99% accurate. Most people speak at over 120 wpm but type at less than 40 wpm. Speak freely and as much as you like with no per-user limits. Business professionals can stay productive anywhere and focus on their clients and business rather than the technology.
  • 6
    Dragon Legal Anywhere

    Dragon Legal Anywhere

    Nuance Communications

    Nuance’s Dragon Legal Anywhere helps attorneys, judges, clerks, paralegals, and other legal professionals create high-quality documentation, in less time, by using the power of their voice. Legal documentation should be dictated by legal practitioners, not technology limitations. Conversational AI empowers legal teams to document more naturally. Dragon Legal Anywhere’s specialized vocabulary means professionals can dictate contracts, briefs, or format legal citations and other legal documentation, 3X faster than typing, with up to 99% accuracy right from the first use. Speak freely and as much as you like with no per-user limits—legal professionals can stay productive anywhere and focus on their clients and business rather than the technology. Create custom voice commands to insert standard clauses into documents. Or create step‑by‑step commands to automate multi‑part workflows by voice.
  • 7
    Dragon Law Enforcement

    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy—Zall by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 8
    Read Aloud

    Read Aloud

    Read Aloud

    With the Read Aloud browser extensions you can read aloud the content of any web page with one click. This widget will work for all users, regardless of their operating system (desktop or mobile), regardless of the browser they're using, or whether they have the Read Aloud extension installed. See the widget live on our customers' websites. Convert text to speech and create voice narrations. Natural flowing voice and very helpful for multitasking, simple, easy, customizable. It works on a variety of websites, including news sites, blogs, fan fiction, publications, textbooks, school and class websites, online universities and course materials. Read Aloud is aimed at users who prefer to listen to content instead of reading, people with dyslexia or other learning disabilities, children learning to read, or simply to provide users with alternative way to consume web content.
  • 9
    Acapela TTS

    Acapela TTS

    Acapela Group

    Acapela TTS for Mac OS X has been designed to speech enable any Mac OS X based application with Acapela’s wide portfolio of languages and voices. Several APIs and programming languages are available to simplify the integration process, one common API with Acapela TTS for Windows allowing dual platform development. For accessibility applications, reading tools, K-12, language learning, language translation, Universal Design Literacy tools (UDL), learning and physical disabilities, professional video or audio generation, and much more. Easy integration into your installation and redistribution package, Mac App Store friendly. More than 120 voices in 30 languages and accents. Two voice qualities available in each language, to meet all your needs and constraints. Breathe life into your interface and content, improve accessibility of your product to people with difficulties reading or seeing text, give your users an eye-free experience.
  • 10
    @Voice Aloud Reader
    @Voice Aloud Reader reads aloud the text displayed in an Android app, e.g. web pages, news articles, long emails, sms, PDF files and more. Save articles opened in @Voice to files for later listening. Construct listening lists of many articles for uninterrupted listening one after the other. Order the list as needed, e.g. more important articles first. Pause/resume speech as needed with wired or Bluetooth headset buttons, plus click next/previous buttons to jump by sentence, long-click to switch to the next/previous article on a list. Options for additional pause between paragraph, start talking as soon as a new article is loaded or wait for a button press, start/stop talking when wired headset plug is inserted/removed.
  • 11
    FaceMe

    FaceMe

    CyberLink

    Designed for smart surveillance, FaceMe® Security is a value-added solution that runs on PC, workstations, servers, and integrates into VMS (video management systems). It detects individuals within a crowd through their face and can identify them by matching the live capture with profiles kept in a database, even for people wearing a mask. It also displays their body temperature, detects anyone not wearing a mask properly over their nose and mouth and highlights block-listed people. It can send real-time alerts to security personnel or other people in the organization. FaceMe® Security Central compares the extracted facial templates with the database in order to confirm identity. It also provides a web-based console to manage the Microsoft SQL based face database, configure IP cameras, and inform relevant personnel about registered visitors, VIPs, block-listed people, or employees entering each monitored area.
  • 12
    Acapela Cloud

    Acapela Cloud

    Acapela Group

    Acapela Cloud online service allows to easily build speech enabled applications. It features an easy to integrate API, a web interface with advanced UX, new layouts as well as prompt editing capabilities. Cost effective and very easy to use, it gives all content a natural (digital) voice. It provides an immediate solution to answer all needs for voice interface or audio interactivity, in a wide range of languages and voices. With only a few lines of code, connect to the Acapela Cloud server, send the text to be spoken and let the service do its job! Acapela Cloud will instantly generate the voice file that will be played on your applications or devices. Over 30 languages and 100 standard voices are available, 24/7. Check out the list on the Acapela Cloud website. Easily integrate speech synthesis capability into your application and control every aspect of the voice generation process using various features, parameters, settings and effects.
  • 13
    Mobius Conveyor
    With Mobius Conveyor on your iPhone or iPad, you have the world's most flexible dictation system at your disposal. Instantly dictate onto any computer and into any EMR with our month-to-month subscriptions. Dictate to your heart's content. Unlimited usage is always included as a feature. Mobius is compatible with all of the software you use at work, including your EMRs. From clinic to clinic, hospital to hospital, whether in the car or at home, Mobius travels with you. No matter which computer you find yourself in front of, now you can dictate with your personal vocabulary, custom macros, and AI-trained voice recognition. With live dictation mode, your spoken words appear wherever your cursor is placed. Dictate documentation, messages to patients, Word documents, or even e-mails. Anywhere you would usually type, now you can dictate.
  • 14
    alwaysAI

    alwaysAI

    alwaysAI

    alwaysAI provides developers with a simple and flexible way to build, train, and deploy computer vision applications to a wide variety of IoT devices. Select from a catalog of deep learning models or upload your own. Use our flexible and customizable APIs to quickly enable core computer vision services. Quickly prototype, test and iterate with a variety of camera-enabled ARM-32, ARM-64 and x86 devices. Identify objects in an image by name or classification. Identify and count objects appearing in a real-time video feed. Follow the same object across a series of frames. Find faces or full bodies in a scene to count or track. Locate and define borders around separate objects. Separate key objects in an image from background visuals. Determine human body poses, fall detection, emotions. Use our model training toolkit to train an object detection model to identify virtually any object. Create a model tailored to your specific use-case.
  • 15
    Numa

    Numa

    Numa

    Numa is the AI Customer Operations System built for dealerships that are tired of losing revenue and customers to broken processes. Numa fixes customer experience & operations at the infrastructure level that can follow-up with your customers automatically and give visibility into customer satisfaction to your advisors, reps, and managers. Operator answers and routes every inbound call so nothing goes dark. Status Updates proactively reaches out to customers so advisors stop drowning in callbacks. Voice AI books appointments automatically so customers never wait. LiveCSI flags heat cases in real time so managers can intervene before a CSI score takes the hit. Opportunities can proactively reach out on declined services, open recalls, and equity moments. And it all runs through one unified system: one inbox, one shared context. The result: recovered revenue, freed-up advisors, and a customer experience that increases CSI and builds loyalty.
  • 16
    PRSONAS-Greeter

    PRSONAS-Greeter

    PRSONAS by nuMedia Innovations

    PRSONAS-Greeter™ welcomes customers to your office, live event, retail, or any other physical location. The Greeter initiates the greeting in a safe, touchless way using motion activation. It will engage with guests in a real conversation with advanced speech recognition providing them with all the desired information they need about your business or facility. Your guest receive immediate assistance without interrupting your staff and gathers key insights allowing you to know what your guests are asking for, how to serve them better at no additional cost to your company for staffing.
    Starting Price: $199. /month
  • 17
    SHARK

    SHARK

    SHARK

    SHARK is a fast, modular, feature-rich open-source C++ machine learning library. It provides methods for linear and nonlinear optimization, kernel-based learning algorithms, neural networks, and various other machine learning techniques. It serves as a powerful toolbox for real-world applications as well as research. Shark depends on Boost and CMake. It is compatible with Windows, Solaris, MacOS X, and Linux. Shark is licensed under the permissive GNU Lesser General Public License. Shark provides an excellent trade-off between flexibility and ease-of-use on the one hand, and computational efficiency on the other. Shark offers numerous algorithms from various machine learning and computational intelligence domains in a way that they can be easily combined and extended. Shark comes with a lot of powerful algorithms that are to our best knowledge not implemented in any other library.
  • 18
    PyTorch

    PyTorch

    PyTorch

    Transition seamlessly between eager and graph modes with TorchScript, and accelerate the path to production with TorchServe. Scalable distributed training and performance optimization in research and production is enabled by the torch-distributed backend. A rich ecosystem of tools and libraries extends PyTorch and supports development in computer vision, NLP and more. PyTorch is well supported on major cloud platforms, providing frictionless development and easy scaling. Select your preferences and run the install command. Stable represents the most currently tested and supported version of PyTorch. This should be suitable for many users. Preview is available if you want the latest, not fully tested and supported, 1.10 builds that are generated nightly. Please ensure that you have met the prerequisites (e.g., numpy), depending on your package manager. Anaconda is our recommended package manager since it installs all dependencies.
  • 19
    MXNet

    MXNet

    The Apache Software Foundation

    A hybrid front-end seamlessly transitions between Gluon eager imperative mode and symbolic mode to provide both flexibility and speed. Scalable distributed training and performance optimization in research and production is enabled by the dual parameter server and Horovod support. Deep integration into Python and support for Scala, Julia, Clojure, Java, C++, R and Perl. A thriving ecosystem of tools and libraries extends MXNet and enables use-cases in computer vision, NLP, time series and more. Apache MXNet is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision-making process have stabilized in a manner consistent with other successful ASF projects. Join the MXNet scientific community to contribute, learn, and get answers to your questions.
  • 20
    BriefCam

    BriefCam

    BriefCam

    The BriefCam® complete video content analytics platform drives exponential value from surveillance system investments by making video searchable, actionable and quantifiable. The unique fusion of VIDEO SYNOPSIS® and Deep Learning solutions enable rapid video review and search, face recognition, real-time alerting and quantitative video insights. Improves post-event investigation productivity by pinpointing people and objects of interest with speed and precision. Real-time alerting capabilities enable organizations to proactively respond to situational changes in their environment. Extract and aggregate video metadata such as men, women, children, vehicles, size, color, speed, path, and more, enabling users to quantitatively analyze their video. BriefCam’s comprehensive and extensive video content analytics platform is deployed by law enforcement and public safety organizations, government and transportation agencies, major enterprises, healthcare and educational institutions.
  • 21
    Qognify

    Qognify

    Qognify

    Qognify helps organizations minimize the impact of incidents with its innovative portfolio of video management software and enterprise incident management solutions. With thousands of deployments in banks, utility companies, airports, seaports, city centers, and transportation agencies, Qognify helps agencies all over the world keep people and assets safe. Qognify places a premium on operational and physical security strategies because safety is priceless. Qognify solutions help organizations capture, analyze, and leverage big data to anticipate, manage, and mitigate security and safety risks, maintain business continuity, and streamline operations. The Qognify offerings provide valuable insights that enable enterprises and security-conscious organizations to take the best action at the right time by correlating structured and unstructured data from multiple sensors and channels, detecting irregular patterns, and recognizing trends.
  • 22
    IBM Intelligent Video Analytics
    IBM Intelligent Video Analytics has been helping agencies and organizations worldwide analyze video captured by fixed cameras, such as those used for physical security, closed-circuit television (CCTV), and monitoring traffic, to extract key information from streaming video to uncover insights and patterns within untold hours of camera footage. Real-time alerts to call attention to events. Rich content-based indexing to find critical images and patterns. Standards-based open and extensible architecture. Ingestion of pre-recorded videos from both fixed cameras and cameras in motion. With ingested video files, analysts can extract critical information and find relevant images faster, which may help accelerate investigations. Advanced facial recognition, which may improve lead generation and risk assessment. Matching faces on the video to an agency's or organization's watch list may help them identify persons of interest and speed investigation.
  • 23
    AXIS Camera Station

    AXIS Camera Station

    Axis Communications

    AXIS Camera Station is a video and access management software for surveillance suitable for a wide range of businesses. Retail stores, hotels, schools and manufacturing industries are just some of the companies that enjoy full control and protection of their premises and can quickly take care of incidents. All to make businesses run more smoothly. AXIS Camera Station matches our other network video products and features to offer you a complete, flexible, safe and reliable system. AXIS Camera Station is powerful and easy to use with an intuitive interface so anyone can manage the system, handle incidents and quickly export high-definition evidence. Axis Camera Station is upgraded with new possibilities all the time to better protect your premises and make your life easier. Get to know the newest features in AXIS Camera Station video management software. Find hardware guidelines, information about supported products and how to design and maintain your system.
  • 24
    PyGaze

    PyGaze

    PyGaze

    PyGaze offers a platform that is more user-friendly than the currently existing alternatives, when it comes to creating complicated experiments (or other software). From stimulus presentation to eye-tracker communication: everything can be handled by via PyGaze scripting. Of course, we do rely heavily on all the brilliant dependencies (see the links in the first paragraph) and you could do pretty much the same if you were to use each of these independently, but this would require more programming skills and would cost more effort and time than when you would use PyGaze. Apart from this, PyGaze does come with some added functionality. For example, we provide an implementation of a saccade detection algorithm that was specifically designed for online detection (for details, please refer to our paper).
  • 25
    Cheat Layer

    Cheat Layer

    Cheat Layer

    CheatLayer exposes a powerful GPT-4 powered scripting layer on all websites to automate business tasks and save hundreds of hours per month. Use machine learning (GPT-4) to automate any website. Use natural language to request automation tasks like gathering leads, scraping data, pushing buttons, and sending data to Google Sheets. Schedule hourly, daily, weekly, or monthly tasks. Cheatlayer will open the browser tab, perform your work, then close itself for you on schedule. Turn any website into an API and save hundreds of hours per month. To generate code using machine learning, click "Generate GPT" and write using natural words what you want the script to do. Hover over the "Run" button on any script and click "Edit" to edit any script. You can also click the handwriting button next to CheatLayer to open the editor for a new script. If you want to schedule scripts to run hours/daily/weekly/monthly, hover over the "Run" button next to any script, then select the schedule option.
    Starting Price: $49 per month
  • 26
    Hivelocity

    Hivelocity

    Hivelocity

    Offering 24x7x365 phone support. Hivelocity offers predictable costs and superior full hardware performance with no noisy neighbors. API automation enables code controlled infrastructure scaling. Custom built servers, GPU servers and colocation also available. Dedicated servers are inherently more secure than a multi-tenant cloud or virtual environment. HIPAA and PCI compliance are easy to achieve on dedicated servers. Manage expansive infrastructure with ease using robust tooling such as managed services, instant deployment across the globe, DNS management, instant reloads, bandwidth monitoring, and more all from a lightning fast, mobile friendly control panel. Over come challenges faster with our tailored technical support experience. Unlike the big clouds and public hosting providers, you have direct access to our team of highly talented techs, network engineers, developers, and executives ready to help overcome any challenges standing in the way of your strategic objective
  • 27
    Infomaniak

    Infomaniak

    Infomaniak Network

    Infomaniak is a major cloud player in Europe and the leading developer of web technologies in Switzerland. From the design of data centers and products to the orchestration of cloud infrastructures, Infomaniak is a Swiss cloud player that controls its value chain from end to end and is exclusively owned by its employees. This independence enables it to guarantee the security, confidentiality and sovereignty of the data of more than one million users in more than 208 countries. At the heart of Europe in Geneva and Winterthur, Infomaniak develops all the solutions that companies need to ensure their online visibility and sustainable development.
  • 28
    EnConnect
    AI-powered digital communication product featuring an AI assistant, secure live chat and video calls to streamline interactions, with the added benefit of no human intervention needed. EnConnect enables safe and real-time communication between organizations and clients with digital assistant and secure messaging services. EnConnect’s live chat and high-definition video call capabilities take customer contacts to a new level of security and efficiency.
  • 29
    VidScribe AI

    VidScribe AI

    Teknikforce

    VidScribe AI is a powerful AI-based software that can translate, transcribe, redub, and add subtitles to your videos in 100s of languages. This software can bring free traffic for you from the places you have never tapped before. VidScribe can translate your videos into any language you want, not only the text but also the audio. It is easier to rank on local language SERPs with subtitled & redubbed videos. Features of VidScribe AI: * Automatically uploads your videos directly to other social media platforms. * 100% editable. Modify anytime you want. * Get natural sounding speech in multiple languages. * Includes powerful training that shows how to rank on top. * Feed it with any YouTube URL or video and you’ll get your output within minutes. * No need for waiting! Get your videos translated immediately. * Automatically subtitles your videos with high-visibility in multiple colors.
    Starting Price: $37/year
  • 30
    Talkatoo

    Talkatoo

    Talkatoo

    Talkatoo is a voice-enabled AI tool designed to integrate effortlessly with your workflow, transforming speech to text using specialized vocabularies. You focus on patient care; we handle the technology. Built to be affordable and tailored for clinics, Talkatoo helps you reclaim valuable time throughout your day. With processing speeds over 200 words per minute—five times faster than typing—and a built-in medical dictionary. Our key features—Auto-SOAP records, Desktop Dictation, and the AI Assistant empower you to streamline tasks with ease. Record entire appointments to generate formatted SOAP notes instantly, dictate into any application from notes to email, and use the AI Assistant to create discharge instructions, translate documents, and more. Simply download, click, and start speaking, no tech expertise needed.
    Starting Price: $117 per month
MongoDB Logo MongoDB