Alternatives to Detectnix Vision

Compare Detectnix Vision alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Detectnix Vision in 2026. Compare features, ratings, user reviews, pricing, and more from Detectnix Vision competitors and alternatives in order to make an informed decision for your business.

  • 1
    Ango Hub

    iMerit

    Ango Hub is a quality-focused, enterprise-ready data annotation platform for AI teams, available on cloud and on-premise. It supports computer vision, medical imaging, NLP, audio, video, and 3D point cloud annotation, powering use cases from autonomous driving and robotics to healthcare AI. Built for AI fine-tuning, RLHF, LLM evaluation, and human-in-the-loop workflows, Ango Hub boosts throughput with automation, model-assisted pre-labeling, and customizable QA while maintaining accuracy. Features include centralized instructions, review pipelines, issue tracking, and consensus across up to 30 annotators. With nearly twenty labeling tools, such as rotated bounding boxes, label relations, nested conditional questions, and table-based labeling, it supports both simple and complex projects. It also enables annotation pipelines for chain-of-thought reasoning and next-gen LLM training, and it provides enterprise-grade security with HIPAA compliance, SOC 2 certification, and role-based access controls.
  • 2
    4K Image Compressor
    4K Image Compressor is a free cross-platform app that lets you optimize PNG, JPEG, HEIC, and WEBP images quickly and without quality loss. With 4K Image Compressor, you can reduce the size of an image by a chosen percentage or to a particular size in megabytes, kilobytes, or even bytes. PNG, JPEG, HEIC, and WEBP are supported for optimization and conversion, with more formats coming soon. Batch compression is available to save time. You can upload images in lossless and lossy formats.
  • 3
    Azure Computer Vision
    Boost content discoverability, automate text extraction, analyze video in real time, and create products that more people can use by embedding vision capabilities in your apps. Use visual data processing to label content with objects and concepts, extract text, generate image descriptions, moderate content, and understand people’s movement in physical spaces. No machine learning expertise is required.
  • 4
    inferdo

    inferdo

    Easily integrate our Computer Vision API to add some Machine Learning magic to your app. At inferdo, we pride ourselves on our ability to offer state-of-the-art, pre-trained deep learning models, and also on our ability to serve them efficiently at scale. That means we can pass the savings on to you! Simply provide an image URL to our API and we'll handle the rest. Use our Content Moderation API to flag possible inappropriate content in your images. This model is trained to detect nudity and NSFW content in images, both real and drawn. Check out our API cost comparisons against our competitors. Use our Image Labeling API to add semantic labels to your images. This model is trained to classify thousands of unique labels across a wide variety of categories. Use our Face Detection API to detect the location of human faces in your images. Need more information? Then use our Face Details API to detect faces, gender, age, and other facial features.
    Starting Price: $0.0005 per month
  • 5
    Yandex Vision
    Yandex Vision OCR recognizes text in an image and outputs it along with automatic punctuation. The service supports and automatically identifies more than 50 languages. Extract standard fields and recognize text in templates and documents, e.g., passports, driver’s licenses, vehicle registration certificates, and license plates, with support for Russian and English, as well as combinations of handwritten and printed text. The service scans the table structure and outputs text in row and column coordinates. Capabilities include optical character recognition (OCR), document recognition, and license plate number recognition. Yandex Vision OCR allows you to work with JPEG, PNG, and PDF formats. File sizes should be no larger than 20 MB with no more than 300 pages per file. The service can scan images and find passports from 20 countries, driver’s licenses, vehicle registration documents, and license plates.
  • 6
    AnyWebP

    AnyWebP

    Although WebP has many advantages, it is still hindered by compatibility issues. Most offline image viewers, like the default Windows image viewer, cannot natively open WebP files, so you cannot preview or edit WebP images directly. The format is also difficult to share because it lacks widespread adoption. If you have hundreds of WebP images, opening them one by one is impractical. The best and most practical method is converting them all to standard JPEG or JPG images. AnyWebP has also greatly improved the image-to-WebP workflow, as it supports adding all possible image formats. You can add them all and convert them to WebP at one time. The whole process is fast, reliable, and professional. Convert .tiff/.psd/.xcf/.gif/.bmp/.tga/.miff/.dcm/.xpm/.pcx/.fits/.ppm/.pgm/.pfm/.mng/.hdr/.dds/.otb/.psb to WebP instantly.
  • 7
    Linker Vision

    Linker Vision

    Linker VisionAI Platform is a comprehensive, end-to-end solution for vision AI, encompassing simulation, training, and deployment to empower smart cities and enterprises. It comprises three core components: Mirra, for synthetic data generation using NVIDIA Omniverse and NVIDIA Cosmos; DataVerse, for data curation, annotation, and model training with NVIDIA NeMo and NVIDIA TAO; and Observ, for large-scale Vision Language Model (VLM) deployment with NVIDIA NIM. This integrated approach allows for the seamless transition from data simulation to real-world application, ensuring that AI models are robust and adaptable. Linker VisionAI Platform supports a range of applications, including traffic and transportation management, worker safety, disaster response, and more, by leveraging urban camera networks and AI to drive responsive decisions.
  • 8
    csDeveloper Image Converter
    Image Converter is an image and photo converter that lets you transform photos or images to other formats: JPG, JPEG, PDF, GIF, PNG, BMP, and WEBP. Converted images can be saved directly to the gallery and are easy to share in formats like .jpg, .pdf, .png, .jpeg, .bmp, .gif, and .webp. You can choose the output image format, edit the output file name, and convert an image without losing its quality or resolution. You can share directly from the view. Convert single images or whole image galleries. Manage all converted images directly from the gallery, where you can share or delete them. This is a very simple app for converting multiple images. You can convert any image into several formats and it's free to use: just select a single image or multiple images and convert them.
  • 9
    Store360

    Vision Group Retail

    Store360 is an AI-powered retail execution platform from Vision Group Retail that helps brands and retailers gain real-time visibility into what is actually happening on store shelves. It enables field representatives to capture shelf photos through a mobile app and receive immediate, data-driven insights on product assortment, pricing, promotions, and placement. Using advanced image recognition, it identifies products down to the SKU level, flags compliance issues such as out-of-stocks or display gaps, and guides teams on corrective actions before they leave the store. It automatically measures key retail KPIs like share of shelf, planogram compliance, and on-shelf availability, while custom dashboards provide visibility across stores, reps, and retail partners. Store360 is designed to replace manual audits by automating data collection and analysis, improving accuracy and speeding up decision-making across large retail networks.
  • 10
    Pixea

    ImageTasks

    Pixea is an image viewer for macOS with a nice minimal modern user interface. Pixea works great with JPEG, HEIC, PSD, RAW, WEBP, PNG, GIF, and many other formats. Provides basic image processing, including flip and rotate, showing a color histogram, EXIF, and other information. Supports keyboard shortcuts and trackpad gestures. Shows images inside archives, without extracting them. Supports JPEG, HEIC, GIF, PNG, TIFF, Photoshop (PSD), BMP, fax images, macOS and Windows icons, radiance images, and Google's WebP. RAW formats Leica DNG and RAW, Sony ARW, Olympus ORF, Minolta MRW, Nikon NEF, Fuji RAF, Canon CR2 and CRW, Hasselblad 3FR. Sketch files (preview only) and ZIP archives. Export JPEG, JPEG-2000, PNG, TIFF, and BMP. Pixea supports animated GIF, PNG, and WebP images.
  • 11
    Standard Vision OS
    Enabling autonomous checkout for brick & mortar retailers with our modern AI-powered computer vision platform. Just grab and go! Customers can walk in, grab what they need, and leave without waiting in line or stopping to scan and pay. Standard’s computer vision and AI-powered solution is the only one that can be quickly and easily installed in retailers’ existing stores. Standard’s technology is a giant leap forward for retailers who want autonomous checkout, but don’t want to build new stores to get it. Standard doesn't use any facial recognition or other biometrics. All of our deployments are on-premise to ensure maximum performance and security for retailers and their customers. Standard’s solution is camera-first, with no turnstiles or gates. That means simple and quick installs with no disruption to customers or the business. Standard believes good retail is predicated on happy customers having a great experience.
  • 12
    Passio

    Passio

    Our easy-to-use SDKs reach millions of users who use Passio every day to transform their health, homes, businesses, and lives. We help businesses transform their applications with real-time, on-device computer vision and AI-driven user experiences. Bring your paint and home improvement store into the homes of your customers and allow them to visualize and purchase your paint and home remodel products. Help your customers make better buying decisions by seeing your products in their homes in augmented reality and by using computer vision to identify their remodel scenarios, surface types, and surface conditions. Remodel AI comes with a flexible painter which takes advantage of the latest AR technology and offers you multiple methods for room scanning and paint visualization. It takes seconds to transform the room, and your users will be delighted to see their new environments in real time on their iOS and Android devices.
  • 13
    XnConvert
    XnConvert is a fast, powerful, and free cross-platform batch image converter. It allows you to automate editing of your photo collections: you can rotate, convert, and compress your images, photos, and pictures easily, and apply over 80 actions (like resize, crop, color adjustments, and filter). All common picture and graphics formats are supported (JPEG, TIFF, PNG, GIF, WebP, PSD, JPEG2000, JPEG-XL, OpenEXR, camera RAW, HEIC, PDF, DNG, CR2). You can save and reuse your presets for another batch image conversion. XnConvert is multi-platform: it is available for Windows, Mac, and Linux in both 32-bit and 64-bit editions. XnConvert is multilingual: it includes more than 20 different translations. It offers powerful features in an easy-to-use interface providing convenient drag & drop functionality. XnConvert is compatible with more than 500 formats and exports to about 70 different file formats.
  • 14
    Unicorn Render

    Unicorn Render

    Unicorn Render is a professional rendering software that enables users to produce stunning realistic pictures and achieve high-end rendering levels without any prior skills. It offers a user-friendly interface designed to provide everything needed to obtain amazing results with minimal controls. Available as a standalone application or as a plugin, Unicorn Render integrates advanced AI technology and professional visualization tools. The software supports GPU+CPU acceleration through deep learning photorealistic rendering technology and NVIDIA CUDA technology, allowing joint support for CUDA GPUs and multicore CPUs. It features real-time progressive physics illumination, a Metropolis Light Transport sampler (MLT), a caustic sampler, and native NVIDIA MDL material support. Unicorn Render's WYSIWYG editing mode ensures that 100% of editing can be done in final image quality, eliminating surprises in the production of the final image.
  • 15
    Kernel for Windows Data Recovery

    KernelApps Private Limited

    Kernel Windows Data Recovery is an advanced tool to recover accidentally deleted or lost Windows data like files, folders, confidential documents, emails, and multimedia files. With a powerful algorithm, this tool efficiently scans corrupt USB Drives, SD cards, Micro SD cards, Windows partitions, and other devices for data recovery. Three types of Scan mode are available for data recovery: Quick Scan, Deep Scan, and File Trace. Up to 2 GB of free Windows data recovery with the trial version. Recovers data from formatted, corrupt, and even password-protected storage devices with 100% accuracy. Restore large-sized data from bad sectors of USB Drives or Windows partitions without any size restrictions. Find specific files in storage devices and recover their data with the advanced Search feature of this recovery tool. Advanced file filter allows users to recover specific file formats like JPEG, DOC, and other formats.
  • 16
    SAFR

    SAFR from RealNetworks

    Unlock a new level of situational awareness with exceptionally accurate face recognition and additional face- and person-based computer vision features. SAFR delivers actionable insights that protect the health and safety of people everywhere. Designed as a standalone networked solution, SAFR SCAN provides SMB and enterprise-level users with uncompromised biometrics features and performance at an affordable price point. Its fast, frictionless throughput can authenticate up to 30 individuals per minute, making it ideal for high-volume applications in office building lobbies, professional offices, secured employee entrances and more. To ensure personal privacy, all enrolled and scanned biometric data is fully encrypted and does not contain any visual imagery of individuals’ faces. This helps to ensure that individuals' identities are protected, avoiding any liability issues related to new and emerging privacy protection mandates.
  • 17
    ccminer

    ccminer

    ccminer is an open-source project for CUDA compatible GPUs (nVidia). The project is compatible with both Linux and Windows platforms. This site is intended to share cryptocurrencies mining tools you can trust. Available open-source binaries will be compiled and signed by us. Most of these projects are open-source but could require technical abilities to be compiled correctly.
  • 18
    EyeRecognize

    EyeRecognize

    Our image and video recognition APIs are proven, highly scalable, and leverage deep learning technology that you can implement within your own applications without prior machine learning expertise. EyeRecognize’s suite of image and video recognition API services allows you to identify objects, people, text, scenes, and activities in images and videos, as well as detect any faces and NSFW content. Face Detection and Analysis: detect all faces in images and video and get attributes such as face location, gender, age, eyes, and even emotion. Text Detection: extract text from images such as license plates, street signs, advertising, and brand names. Identify NSFW ("Not Safe for Work") and other potentially inappropriate content across both image and video. The team behind EyeRecognize has been collectively developing artificial intelligence powered applications for over 40 years and first pioneered the use of machine learning to automate content moderation for social media.
  • 19
    Nyckel

    Nyckel

    Nyckel makes it easy to auto-label images and text using AI. We say ‘easy’ because trying to do classification through complex “we-do-it-all” AI/ML tools is hard. Especially if you’re not a machine learning expert. That’s why Nyckel built a platform that makes image and text classification easy for everyone. In just a few minutes, you can train an AI model to identify attributes of any image or text. Whether you’re sorting through images, moderating text, or needing real-time content labeling, Nyckel lets you build a custom classifier in just 5 minutes. And with our Classification API, you can auto-label at scale. Nyckel’s goal is to make AI-powered classification a practical tool for anyone. Learn more at Nyckel.com.
  • 20
    Scandit

    Scandit

    Scandit is the leader in smart data capture empowering workers, customers, and businesses by providing actionable insights and automating end-to-end processes. The Smart Data Capture platform enables smart devices, such as smartphones, handheld computers, drones, digital eyewear, robots, and fixed cameras to interact with physical items by capturing data from barcodes, text, IDs, and objects with unmatched speed, accuracy, and intelligence. Scandit’s advanced barcode scanning software turns smart devices into high-performance and cost-efficient smart scanning tools. With little to no integration effort, upgrading the effectiveness and capabilities of your scanning workflows is as simple as choosing the solution that fits into your IT environment, testing it and deploying it to users. Scandit barcode scanning software is built for businesses needing an advanced barcode scanning solution that deploys quickly and excels under challenging scanning environments.
  • 21
    Clear Scan

    Clear Scan

    Clear Scanner is the best scanning app, offering hassle-free work that saves a huge amount of both time and money. Get this free pocket scanner app on your smartphone, scan from anywhere in the world, and send the scanned image to anyone at any location. It delivers professional-quality results with multiple filter options. Brighter and clearer images make the contents more readable. Folders and subfolders let you manage your files and organize your documents better. Create an offline backup file, or sync your scans across devices. The app also offers various professional editing features and filters, even after the images are saved. You can also save an image under an appropriate name and reorder the scanned files, which makes it easier to find a file, document, image, or other scanned note. You can choose to email a specific document or an entire folder with faster processing speed.
  • 22
    Azure AI Custom Vision
    Create a custom computer vision model in minutes. Customize and embed state-of-the-art computer vision image analysis for specific domains with AI Custom Vision, part of Azure AI Services. Build frictionless customer experiences, optimize manufacturing processes, accelerate digital marketing campaigns, and more. No machine learning expertise is required. Set your model to perceive a particular object for your use case. Easily build your image identifier model using the simple interface. Start training your computer vision model by simply uploading and labeling a few images. The model tests itself on these and continually improves precision through a feedback loop as you add images. To speed development, use customizable, built-in models for retail, manufacturing, and food. See how Minsur, one of the world's largest tin mines, uses AI Custom Vision for sustainable mining. Rely on enterprise-grade security and privacy for your data and any trained models.
    Starting Price: $2 per 1,000 transactions
  • 23
    FonePaw Video Converter Ultimate
    This multifunctional software lets you convert, edit, and play videos, DVDs, and audio. You can also create your own videos or GIF images with it. You can convert one video at a time or add several video files to convert simultaneously. Equipped with NVIDIA CUDA and AMD APP acceleration technology, FonePaw Video Converter Ultimate can decode and encode videos on a CUDA-enabled graphics card, giving you fast, high-quality HD and SD video conversion without quality loss. You can experience 6X faster conversion speed, and multi-core processors are fully supported. This all-in-one video converter is capable of converting video, audio, and DVD files efficiently and even editing them with better effect.
    Starting Price: $39 one-time payment
  • 24
    SysTools Image Converter
    Expert-verified bulk image converter that converts multiple image types, such as .webp, .jpg, .jpeg, .jpe, .gif, .png, .bmp, .icon, .tiff, .emf, .exif, .wmf, .memorybmp, .jfif, .ico, .ccitt, & .tga, into 17+ file formats. Convert images to PDF, DOC, DOCX, HTML, TEXT (BASE64), & other formats. Support to export multiple images in bulk without losing their quality. Option to create and save all images in a single DOC or DOCX file. The tool lets users create a single file for all added images. Export images to JPG, JPEG, PNG, APNG, BMP, WEBP, GIF, TIFF, TIF, TGA, JPEG2000(J2K), & JPEG2000(JP2) in bulk. Move up and move down options to arrange images accordingly. Preview added images one by one before the image conversion process. Facility to add multiple images in a single DOC, DOCX, and HTML file. Manage page size and margin, and set page orientation. Preserve the image quality even after the image file conversion. Download the bulk image converter and install it on all Windows OS ver
    Starting Price: $29 one-time payment
  • 25
    Qwen2.5-VL

    Alibaba

    Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in length and can pinpoint relevant segments within them. Additionally, it accurately localizes objects in images by generating bounding boxes or points and provides stable JSON outputs for coordinates and attributes. The model also supports structured outputs for data like scanned invoices, forms, and tables, benefiting sectors such as finance and commerce. Available in base and instruct versions across 3B, 7B, and 72B sizes, Qwen2.5-VL is accessible through platforms like Hugging Face and ModelScope.
  • 26
    Digimarc Discover
    Scanning barcodes is faster and easier with the Digimarc Discover app. Scan Digimarc Barcode and all common retail barcodes for instant discovery in the store and on-the-go. Digimarc Discover is a free mobile app (iOS/Android) that scans Digimarc Barcode, DWCODE™, QR Codes and a variety of traditional retail barcodes. Powered by our Mobile SDK, the Digimarc Discover app connects consumers and store associates to brand-generated content. Digimarc Discover’s scanning engine is the Digimarc Mobile SDK, the most versatile barcode-scanning software available. Development kits available for Apple iOS, Google Android, and Microsoft Windows 10 are optimized to more efficiently scan the barcodes most commonly used in retail. Digimarc Discover features a full camera view for more scanning flexibility, along with a small badge on each card to make it easy to see what type of code was scanned. In addition, all past and present scans are stored in the app’s activity section for quick retrieval.
  • 27
    Darknet

    Darknet

    Darknet is an open-source neural network framework written in C and CUDA. It is fast, easy to install, and supports CPU and GPU computation. You can find the source on GitHub or you can read more about what Darknet can do. Darknet is easy to install with only two optional dependencies, OpenCV if you want a wider variety of supported image types, and CUDA if you want GPU computation. Darknet on the CPU is fast but it's like 500 times faster on GPU! You'll have to have an Nvidia GPU and you'll have to install CUDA. By default, Darknet uses stb_image.h for image loading. If you want more support for weird formats (like CMYK jpegs, thanks Obama) you can use OpenCV instead! OpenCV also allows you to view images and detections without having to save them to disk. Classify images with popular models like ResNet and ResNeXt. Recurrent neural networks are all the rage for time-series data and NLP.
  • 28
    Arclab Website Link Analyzer
    Arclab Website Link Analyzer spiders (scans) your website the same way as search engine robots. It scans each web page (or resource) for links, CSS, images, etc. and adds the URIs found to the processing queue. The fast and multi-threaded engine scans your whole website for possible problems and errors. Just press a button to start the scan and get a detailed analysis of your website. Using additional includes (e.g. subdomains for mobile devices) and excludes (e.g. protected subfolders) you can define exactly what pages and resources should be scanned. Website Link Analyzer is a software product for Windows PC and does not require a subscription to a web-based service. This means direct scans, unlimited checks, no subscriptions, and no recurring fees for checking your website. Broken links are quite a problem and look unprofessional to your website visitors. Furthermore, broken links can harm your page ranking in the search engine result pages (SERP).
  • 29
    Everseen

    Everseen

    Improve your customer experience through a holistic view of your entire business process. Track a day in the life of a product from the distribution center to the back of the store, shelf, through to the shopper's bags. Adaptive and orchestrated retail intelligence when and where you need it. Actioning the moments that matter most with customers, associates, and vendors. Everseen's Visual AI™ Platform is a rapid Application Builder integrating existing systems, applications, and sensors and applying AI to identify previously unseen and unmeasurable problems. Everseen Visual AI works across the entire supply chain to reduce shrink and increase perpetual inventory accuracy. Seamless integration to any camera, any device, any system, any sensor on any cloud, on any edge. Build large-scale data, video, and sensor pipelines. Visual AI™ and Data powering human-centric designed AI expert systems. Enterprise-wide problem mapping and linking.
  • 30
    nablet Video Search
    Effortlessly Detect Unauthorized Usage and Monitor Broadcast Contracts with precision using our cutting-edge software for effortless fragment identification in external videos. Are you tired of finding your hard work and creativity being used without permission in external videos? Put an end to unauthorized usage of your video with nablet Video Search, the ultimate solution for content creators, businesses, and broadcasters. With state-of-the-art technology and advanced algorithms, nablet Video Search scans videos, both local files and network streams, to detect and identify fragments of your own content with unparalleled accuracy.
  • 31
    Neurolabs

    Neurolabs

    Industry-leading technology powered by synthetic data for flawless retail execution. The new wave of vision technology for consumer packaged goods. Select from an extensive catalog of over 100,000 SKUs in the Neurolabs platform including top brands such as P&G, Nestlé, Unilever, Coca-Cola, and much more. Your field agents can upload multiple shelf images from mobile devices to our API which will automatically stitch the images together to generate the scene. SKU-level detection provides you with detailed information to compute retail execution KPIs such as out-of-shelf rate, shelf share percentage, competitor price comparison, and so much more! Discover how our cutting-edge image recognition technology can help you maximize store operations, enhance customer experience, and boost profitability. Implement a real-world deployment in less than 1 week. Access image recognition datasets for over 100,000 SKUs.
  • 32
    Picview

    Chitaner

    Picview is an image viewer for macOS with a nice minimal modern user interface. Picview works great with JPEG, HEIC, PSD, RAW, WEBP, PNG, GIF, and many other formats. Provides basic image processing, including flip and rotate, EXIF, and other information. Supports keyboard shortcuts and trackpad gestures. Most importantly, Picview works like Photos, Windows' default picture viewer: you can zoom in and out with the mouse wheel and switch between images with the arrow keys or the on-screen buttons.
  • 33
    Mistral Small 3.1
    Mistral Small 3.1 is a state-of-the-art, multimodal, and multilingual AI model released under the Apache 2.0 license. Building upon Mistral Small 3, this enhanced version offers improved text performance and advanced multimodal understanding, and supports an expanded context window of up to 128,000 tokens. It outperforms comparable models like Gemma 3 and GPT-4o Mini, delivering inference speeds of 150 tokens per second. Designed for versatility, Mistral Small 3.1 excels in tasks such as instruction following, conversational assistance, image understanding, and function calling, making it suitable for both enterprise and consumer-grade AI applications. Its lightweight architecture allows it to run efficiently on a single RTX 4090 or a Mac with 32GB RAM, facilitating on-device deployments. It is available for download on Hugging Face, accessible via Mistral AI's developer playground, and integrated into platforms like Gemini Enterprise Agent Platform, with availability on NVIDIA NIM.
  • 34
    GPT-4V (Vision)
    GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image inputs) into large language models (LLMs) is viewed by some as a key frontier in artificial intelligence research and development. Multimodal LLMs offer the possibility of expanding the impact of language-only systems with novel interfaces and capabilities, enabling them to solve new tasks and provide novel experiences for their users. In this system card, we analyze the safety properties of GPT-4V. Our work on safety for GPT-4V builds on the work done for GPT-4 and here we dive deeper into the evaluations, preparation, and mitigation work done specifically for image inputs.
  • 35
    CaelumOne

    CaelumOne

    The CaelumOne Enterprise Content Management System (ECM) is a groundbreaking solution developed to solve modern-day information management problems. The CaelumOne Document Management System (DMS) uses best-in-class technology and security to protect your documents against unauthorized access, outright loss, and unnecessary duplication. We have employed the strictest security standards and encryption technologies available to ensure that your documents are safely secured offline or in the cloud. Documents, images, and video files can be added either individually, dragged and dropped, or scanned to the system via email or a secure WebDAV link. They can also be added in bulk as a .zip file while retaining all original folder and subfolder structures.
  • 36
    RoboRealm

    RoboRealm

    RoboRealm

    RoboRealm is a Windows-based machine vision software designed to simplify vision programming and enable rapid prototyping with advanced modules. It features an intuitive GUI requiring no or low code, making it accessible for both casual users and serious robotic scientists. It supports hundreds of image processing modules and is camera agnostic, allowing for flexibility in hardware choices. Users can experience real-time parameter changes, and the software includes a fully supported server API for integration with other systems. RoboRealm accommodates multiple image sources and offers various output interfaces, including file, web, FTP, and email. Its plugin framework allows for the development of custom modules, and an active online community provides expert assistance. It enables the combination of modules through an easy-to-use pipeline to create tailored solutions for tasks such as surface defect detection, measurement, counting, detection, etc.
    Starting Price: $25 per month
  • 37
    Clarifai

    Clarifai

    Clarifai

    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for developing better, faster and stronger AI. We help our customers create innovative solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. The platform comes with the broadest repository of pre-trained, out-of-the-box AI models built with millions of inputs and context. Our models give you a head start in extending your own custom AI models. Clarifai Community builds upon this and offers thousands of pre-trained models and workflows from Clarifai and other leading AI builders. Users can build and share models with other community members. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been recognized by leading analysts, including IDC, Forrester and Gartner, as a leading computer vision AI platform. Visit clarifai.com
  • 38
    UPDF Converter
    UPDF Converter for Windows and Mac is an easy-to-use PDF converter with OCR. It allows you to convert PDF documents to other formats or extensions without losing formatting or layout. It supports converting a single PDF or dozens of PDFs in batch with one click. UPDF Converter is a powerful all-in-one converter for your PDF files. Key features: 1. Supported Conversion Formats: Convert PDF to fully editable Microsoft Office Word, Excel, PowerPoint and other formats, such as Image (PNG, JPEG, BMP, GIF, TIFF), HTML, XML, CSV, Text, PDF/A. It is an offline converter, which makes it safer and faster. 2. Convert Scanned Documents with OCR: UPDF supports converting scanned PDFs to editable and searchable text. It recognizes more than 15 languages, including English, French, German, Italian, Portuguese, Russian, Spanish, Catalan, Danish, Dutch, Norwegian, Polish, Romanian, Swedish, Slovenian, and Turkish.
  • 39
    Scrile AI
    Scrile AI is a white-label platform that lets you launch your own AI companion website with GPT-based chat and AI image generation. Create fully customizable characters with unique personalities, system prompts, and monetization settings. Users can chat, receive AI-generated images, and unlock premium content via tokens or subscriptions. The platform includes a powerful admin dashboard with revenue analytics, user management, and moderation tools. Scrile AI is fully hosted, GDPR-compliant, and supports NSFW features like geo-blocking and content blurring. Ideal for creators and entrepreneurs looking to monetize AI-based interactions—no coding required.
    Starting Price: $299/month
  • 40
    CleanBrowsing

    CleanBrowsing

    CleanBrowsing

    A modernized approach to DNS-based content filtering and security. Easily decide what should, and should not, be allowed on your internet. Effective for our kids, powerful for our business. CleanBrowsing is a DNS-based content filtering service that offers a safe way to browse the web without surprises. It intercepts domain requests and filters sites that should be blocked, based on your filtering needs. Our free Family filter, for example, blocks porn, obscene, and adult content, while still allowing Google, YouTube, Bing, DuckDuckGo and the rest of the web to load safely. Our free offering comprises three predefined filters for global consumption (Security, Adult, and Family). The Family filter blocks adult / obscene content and applies Safe Search filters to Google, Bing, Yandex, etc. The Security filter, however, focuses only on restricting access to malicious activity.
  • 41
    Command A Vision
    Command A Vision is Cohere’s multimodal AI solution built for enterprise use that combines image understanding with language capabilities to drive business outcomes while keeping compute costs low; it extends the Command family by adding vision comprehension, allowing organizations to interpret and act on visual content in concert with text, and integrates into workplace systems to surface insights, boost productivity, and enable more intelligent search and discovery. The offering is positioned alongside Cohere’s broader AI stack and emphasizes putting AI to work in real-world workflows, helping teams unify multimodal signals, extract actionable meaning from images and associated metadata, and surface relevant business intelligence without excessive infrastructure overhead. Command A Vision excels at understanding and analyzing a wide range of visual and multilingual data, including charts, graphs, tables, and diagrams.
  • 42
    JPG Converter

    JPG Converter

    TechGenCenter

    JPG Converter is an app for converting one or multiple images to JPEG format. PNG, GIF, WebP, and BMP images can be converted to JPEG, and all converted images are stored in the picture and JPEG folder. This all-in-one image converter takes only a few simple steps to convert your photos to JPG format. It can also transform photos or images to other extensions. The JPEG file extension can be set to either jpg or jpeg. You can select any JPG, PNG, GIF, WebP, or BMP image and convert it to JPG or JPEG without losing quality. The app also lets you choose the output image quality: low, normal, or high.
  • 43
    Fyma

    Fyma

    Fyma

    Our data and insights help you unlock the full potential of your commercial real estate and retail portfolio. Gain a deeper understanding of market trends, identify areas of opportunity, and make informed decisions based on insights. Fyma helps improve the customer experience, maximize sales and revenue, and enhance operational efficiency for shopping center operators. Our retail solution provides real-time analytics and alerts, historical analysis, and can integrate with other retail systems for seamless operations. Our mobility data solution is designed to provide comprehensive insights into transportation networks and patterns around your property. The platform collects and analyzes data on traffic volume, modal share, movement patterns and congestion - providing real-time updates and historical analysis. The system can detect available spaces and guide drivers to them, reducing congestion and improving the overall parking experience.
  • 44
    Cogito

    Cogito

    Cogito Tech LLC

    Cogito Tech is a leading AI data solutions provider specializing in data labeling and annotation services. We deliver high-quality data for applications across computer vision, natural language processing (NLP), and content services. Our expertise extends to fine-tuning large language models (LLMs) through techniques like Reinforcement Learning from Human Feedback (RLHF), enabling rapid deployment and customization to meet business objectives. The company is headquartered in the United States and was featured in The Financial Times’ FT ranking: The Americas’ Fastest-Growing Companies 2025 and Everest Group’s report Data Annotation and Labeling (DAL) Solutions for AI/ML PEAK Matrix® Assessment 2024. Services offered by Cogito: • Image Annotation Service • AI-assisted Data Labeling Service • Medical Image Annotation • NLP & Audio Annotation Service • ADAS Annotation Services • Healthcare Training Data for AI • Audio & Video Transcription Services
  • 45
    Kaizen OCR

    Kaizen OCR

    StepForward Solutions LLP

    Kaizen OCR is a fast and accurate text extraction tool. Turn any image or screenshot into editable text with Kaizen OCR, the lightweight and powerful OCR desktop software for Windows. Whether you’re scanning documents, extracting text from screenshots, or working with multilingual content, Kaizen OCR delivers speed, accuracy, and simplicity in one package.
  • 46
    Alegion

    Alegion

    Alegion

    Alegion is the data labeling solution for enterprise-grade machine learning. We lead the industry in streaming, high-resolution, high-density video annotation, delivering accurately annotated, model-ready data to train and validate ML models. Alegion provides both the platform and workforce to operate with quality at scale, processing structured and unstructured data including video, image, audio, and text. Our ML-powered platform speeds up task completion by as much as 70%, with features including classless object tracking and single-click smart polygon generation. Segmentation options include Keypoint, Bounding Box, Polyline, and Polygon segmentation for image and video. Semantic Segmentation tools deliver seamless entity boundaries with pixel-perfect accuracy. NLP and NER capabilities support text and audio classification and sentiment analysis. The platform is highly configurable to support hybrid use cases. Available via SaaS (Alegion Control), Managed Platform, and Managed Labeling Services.
  • 47
    Vizua

    Vizua

    Vizua

    Vizua is a free browser-based image editing suite with 90+ tools. Compress JPEG, PNG, WebP, and GIF files while preserving quality. Convert between formats including HEIC to JPG, WebP to PNG, and RAW to JPG. Resize, crop, rotate, add text, borders, and effects like blur and grayscale. AI-powered tools enable background removal and text extraction via OCR. Supports advanced formats like AVIF for efficient compression. Preset resizing for social media and YouTube thumbnails. Additional tools include EXIF metadata viewer, color picker, QR code generator, and image watermarking. All processing happens locally in the browser — no files are uploaded. No account required. Available in 16 languages.
  • 48
    Hocha

    Hocha

    Hocha

    Hocha is an AI-powered image generation and editing platform that enables users to transform uploaded images (JPEG, PNG, GIF, WEBP up to 7 MB) into high-quality 3D figures, stylized headshots, illustrations with varied character poses, or enhanced and refined images, all with a few clicks and minimal setup. It offers a free trial (no registration or login required) to experiment with preset prompts and built-in tools for 3D figure generation, headshot creation, illustration generation, and image editing. Generations usually complete in seconds, delivering professional-quality results ready for personal or, if you purchase a license, commercial use. Additional tools include a “Spanish Vocabulary Poster Generator,” which helps users create educational posters combining Spanish words with English translations. When you subscribe (or buy a one-time bundle), you receive full commercial-use rights for generated images, enabling their use in marketing, websites, ads, etc.
    Starting Price: $10 one-time payment
  • 49
    Visual Layer

    Visual Layer

    Visual Layer

    Visual Layer is a platform for working with large volumes of image and video data. It supports visual search, filtering, tagging, and dataset structuring across raw files, metadata, and labels. No code is required, and both technical and non-technical teams use it in production. Common applications include curating datasets for machine learning, auditing visual content for compliance, reviewing surveillance material, and preparing media for downstream platforms. The platform detects duplicates, mislabeled items, outliers, and low-quality files to improve data quality before model training or operational decision-making. It is model-agnostic, supports both cloud and on-premise deployment, and is built by the creators of Fastdup, the widely used open-source tool for visual deduplication.
    Starting Price: $200/month
  • 50
    IRISmart Security

    IRISmart Security

    IRIS Portable Scanners & Conversion Software

    Introducing IRISmart™ Security, software for Windows that boosts your registration processes. IRISmart™ Security was developed to make recording procedures simpler and more secure, particularly in the hotel sector, but also in all reception and customer service departments. It recognizes international official documents: ID cards, passports, driving licences, and more. Automatically rename your documents while specifying the export folder. Get indexed and compressed PDF files. Classify your documents on the fly, based on a predefined naming convention. Automatically sort them into the pre-set filing system. After scanned ID cards and passports have been processed, a daily folder is created. This folder contains a central Excel file (with automatic indexing of the extracted metadata), along with images of the passports, ID cards, and other scanned documents (.TIF format).
    Starting Price: $399 one-time payment