Compare the Top Computer Vision Software in China as of October 2025

What is Computer Vision Software in China?

Computer vision software allows machines to interpret and analyze visual data from images or videos, enabling applications like object detection, image recognition, and video analysis. It utilizes advanced algorithms and deep learning techniques to understand and classify visual information, often mimicking human vision processes. These tools are essential in fields like autonomous vehicles, facial recognition, medical imaging, and augmented reality, where accurate interpretation of visual input is crucial. Computer vision software often includes features for image preprocessing, feature extraction, and model training to improve the accuracy of visual analysis. Overall, it enables machines to "see" and make informed decisions based on visual data, revolutionizing industries with automation and intelligence. Compare and read user reviews of the best Computer Vision software in China currently available using the table below. This list is updated regularly.

  • 1
    Ango Hub

    Ango Hub

    iMerit

    Ango Hub is a quality-focused, enterprise-ready data annotation platform for AI teams, available on cloud and on-premise. It supports computer vision, medical imaging, NLP, audio, video, and 3D point cloud annotation, powering use cases from autonomous driving and robotics to healthcare AI. Built for AI fine-tuning, RLHF, LLM evaluation, and human-in-the-loop workflows, Ango Hub boosts throughput with automation, model-assisted pre-labeling, and customizable QA while maintaining accuracy. Features include centralized instructions, review pipelines, issue tracking, and consensus across up to 30 annotators. With nearly twenty labeling tools—such as rotated bounding boxes, label relations, nested conditional questions, and table-based labeling—it supports both simple and complex projects. It also enables annotation pipelines for chain-of-thought reasoning and next-gen LLM training and enterprise-grade security with HIPAA compliance, SOC 2 certification, and role-based access controls.
    View Software
    Visit Website
  • 2
    Azure Computer Vision
    Boost content discoverability, automate text extraction, analyze video in real time, and create products that more people can use by embedding vision capabilities in your apps. Use visual data processing to label content with objects and concepts, extract text, generate image descriptions, moderate content, and understand people’s movement in physical spaces. No machine learning expertise is required.
  • 3
    Microsoft Copilot
    Meet your everyday AI companion for work and life. Work smarter, be more productive, boost creativity, and stay connected to the people and things in your life with Copilot—an AI companion that works everywhere you do and intelligently adapts to your needs. Copilot empowers you with smarter ways to be more productive, creative, and connected to the people and things that matter to you. Find just what you’re looking for. Get relevant answers to your questions. And shop online knowing you’ve gotten the best deal. Get answers to your questions, inspiration for your projects, and solutions for your to-do list. Transform your ideas effortlessly. Creating beautiful images and polished drafts is easy and fun. Whatever you’re into—browsing the web, searching for answers, exploring your creative potential, or coming up with more useful content, Copilot can help you uncover new possibilities.
    Starting Price: Free
  • 4
    Luxand

    Luxand

    Luxand

    Luxand FaceSDK is a cutting-edge, cross-platform software development kit designed to deliver high-performance face recognition, identification, and facial feature detection. Perfect for software developers worldwide, Luxand FaceSDK integrates seamlessly with web, desktop, and mobile applications, enabling face-based user authentication, as well as automatic face detection and recognition, elevating the user experience to new heights.
  • 5
    Roboflow

    Roboflow

    Roboflow

    Roboflow has everything you need to build and deploy computer vision models. Connect Roboflow at any step in your pipeline with APIs and SDKs, or use the end-to-end interface to automate the entire process from image to inference. Whether you’re in need of data labeling, model training, or model deployment, Roboflow gives you building blocks to bring custom computer vision solutions to your business.
    Starting Price: $250/month
  • 6
    Lightly

    Lightly

    Lightly

    Lightly selects the subset of your data with the biggest impact on model accuracy, allowing you to improve your model iteratively by using the best data for retraining. Get the most out of your data by reducing data redundancy, and bias, and focusing on edge cases. Lightly's algorithms can process lots of data within less than 24 hours. Connect Lightly to your existing cloud buckets and process new data automatically. Use our API to automate the whole data selection process. Use state-of-the-art active learning algorithms. Lightly combines active- and self-supervised learning algorithms for data selection. Use a combination of model predictions, embeddings, and metadata to reach your desired data distribution. Improve your model by better understanding your data distribution, bias, and edge cases. Manage data curation runs and keep track of new data for labeling and model training. Easy installation via a Docker image and cloud storage integration, no data leaves your infrastructure.
    Starting Price: $280 per month
  • 7
    SuperAnnotate

    SuperAnnotate

    SuperAnnotate

    SuperAnnotate is the world's leading platform for building the highest quality training datasets for computer vision and NLP. With advanced tooling and QA, ML and automation features, data curation, robust SDK, offline access, and integrated annotation services, we enable machine learning teams to build incredibly accurate datasets and successful ML pipelines 3-5x faster. By bringing our annotation tool and professional annotators together we've built a unified annotation environment, optimized to provide integrated software and services experience that leads to higher quality data and more efficient data pipelines.
  • 8
    Cogito

    Cogito

    Cogito Tech LLC

    Cogito Tech is a leading AI data solutions provider specializing in data labeling and annotation services. We deliver high-quality data for applications across computer vision, natural language processing (NLP), and content services. Our expertise extends to fine-tuning large language models (LLMs) through techniques like Reinforcement Learning from Human Feedback (RLHF), enabling rapid deployment and customization to meet business objectives. The company is headquartered in the United States and was featured in The Financial Times’ FT ranking: The Americas’ Fastest-Growing Companies 2025 and Everest Group’s report Data Annotation and Labeling (DAL) Solutions for AI/ML PEAK Matrix® Assessment 2024 Services offered by Cogito: • Image Annotation Service • AI-assisted Data Labeling Service • Medical Image Annotation • NLP & Audio Annotation Service • ADAS Annotation Services • Healthcare Training Data for AI • Audio & Video Transcription Services
    Starting Price: $25/Hour
  • 9
    viAct.ai

    viAct.ai

    viAct.ai

    viAct’s Smart Site Safety System (SSSS or 4S) is a simple & easy-to-use safety monitoring system using AI. viAct’s SSSS leverages the power of video analytics for workplace safety to improve safety performance in various jobsites. This safety monitoring system using AI collects real-time data from jobsites, transfers & stores it in viAct’s centralized management platform-viHUB. This enables stakeholders to have better grasp of real-time happenings in jobsite. Further, in case of an event of safety non-compliance, instant & real-time alerts are triggered by the dangerous situation alert system – enabling concerned stakeholders to take insightful action before it is too late. viAct’s smart site safety system can benefit the following industries: • Construction • Oil & Gas • Mining • Manufacturing • Transportation viAct’s Smart Site Safety System has been successfully serving various workplaces across various regions like Hong Kong, Singapore, Saudi Arabia, & Dubai.
    Starting Price: $100 per month
  • 10
    Samsara

    Samsara

    Samsara

    Prevent HOS Violations, Drivers Log Hours in Mobile App. Real-time visibility into drivers nearing or in violation gives you peace of mind that you're complying with the ELD mandate. Access a single system for Hours of Service logging, GPS tracking, dispatch, and maintenance with a complete platform that's FMCSA certified. A built-in WiFi hotspot keeps devices connected, even when there's no cell reception. Prevent compliance errors and speed up repairs with paperless DVIRs and a real-time maintenance dashboard. Simplify compliance and operations with a platform that combines GPS tracking, Hours of Service, paperless DVIRs, temperature monitoring, and more. Plug and play installation with zero-configuration allows you to get set up in 15 minutes. Samsara's hardware works with any vehicle, including cars, light and heavy trucks, buses, and more.
  • 11
    GPT-4V (Vision)
    GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image inputs) into large language models (LLMs) is viewed by some as a key frontier in artificial intelligence research and development. Multimodal LLMs offer the possibility of expanding the impact of language-only systems with novel interfaces and capabilities, enabling them to solve new tasks and provide novel experiences for their users. In this system card, we analyze the safety properties of GPT-4V. Our work on safety for GPT-4V builds on the work done for GPT-4 and here we dive deeper into the evaluations, preparation, and mitigation work done specifically for image inputs.
  • 12
    3DiVi Omni Platform
    The 3DiVi Omni Platform is an integrable face recognition system designed to analyze images and video streams, offering capabilities such as face detection, tracking, and identification. It supports features like face identification by control lists, recognition of masked or partially covered faces, and provides integration through an API and an admin web interface. The platform is optimized for high performance, capable of processing large-scale databases efficiently, and is suitable for various applications, including access control and video analytics. Deployment options are versatile, supporting both on-premise and cloud environments, with compatibility across multiple operating systems. Additionally, the Omni Platform offers services such as market analysis, implementation support, and flexible licensing models to assist clients throughout all stages of deployment.
  • 13
    Physna

    Physna

    Physna

    Our industry-leading geometric deep learning technology bridges the gap between physical objects and digital code. We are changing the future of engineering, industrial design and procurement, and putting the power in the hands of innovators and creators, one 3D model at a time. Using proprietary algorithms and advanced geometric deep learning technology, Physna codifies 3D models into detailed data that is understandable by software applications. By enabling 3D models to be treated and analyzed like other code, Physna’s technology bridges the gap between the physical world and digital world of software. Physna analyzes CAD models and other 3D models and creates a codified representation of the model, or the “Physna DNA” of that model. This digitized model allows Physna to show the detailed differences and similarities between models, even incomplete models or models with different file formats. See all components in complex assemblies, even parts contained in other parts.
  • 14
    Clarifai

    Clarifai

    Clarifai

    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for developing better, faster and stronger AI. We help our customers create innovative solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. The platform comes with the broadest repository of pre-trained, out-of-the-box AI models built with millions of inputs and context. Our models give you a head start; extending your own custom AI models. Clarifai Community builds upon this and offers 1000s of pre-trained models and workflows from Clarifai and other leading AI builders. Users can build and share models with other community members. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been recognized by leading analysts, IDC, Forrester and Gartner, as a leading computer vision AI platform. Visit clarifai.com
    Starting Price: $0
  • 15
    Alegion

    Alegion

    Alegion

    Alegion is the data labeling solution for enterprise-grade Machine Learning. We lead the industry in streaming, high-resolution, high-density video annotation, delivering accurately-annotated, model-ready data to train and validate ML models. Alegion provides both the platform and workforce to operate with quality at scale, processing structured and unstructured data including video, image, audio, and text. Our ML powered platform speeds up task completion by as much as 70%, including classless object tracking and single click smart polygon generation. Segmentation options include Keypoint, Bounding Box, Polyline, & Polygon segmentation, for image and video. Semantic Segmentation tools deliver seamless entity boundaries with pixel perfect accuracy. NLP and NER capabilities support text and audio classification and sentiment analysis. The platform is highly configurable to support hybrid use cases. Available via SaaS (Alegion Control), Managed Platform, and Managed Labeling Services.
    Starting Price: $5000
  • 16
    Nyckel

    Nyckel

    Nyckel

    Nyckel makes it easy to auto-label images and text using AI. We say ‘easy’ because trying to do classification through complex “we-do-it-all” AI/ML tools is hard. Especially if you’re not a machine learning expert. That’s why Nyckel built a platform that makes image and text classification easy for everyone. In just a few minutes, you can train an AI model to identify attributes of any image or text. Whether you’re sorting through images, moderating text, or needing real-time content labeling, Nyckel lets you build a custom classifier in just 5 minutes. And with our Classification API, you can auto-label at scale. Nyckel’s goal is to make AI-powered classification a practical tool for anyone. Learn more at Nyckel.com.
    Starting Price: Free
  • 17
    Eden AI

    Eden AI

    Eden AI

    Eden AI simplifies the use and deployment of AI technologies by providing a unique API connected to the best AI engines. Your time is precious: we take care of providing you with the AI engine best suited to your project and your data. No need to wait for weeks to change your AI engine. You can do it for free in a few seconds. We make sure to get you the cheapest provider while ensuring equal performance.
    Starting Price: $29/month/user
  • 18
    Deep Block

    Deep Block

    Omnis Labs

    Deep Block is the world's fastest AI-powered remote sensing imagery analysis solution. Train your own AI models to detect instantly any objects in large satellite, aerial, and drone images. Deep Block's no-code data labeling interface lets you achieve your MLOps projects in days, with no prior expertise. Instead of hiring your own in-house AI engineering team, anybody can start training their own AI. If you have a mouse and a keyboard, you can use our web-based platform, check our project library for inspiration, and choose between 9 out-of-the-box AI training modules (image segmentation, object detection, facial detection, facial comparison…) to get you started. The power of Deep Block is not limited to training your own AI. Once, your AI model is ready, Deep Block's high-performance AI models can deliver very accurate results when detecting objects (0.9 mAP) and with minimum false positives (0.9 recall).
    Starting Price: $10 per month
  • 19
    V7 Darwin
    V7 Darwin is a powerful AI-driven platform for labeling and training data that streamlines the process of annotating images, videos, and other data types. By using AI-assisted tools, V7 Darwin enables faster, more accurate labeling for a variety of use cases such as machine learning model training, object detection, and medical imaging. The platform supports multiple types of annotations, including keypoints, bounding boxes, and segmentation masks. It integrates with various workflows through APIs, SDKs, and custom integrations, making it an ideal solution for businesses seeking high-quality data for their AI projects.
    Starting Price: $150
  • 20
    SKY ENGINE

    SKY ENGINE

    SKY ENGINE AI

    SKY ENGINE AI is a simulation and deep learning platform that generates fully annotated, synthetic data and trains AI computer vision algorithms at scale. The platform is architected to procedurally generate highly balanced imagery data of photorealistic environments and objects and provides advanced domain adaptation algorithms. SKY ENGINE AI platform is a tool for developers: Data Scientists, ML/Software Engineers creating computer vision projects in any industry. SKY ENGINE AI is a Deep Learning environment for AI training in Virtual Reality with Sensors Physics Simulation & Fusion for any Computer Vision applications. SKY ENGINE AI Synthetic Data Generation makes Data Scientist life easier providing perfectly balanced datasets for any Computer Vision applications like object detection & recognition, 3D positioning, pose estimation and other sophisticated cases including analysis of multi-sensor data i.e., Radars, Lidars, Satellite, X-rays, and more.
  • 21
    Eyewey

    Eyewey

    Eyewey

    Train your own models, get access to pre-trained computer vision models and app templates, learn how to create AI apps or solve a business problem using computer vision in a couple of hours. Start creating your own dataset for detection by adding the images of the object you need to train. You can add up to 5000 images per dataset. After images are added to your dataset, they are pushed automatically into training. Once the model is finished training, you will be notified accordingly. You can simply download your model to be used for detection. You can also integrate your model to our pre-existing app templates for quick coding. Our mobile app which is available on both Android and IOS utilizes the power of computer vision to help people with complete blindness in their day-to-day lives. It is capable of alerting hazardous objects or signs, detecting common objects, recognizing text as well as currencies and understanding basic scenarios through deep learning.
    Starting Price: $6.67 per month
  • 22
    Alteia

    Alteia

    Alteia

    Alteia is a leading AI software platform that enables digital transformation through visual data analysis. It is supporting any industry-critical business needs, such as predictive maintenance, safety analysis, productivity management, and yield estimation, thanks to prebuilt, configurable, and high-value AI applications. Alteia combines computer vision and AI technologies that allow it to securely provide various industries with a unified database for all their visual data, and build high-value applications on top of it.
    Starting Price: $1500
  • 23
    Gravio

    Gravio

    Gravio

    Gravio enables new ways to connect and interact with your environment through the power of IoT, sensors, edge computing, computer vision, and AI without programming knowledge. Gravio is an easy-to-use software platform that runs on Windows, macOS, or Linux. You can connect to various inputs and outputs, including some bundled IoT sensors, computer vision/AI cameras, and MQTT or HTTP APIs. Gravio is very easy to use without software programming knowledge. Gravio unlocks the power of connected technologies by connecting sensors, input devices, cameras, and APIs within a space, then continuously gathering and sharing their information, enabling new ways to interact with, learn from and enhance a physical space. To create these experiences, Gravio provides a powerful low-code/no-code environment to enable entrepreneurs and organizations of all sizes, across industries, to build custom, connected experiences for new and existing environments.
    Starting Price: $4.99 per month
  • 24
    Chooch

    Chooch

    Chooch

    Chooch is an industry-leading, full lifecycle AI-powered computer vision platform that detects visuals, objects, and actions in video images and responds with pre-programmed actions using customizable alerts. It services the entire machine learning AI workflow from data augmentation tools, model training and hosting, edge device deployment, real-time inferencing, and smart analytics. This provides organizations with the ability to apply computer vision in the broadest variety of use cases from a single platform. Chooch AI Vision can be deployed quickly with ReadyNow models for the most common use cases like fall detection and workplace safety, face recognition, demographics, weapon detection, and more. Using existing cameras and edge infrastructure, models can be deployed to video streams detecting patterns and anomalies and witness real-time insights in seconds.
    Starting Price: Free
  • 25
    OpenCV

    OpenCV

    OpenCV

    OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in commercial products. Being a BSD-licensed product, OpenCV makes it easy for businesses to utilize and modify the code. The library has more than 2500 optimized algorithms, which includes a comprehensive set of both classic and state-of-the-art computer vision and machine learning algorithms. These algorithms can be used to detect and recognize faces, identify objects, classify human actions in videos, track camera movements, track moving objects, extract 3D models of objects, produce 3D point clouds from stereo cameras, and stitch images together to produce a high-resolution image of an entire scene, find similar images from an image database, remove red eyes from images taken using flash, follow eye movements, recognize scenery, etc.
    Starting Price: Free
  • 26
    Vertex AI Vision
    Easily build, deploy, and manage computer vision applications with a fully managed, end-to-end application development environment that reduces the time to build computer vision applications from days to minutes at one-tenth the cost of current offerings. Quickly and conveniently ingest real-time video and image streams at a global scale. Easily build computer vision applications using a drag-and-drop interface. Store and search petabytes of data with built-in AI capabilities. Vertex AI Vision includes all the tools needed to manage the life cycle of computer vision applications, across ingestion, analysis, storage, and deployment. Easily connect application output to a data destination, like BigQuery for analytics, or live streaming to drive real-time business actions. Ingest thousands of video streams from across the globe. With a monthly pricing model, enjoy up to one-tenth lower costs than previous offerings.
    Starting Price: $0.0085 per GB
  • 27
    ShelfWatch

    ShelfWatch

    ParallelDots

    Real-time shelf monitoring insights for your perfect store. ShelfWatch effectively comprehends the environment in which SKUs are merchandised. It provides actionable insights and creates a virtuous feedback loop which helps CPG companies in their perfect store execution. Image Recognition technology increases sales force productivity, improves shelf condition insights, and helps drive incremental sales. ShelfWatch gives a complete picture of your perfect store execution by calculating different KPIs that can be customized as per requirement. ShelfWatch’s mobile app takes images to assimilate analysis on product placement and visibility on the shelf. It also provides smart features like blur detection and angle or eye-level alignment while taking images. Images can be clicked even in a no-internet zone without hindrance and can be uploaded once an internet connection is available. ShelfWatch easily integrates with multiple SFA and DMS apps.
    Starting Price: Free
  • 28
    FieldDay

    FieldDay

    FieldDay

    Unlock the world of AI and Machine Learning right on your phone with FieldDay. We’ve taken the complexity out of creating machine learning models and turned it into an engaging, hands-on experience that’s as simple as using your camera. FieldDay allows you to create custom AI apps and embed them in your favourite tools, using just your phone. Feed FieldDay examples to learn from, and generate a custom model ready to be embedded in your app/project. A range of projects and apps driven by custom FieldDay machine learning models. Our range of integrations and export options simplifies the process of embedding a machine-learning model into the platform you prefer. With FieldDay, you can collect data directly from your phone’s camera. Our bespoke interface is designed for easy and intuitive annotation during collection, so you can build a custom dataset in no time. FieldDay lets you preview and correct your models in real-time.
    Starting Price: $19.99 per month
  • 29
    Voxel51

    Voxel51

    Voxel51

    FiftyOne by Voxel51 - the most powerful visual AI and computer vision data platform. Without the right data, even the smartest AI models fail. FiftyOne gives machine learning engineers the power to deeply understand and evaluate their visual datasets—across images, videos, 3D point clouds, geospatial, and medical data. With over 2.8 million open source installs and customers like Walmart, GM, Bosch, Medtronic, and the University of Michigan Health, FiftyOne is an indispensable tool for building computer vision systems that work in the real world, not just in the lab. FiftyOne streamlines visual data curation and model analysis with workflows to simplify the labor-intensive processes of visualizing and analyzing insights during data curation and model refinement—addressing a major challenge in large-scale data pipelines with billions of samples. Proven impact with FiftyOne: ⬆️30% increase in model accuracy ⏱️5+ months of development time saved 📈30% boost in productivity
    Starting Price: $0
  • 30
    Azure AI Custom Vision
    Create a custom computer vision model in minutes. Customize and embed state-of-the-art computer vision image analysis for specific domains with AI Custom Vision, part of Azure AI Services. Build frictionless customer experiences, optimize manufacturing processes, accelerate digital marketing campaigns, and more. No machine learning expertise is required. Set your model to perceive a particular object for your use case. Easily build your image identifier model using the simple interface. Start training your computer vision model by simply uploading and labeling a few images. The model tests itself on these and continually improves precision through a feedback loop as you add images. To speed development, use customizable, built-in models for retail, manufacturing, and food. See how Minsur, one of the world's largest tin mines, uses AI Custom Vision for sustainable mining. Rely on enterprise-grade security and privacy for your data and any trained models.
    Starting Price: $2 per 1,000 transactions
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next