Alternatives to DeePhi Quantization Tool

Compare DeePhi Quantization Tool alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to DeePhi Quantization Tool in 2026. Compare features, ratings, user reviews, pricing, and more from DeePhi Quantization Tool competitors and alternatives in order to make an informed decision for your business.

  • 1
    Latent AI

    Latent AI

    We take the hard work out of AI processing on the edge. The Latent AI Efficient Inference Platform (LEIP) enables adaptive AI at the edge by optimizing for compute, energy and memory without requiring changes to existing AI/ML infrastructure and frameworks. LEIP is a modular, fully-integrated workflow designed to train, quantize, adapt and deploy edge AI neural networks. Latent AI believes in a vibrant and sustainable future driven by the power of AI and the promise of edge computing. Our mission is to deliver on the vast potential of edge AI with solutions that are efficient, practical, and useful. Latent AI helps a variety of federal and commercial organizations gain the most from their edge AI with an automated edge MLOps pipeline that creates ultra-efficient, compressed, and secured edge models at scale while also removing all maintenance and configuration concerns.
  • 2
    Deci

    Deci AI

    Easily build, optimize, and deploy fast & accurate models with Deci’s deep learning development platform powered by Neural Architecture Search. Instantly achieve accuracy & runtime performance that outperform SoTA models for any use case and inference hardware. Reach production faster with automated tools. No more endless iterations and dozens of different libraries. Enable new use cases on resource-constrained devices or cut up to 80% of your cloud compute costs. Automatically find accurate & fast architectures tailored for your application, hardware and performance targets with Deci’s NAS-based AutoNAC engine. Automatically compile and quantize your models using best-of-breed compilers and quickly evaluate different production settings.
  • 3
    Zebra by Mipsology
    Zebra by Mipsology is the ideal Deep Learning compute engine for neural network inference. Zebra seamlessly replaces or complements CPUs/GPUs, allowing any neural network to compute faster, with lower power consumption, at a lower cost. Zebra deploys swiftly, seamlessly, and painlessly without knowledge of underlying hardware technology, use of specific compilation tools, or changes to the neural network, the training, the framework, and the application. Zebra computes neural networks at world-class speed, setting a new standard for performance. Zebra runs on highest-throughput boards all the way to the smallest boards. The scaling provides the required throughput, in data centers, at the edge, or in the cloud. Zebra accelerates any neural network, including user-defined neural networks. Zebra processes the same CPU/GPU-based trained neural network with the same accuracy without any change.
  • 4
    TFLearn

    TFLearn

    TFLearn is a modular and transparent deep learning library built on top of TensorFlow. It was designed to provide a higher-level API to TensorFlow in order to facilitate and speed up experimentation while remaining fully transparent and compatible with it. Easy-to-use and easy-to-understand high-level API for implementing deep neural networks, with tutorials and examples. Fast prototyping through highly modular built-in neural network layers, regularizers, optimizers, and metrics. Full transparency over TensorFlow. All functions are built over tensors and can be used independently of TFLearn. Powerful helper functions to train any TensorFlow graph, with support for multiple inputs, outputs, and optimizers. Easy and beautiful graph visualization, with details about weights, gradients, activations and more. The high-level API currently supports most of the recent deep learning models, such as Convolutions, LSTM, BiRNN, BatchNorm, PReLU, Residual networks, and Generative networks.
  • 5
    NVIDIA TensorRT
    NVIDIA TensorRT is an ecosystem of APIs for high-performance deep learning inference, encompassing an inference runtime and model optimizations that deliver low latency and high throughput for production applications. Built on the CUDA parallel programming model, TensorRT optimizes neural network models trained on all major frameworks, calibrating them for lower precision with high accuracy, and deploying them across hyperscale data centers, workstations, laptops, and edge devices. It employs techniques such as quantization, layer and tensor fusion, and kernel tuning on all types of NVIDIA GPUs, from edge devices to PCs to data centers. The ecosystem includes TensorRT-LLM, an open source library that accelerates and optimizes inference performance of recent large language models on the NVIDIA AI platform, enabling developers to experiment with new LLMs for high performance and quick customization through a simplified Python API.
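
The phrase "calibrating them for lower precision" can be illustrated with a minimal sketch of symmetric 8-bit quantization using max-value calibration. This is not the TensorRT API, and TensorRT's own calibrators may choose the scale differently; the function names `calibrate_scale`, `quantize`, and `dequantize` and the sample weights are hypothetical and chosen only for illustration.

```python
def calibrate_scale(values, num_bits=8):
    """Pick a scale so the largest magnitude maps to the top of the signed range."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer step and clamp to the signed range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(qvalues, scale):
    """Map integer codes back to approximate floating-point values."""
    return [q * scale for q in qvalues]


# Hypothetical weights standing in for one layer of a trained network.
weights = [0.02, -1.27, 0.64, 0.9]
scale = calibrate_scale(weights)
codes = quantize(weights, scale)
restored = dequantize(codes, scale)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(codes, max_error)
```

With this scheme the rounding error per value is bounded by half the scale, which is why the choice of calibration range matters for accuracy.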
  • 6
    ThirdAI

    ThirdAI

    ThirdAI (pronunciation: /THərd ī/ Third eye) is a cutting-edge artificial intelligence startup building scalable and sustainable AI. The ThirdAI accelerator builds hash-based processing algorithms for training and inference with neural networks. The technology is a result of 10 years of innovation in finding efficient (beyond tensor) mathematics for deep learning. Our algorithmic innovation has demonstrated how we can make commodity x86 CPUs 15x or more faster than the most potent NVIDIA GPUs for training large neural networks. The demonstration has shaken the common belief in the AI community that specialized processors like GPUs are significantly superior to CPUs for training neural networks. Our innovation would not only benefit current AI training by shifting to lower-cost CPUs, but it should also allow the “unlocking” of AI training workloads on GPUs that were not previously feasible.
  • 7
    Xilinx

    Xilinx

    Xilinx’s AI development platform for AI inference on Xilinx hardware platforms consists of optimized IP, tools, libraries, models, and example designs. It is designed with high efficiency and ease of use in mind, unleashing the full potential of AI acceleration on Xilinx FPGA and ACAP. Supports mainstream frameworks and the latest models capable of diverse deep learning tasks. Provides a comprehensive set of pre-optimized models that are ready to deploy on Xilinx devices. You can find the closest model and start re-training for your applications! Provides a powerful open source quantizer that supports pruned and unpruned model quantization, calibration, and fine-tuning. The AI profiler provides layer-by-layer analysis to help locate bottlenecks. The AI library offers open source high-level C++ and Python APIs for maximum portability from edge to cloud. Efficient and scalable IP cores can be customized to meet the needs of many different applications.
  • 8
    Microsoft Cognitive Toolkit
    The Microsoft Cognitive Toolkit (CNTK) is an open-source toolkit for commercial-grade distributed deep learning. It describes neural networks as a series of computational steps via a directed graph. CNTK allows the user to easily realize and combine popular model types such as feed-forward DNNs, convolutional neural networks (CNNs) and recurrent neural networks (RNNs/LSTMs). CNTK implements stochastic gradient descent (SGD, error backpropagation) learning with automatic differentiation and parallelization across multiple GPUs and servers. CNTK can be included as a library in your Python, C#, or C++ programs, or used as a standalone machine-learning tool through its own model description language (BrainScript). In addition you can use the CNTK model evaluation functionality from your Java programs. CNTK supports 64-bit Linux or 64-bit Windows operating systems. To install you can either choose pre-compiled binary packages, or compile the toolkit from the source provided in GitHub.
  • 9
    Mirai

    Mirai

    Mirai is a developer-focused on-device AI infrastructure platform designed to convert, optimize, and run machine learning models directly on Apple devices with high performance and privacy. It provides a unified pipeline that enables teams to convert and quantize models, benchmark them, distribute them, and execute inference locally. It is built specifically for Apple Silicon and aims to deliver near-zero latency, zero inference cost, and full data privacy by keeping sensitive processing on the user’s device. Through its SDK and inference engine, developers can integrate AI features into applications quickly, using hardware-aware optimizations that unlock the full power of the GPU and Neural Engine. Mirai also includes dynamic routing capabilities that automatically decide whether a request should run locally or in the cloud based on latency, privacy, or workload requirements.
  • 10
    NeuroIntelligence
    NeuroIntelligence is a neural networks software application designed to assist neural network, data mining, pattern recognition, and predictive modeling experts in solving real-world problems. NeuroIntelligence features only proven neural network modeling algorithms and neural net techniques; software is fast and easy-to-use. Visualized architecture search, neural network training and testing. Neural network architecture search, fitness bars, network training graphs comparison. Training graphs, dataset error, network error, weights and errors distribution, neural network input importance. Testing, actual vs. output graph, scatter plot, response graph, ROC curve, confusion matrix. The interface of NeuroIntelligence is optimized to solve data mining, forecasting, classification and pattern recognition problems. You can create a better solution much faster using the tool's easy-to-use GUI and unique time-saving capabilities.
  • 11
    Luminal

    Luminal

    Luminal is a machine-learning framework built for speed, simplicity, and composability, focusing on static graphs and compiler-based optimization to deliver high performance even for complex neural networks. It compiles models into minimal “primops” (only 12 primitive operations) and then applies compiler passes to replace those with device-specific optimized kernels, enabling efficient execution on GPU or other backends. It supports modules (building blocks of networks with a standard forward API) and the GraphTensor interface (typed tensors and graphs at compile time) for model definition and execution. Luminal’s core remains intentionally small and hackable, with extensibility via external compilers for datatypes, devices, training, quantization, and more. Quick-start guidance shows how to clone the repo, build a “Hello World” example, or run a larger model like LLaMA 3 using GPU features.
  • 12
    NVIDIA Modulus
    NVIDIA Modulus is a neural network framework that blends the power of physics in the form of governing partial differential equations (PDEs) with data to build high-fidelity, parameterized surrogate models with near-real-time latency. Whether you’re looking to get started with AI-driven physics problems or designing digital twin models for complex non-linear, multi-physics systems, NVIDIA Modulus can support your work. Offers building blocks for developing physics machine learning surrogate models that combine both physics and data. The framework is generalizable to different domains and use cases—from engineering simulations to life sciences and from forward simulations to inverse/data assimilation problems. Provides parameterized system representation that solves for multiple scenarios in near real time, letting you train once offline to infer in real time repeatedly.
  • 13
    FriendliAI

    FriendliAI

    FriendliAI is a generative AI infrastructure platform that offers fast, efficient, and reliable inference solutions for production environments. It provides a suite of tools and services designed to optimize the deployment and serving of large language models (LLMs) and other generative AI workloads at scale. Key offerings include Friendli Endpoints, which allow users to build and serve custom generative AI models, saving GPU costs and accelerating AI inference. It supports seamless integration with popular open source models from the Hugging Face Hub, enabling lightning-fast, high-performance inference. FriendliAI's cutting-edge technologies, such as Iteration Batching, Friendli DNN Library, Friendli TCache, and Native Quantization, contribute to significant cost savings (50–90%), reduced GPU requirements (6× fewer GPUs), higher throughput (10.7×), and lower latency (6.2×).
  • 14
    Auria Pro

    WaveMachine Labs

    Auria Pro is a comprehensive music production suite designed for the iPad, offering a range of professional features to facilitate high-quality music creation. It supports unlimited audio and MIDI tracks, allowing for extensive project development. The built-in Lyra sampler supports SFZ, EXS, and SF2 formats, enabling playback of large instruments with ease. Included are FabFilter's One and Twin2 synthesizers, providing a variety of sound design options. Auria Pro features real-time audio warping using élastique Pro v3, facilitating tempo and pitch adjustments. The MIDI sequencer includes a piano roll editor, real-time quantizing, and groove templates, enhancing MIDI editing capabilities. The app offers a flexible mixing environment with up to 32 buses, six aux sends, and a 64-bit double-precision floating-point mixing engine. It supports third-party plug-ins via in-app purchases and includes built-in effects like convolution reverb and classic reverb.
    Starting Price: $49.99 one-time payment
  • 15
    voyage-3-large
    Voyage AI has unveiled voyage-3-large, a cutting-edge general-purpose and multilingual embedding model that leads across eight evaluated domains, including law, finance, and code, outperforming OpenAI-v3-large and Cohere-v3-English by averages of 9.74% and 20.71%, respectively. Enabled by Matryoshka learning and quantization-aware training, it supports embeddings of 2048, 1024, 512, and 256 dimensions, along with multiple quantization options such as 32-bit floating point, signed and unsigned 8-bit integer, and binary precision, significantly reducing vector database costs with minimal impact on retrieval quality. Notably, voyage-3-large offers a 32K-token context length, surpassing OpenAI's 8K and Cohere's 512 tokens. Evaluations across 100 datasets in diverse domains demonstrate its superior performance, with flexible precision and dimensionality options enabling substantial storage savings without compromising quality.
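
The storage savings mentioned above follow from simple arithmetic on dimension count and bits per dimension. The sketch below works that out for the dimensions and precisions named in the description; it assumes 32, 8, and 1 bits per dimension for float, 8-bit integer, and binary embeddings, and it ignores any index or metadata overhead a real vector database would add. The helper name `bytes_per_vector` is hypothetical.

```python
# Bits stored per embedding dimension for each precision named in the description.
BITS_PER_DIM = {"float32": 32, "int8": 8, "uint8": 8, "binary": 1}


def bytes_per_vector(dims, precision):
    """Raw payload size of one embedding, excluding any database overhead."""
    return dims * BITS_PER_DIM[precision] // 8


baseline = bytes_per_vector(2048, "float32")
for dims in (2048, 1024, 512, 256):
    for precision in ("float32", "int8", "binary"):
        size = bytes_per_vector(dims, precision)
        ratio = baseline // size
        print(f"{dims:>4} dims, {precision:<7}: {size:>5} bytes "
              f"(baseline / size = {ratio})")
```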
  • 16
    Neural Designer
    Neural Designer is a powerful software tool for developing and deploying machine learning models. It provides a user-friendly interface that allows users to build, train, and evaluate neural networks without requiring extensive programming knowledge. With a wide range of features and algorithms, Neural Designer simplifies the entire machine learning workflow, from data preprocessing to model optimization. It supports various data types, including numerical, categorical, and text, making it versatile across domains. Neural Designer also offers automatic model selection and hyperparameter optimization, enabling users to find the best model for their data with minimal effort. Finally, its intuitive visualizations and comprehensive reports make it easier to interpret and understand the model's performance.
    Starting Price: $2495/year (per user)
  • 17
    vLLM

    vLLM

    vLLM is a high-performance library designed to facilitate efficient inference and serving of Large Language Models (LLMs). Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry. It offers state-of-the-art serving throughput by efficiently managing attention key and value memory through its PagedAttention mechanism. It supports continuous batching of incoming requests and utilizes optimized CUDA kernels, including integration with FlashAttention and FlashInfer, to enhance model execution speed. Additionally, vLLM provides quantization support for GPTQ, AWQ, INT4, INT8, and FP8, as well as speculative decoding capabilities. Users benefit from seamless integration with popular Hugging Face models, support for various decoding algorithms such as parallel sampling and beam search, and compatibility with NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs, and more.
  • 18
    Neuralhub

    Neuralhub

    Neuralhub is a system that makes working with neural networks easier, helping AI enthusiasts, researchers, and engineers to create, experiment, and innovate in the AI space. Our mission extends beyond providing tools; we're also creating a community, a place to share and work together. We aim to simplify the way we do deep learning today by bringing all the tools, research, and models into a single collaborative space, making AI research, learning, and development more accessible. Build a neural network from scratch or use our library of common network components, layers, architectures, novel research, and pre-trained models to experiment and build something of your own. Construct your neural network with one click. Visually see and interact with every component in the network. Easily tune hyperparameters such as epochs, features, labels and much more.
  • 19
    Supervisely

    Supervisely

    The leading platform for the entire computer vision lifecycle. Iterate from image annotation to accurate neural networks 10x faster. With our best-in-class data labeling tools, transform your images, videos, and 3D point clouds into high-quality training data. Train your models, track experiments, visualize and continuously improve model predictions, and build custom solutions within a single environment. Our self-hosted solution guarantees data privacy, powerful customization capabilities, and easy integration into your technology stack. A turnkey solution for computer vision: multi-format data annotation and management, quality control at scale, and neural network training in an end-to-end platform. Inspired by professional video editing software and created by data scientists for data scientists, it is the most powerful video labeling tool for machine learning and more.
  • 20
    YandexART
    YandexART is a diffusion neural network by Yandex designed for image and video creation. This new neural network ranks as a global leader among generative models in terms of image generation quality. Integrated into Yandex services like Yandex Business and Shedevrum, it generates images and videos using the cascade diffusion method—initially creating images based on requests and progressively enhancing their resolution while infusing them with intricate details. The updated version of this neural network is already operational within the Shedevrum application, enhancing user experiences. YandexART fueling Shedevrum boasts an immense scale, with 5 billion parameters, and underwent training on an extensive dataset comprising 330 million pairs of images and corresponding text descriptions. Through the fusion of a refined dataset, a proprietary text encoder, and reinforcement learning, Shedevrum consistently delivers high-calibre content.
  • 21
    NVIDIA DIGITS

    NVIDIA DIGITS

    The NVIDIA Deep Learning GPU Training System (DIGITS) puts the power of deep learning into the hands of engineers and data scientists. DIGITS can be used to rapidly train highly accurate deep neural networks (DNNs) for image classification, segmentation and object detection tasks. DIGITS simplifies common deep learning tasks such as managing data, designing and training neural networks on multi-GPU systems, monitoring performance in real time with advanced visualizations, and selecting the best-performing model from the results browser for deployment. DIGITS is completely interactive so that data scientists can focus on designing and training networks rather than programming and debugging. Interactively train models using TensorFlow and visualize model architecture using TensorBoard. Integrate custom plug-ins for importing special data formats such as DICOM used in medical imaging.
  • 22
    Ludwig

    Uber AI

    Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks. Build custom models with ease: a declarative YAML configuration file is all you need to train a state-of-the-art LLM on your data. Support for multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures. Optimized for scale and efficiency: automatic batch size selection, distributed training (DDP, DeepSpeed), parameter efficient fine-tuning (PEFT), 4-bit quantization (QLoRA), and larger-than-memory datasets. Expert level control: retain full control of your models down to the activation functions. Support for hyperparameter optimization, explainability, and rich metric visualizations. Modular and extensible: experiment with different model architectures, tasks, features, and modalities with just a few parameter changes in the config. Think building blocks for deep learning.
  • 23
    Chainer

    Chainer

    A powerful, flexible, and intuitive framework for neural networks. Chainer supports CUDA computation. It only requires a few lines of code to leverage a GPU. It also runs on multiple GPUs with little effort. Chainer supports various network architectures including feed-forward nets, convnets, recurrent nets and recursive nets. It also supports per-batch architectures. Forward computation can include any control flow statements of Python without losing the ability to backpropagate. It makes code intuitive and easy to debug. Comes with ChainerRL, a library that implements various state-of-the-art deep reinforcement learning algorithms. Also, with ChainerCV, a collection of tools to train and run neural networks for computer vision tasks.
  • 24
    Cogniac

    Cogniac

    Cogniac’s no-code solution enables organizations to capitalize on the latest developments in Artificial Intelligence (AI) and convolutional neural networks to deliver superhuman operational performance. Cogniac’s AI machine vision platform enables enterprise customers to achieve Industry 4.0 standards through visual data management and automation. Cogniac helps organizations’ operations divisions deliver smart continuous improvement. The Cogniac user interface has been designed and built to be operated by a non-technical user. With simplicity at its heart, the drag-and-drop nature of the Cogniac platform allows subject matter experts to focus on the tasks that drive the most value. Cogniac’s platform can identify defects from as few as 100 labeled images. Once trained on 25 approved and 75 defective images, the Cogniac AI will deliver results that are comparable to a human subject matter expert within hours of set-up.
  • 25
    Sharky Neural Network

    SharkTime Software

    Sharky Neural Network is a Windows application providing a visual, interactive introduction to machine learning. This free software serves as a playground for experimenting with neural network classification in real-time. Instead of relying on static charts, Sharky offers a "live view" of the learning process. You can watch the network adjust its classification boundaries like a movie unfolding on your screen. Users can swap architectures and data shapes to see how topology affects results. The app uses the backpropagation algorithm with optional momentum to give you direct control over learning dynamics. Perfect for students and hobbyists, Sharky Neural Network makes hidden layers and data clustering intuitive. It is a lightweight tool that effectively bridges the gap between theory and practice.
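
The "backpropagation algorithm with optional momentum" mentioned above refers to a weight update in which a running velocity term is added to each gradient step. The sketch below shows that update rule on a one-parameter toy loss; it is not Sharky's code, and the function name `momentum_step`, the learning rate, and the loss f(w) = (w - 3)^2 are assumptions made for illustration. Setting `momentum=0.0` reduces it to plain gradient descent.

```python
def momentum_step(weights, grads, velocity, lr=0.1, momentum=0.9):
    """One gradient-descent update with classical momentum."""
    # The velocity remembers a decayed sum of past steps.
    new_velocity = [momentum * v - lr * g for v, g in zip(velocity, grads)]
    new_weights = [w + v for w, v in zip(weights, new_velocity)]
    return new_weights, new_velocity


def toy_gradient(weights):
    """Gradient of the toy loss f(w) = (w - 3)^2, standing in for backpropagation."""
    return [2.0 * (w - 3.0) for w in weights]


weights, velocity = [0.0], [0.0]
for _ in range(300):
    weights, velocity = momentum_step(weights, toy_gradient(weights), velocity)
print(weights)
```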
  • 26
    Darknet

    Darknet

    Darknet is an open-source neural network framework written in C and CUDA. It is fast, easy to install, and supports CPU and GPU computation. You can find the source on GitHub or you can read more about what Darknet can do. Darknet is easy to install with only two optional dependencies, OpenCV if you want a wider variety of supported image types, and CUDA if you want GPU computation. Darknet on the CPU is fast but it's like 500 times faster on GPU! You'll have to have an Nvidia GPU and you'll have to install CUDA. By default, Darknet uses stb_image.h for image loading. If you want more support for weird formats (like CMYK jpegs, thanks Obama) you can use OpenCV instead! OpenCV also allows you to view images and detections without having to save them to disk. Classify images with popular models like ResNet and ResNeXt. Recurrent neural networks are all the rage for time-series data and NLP.
  • 27
    Torch

    Torch

    Torch is a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation. The goal of Torch is to have maximum flexibility and speed in building your scientific algorithms while making the process extremely simple. Torch comes with a large ecosystem of community-driven packages in machine learning, computer vision, signal processing, parallel processing, image, video, audio and networking among others, and builds on top of the Lua community. At the heart of Torch are the popular neural network and optimization libraries which are simple to use, while having maximum flexibility in implementing complex neural network topologies. You can build arbitrary graphs of neural networks, and parallelize them over CPUs and GPUs in an efficient manner.
  • 28
    ConvNetJS

    ConvNetJS

    ConvNetJS is a JavaScript library for training deep learning models (neural networks) entirely in your browser. Open a tab and you're training. No software requirements, no compilers, no installations, no GPUs, no sweat. The library allows you to formulate and solve neural networks in JavaScript, and was originally written by @karpathy. However, the library has since been extended by contributions from the community and more are warmly welcome. The fastest way to obtain the library in a plug-and-play way if you don't care about developing is through this link to convnet-min.js, which contains the minified library. Alternatively, you can also choose to download the latest release of the library from GitHub. The file you are probably most interested in is build/convnet-min.js, which contains the entire library. To use it, create a bare-bones index.html file in some folder and copy build/convnet-min.js to the same folder.
  • 29
    OpenAI Jukebox
    We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artistic styles. We’re releasing the model weights and code, along with a tool to explore the generated samples. Provided with genre, artist, and lyrics as input, Jukebox outputs a new music sample produced from scratch. Jukebox produces a wide range of music and singing styles and generalizes to lyrics not seen during training. All the lyrics below have been co-written by a language model and OpenAI researchers. When conditioned on lyrics seen during training, Jukebox produces songs very different from the original songs it was trained on. We provide 12 seconds of audio to condition on and Jukebox completes the rest in a specified style. We chose to work on music because we want to continue to push the boundaries of generative models. Jukebox’s autoencoder model compresses audio to a discrete space, using a quantization-based approach called VQ-VAE.
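
The quantization step at the heart of a VQ-VAE can be sketched as a nearest-neighbor lookup into a codebook: each continuous latent vector is replaced by the index of its closest codebook entry. The example below is a minimal illustration of that lookup only, not Jukebox's implementation; the tiny two-dimensional codebook and latents are made-up values, and the names `nearest_code` and `vector_quantize` are hypothetical.

```python
def nearest_code(vector, codebook):
    """Return the index of the codebook entry closest to `vector` (squared L2)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))


def vector_quantize(latents, codebook):
    """Map every latent vector to a discrete code index."""
    return [nearest_code(v, codebook) for v in latents]


# Made-up codebook and encoder outputs for illustration.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
latents = [[0.9, 1.2], [0.1, -0.2], [-0.8, 0.7]]

codes = vector_quantize(latents, codebook)
reconstructed = [codebook[i] for i in codes]  # what a decoder would receive
print(codes, reconstructed)
```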
  • 30
    voyage-code-3
    Voyage AI introduces voyage-code-3, a next-generation embedding model optimized for code retrieval. It outperforms OpenAI-v3-large and CodeSage-large by an average of 13.80% and 16.81%, respectively, on a suite of 32 code retrieval datasets. It supports embeddings of 2048, 1024, 512, and 256 dimensions and offers multiple embedding quantization options, including float (32-bit), int8 (8-bit signed integer), uint8 (8-bit unsigned integer), binary (bit-packed int8), and ubinary (bit-packed uint8). With a 32K-token context length, it surpasses OpenAI's 8K and CodeSage Large's 1K context lengths. Voyage-code-3 employs Matryoshka learning to create embeddings with a nested family of various lengths within a single vector. This allows users to vectorize documents into a 2048-dimensional vector and later use shorter versions (e.g., 256, 512, or 1024 dimensions) without re-invoking the embedding model.
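
Using a shorter version of a stored Matryoshka embedding amounts to keeping a prefix of the vector. The sketch below shows that, plus a simple one-bit-per-dimension packing. It does not call the Voyage API; re-normalizing after truncation, thresholding at zero, and the most-significant-bit-first packing order are all assumptions made for illustration, and the 16-dimensional vector stands in for a real 2048-dimensional one. The names `truncate_embedding` and `pack_binary` are hypothetical.

```python
import math


def truncate_embedding(vec, dims):
    """Keep the first `dims` components and re-normalize to unit length."""
    head = vec[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm > 0 else head


def pack_binary(vec):
    """Map each component to one bit (1 if positive) and pack 8 bits per byte.

    Assumes len(vec) is a multiple of 8.
    """
    out = bytearray()
    for i in range(0, len(vec), 8):
        byte = 0
        for x in vec[i:i + 8]:
            byte = (byte << 1) | (1 if x > 0 else 0)
        out.append(byte)
    return bytes(out)


# Made-up 16-dimensional embedding standing in for a stored full-length vector.
full = [0.5, -0.5, 0.5, 0.5, -0.1, 0.2, -0.3, 0.4,
        0.1, 0.1, -0.2, 0.3, -0.4, 0.2, 0.1, -0.1]

short = truncate_embedding(full, 8)   # shorter version, no model call needed
packed = pack_binary(short)           # 8 dimensions stored in a single byte
print(len(short), packed)
```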
  • 31
    Fido

    Fido

    Fido is a light-weight, open-source, and highly modular C++ machine learning library. The library is targeted towards embedded electronics and robotics. Fido includes implementations of trainable neural networks, reinforcement learning methods, genetic algorithms, and a full-fledged robotic simulator. Fido also comes packaged with a human-trainable robot control system as described in Truell and Gruenstein. While the simulator is not in the most recent release, it can be found for experimentation on the simulator branch.
  • 32
    VisionSense
    Real-time computer vision and advanced image processing solution that leverages advanced models of convolutional neural networks. The top application of the product has been in building management, identity verification and fraud detection, manufacturing and quality control. Winjit is one of India’s leading technology providers with over a decade of experience in innovating engineering solutions across industries.
  • 33
    Google Cloud AI Infrastructure
    Options for every business to train deep learning and machine learning models cost-effectively. AI accelerators for every use case, from low-cost inference to high-performance training. Simple to get started with a range of services for development and deployment. Tensor Processing Units (TPUs) are custom-built ASICs to train and execute deep neural networks. Train and run more powerful and accurate models cost-effectively with faster speed and scale. A range of NVIDIA GPUs to help with cost-effective inference or scale-up or scale-out training. Leverage RAPIDS and Spark with GPUs to execute deep learning. Run GPU workloads on Google Cloud where you have access to industry-leading storage, networking, and data analytics technologies. Access CPU platforms when you start a VM instance on Compute Engine. Compute Engine offers a range of both Intel and AMD processors for your VMs.
  • 34
    Tenstorrent DevCloud
    We developed Tenstorrent DevCloud to give people the opportunity to try their models on our servers without purchasing our hardware. We are building Tenstorrent AI in the cloud so programmers can try our AI solutions. The first log-in is free; after that, you get connected with our team, who can help better assess your needs. Tenstorrent is a team of competent and motivated people that came together to build the best computing platform for AI and software 2.0. Tenstorrent is a next-generation computing company with the mission of addressing the rapidly growing computing demands for software 2.0. Headquartered in Toronto, Canada, Tenstorrent brings together experts in the field of computer architecture, basic design, advanced systems, and neural network compilers. Our processors are optimized for neural network inference and training. They can also execute other types of parallel computation. Tenstorrent processors comprise a grid of cores known as Tensix cores.
  • 35
    DeepCube

    DeepCube

    DeepCube

    DeepCube focuses on the research and development of deep learning technologies that result in improved real-world deployment of AI systems. The company’s numerous patented innovations include methods for faster and more accurate training of deep learning models and drastically improved inference performance. DeepCube’s proprietary framework can be deployed on top of any existing hardware in both datacenters and edge devices, resulting in over 10x speed improvement and memory reduction. DeepCube provides the only technology that allows efficient deployment of deep learning models on intelligent edge devices. After the deep learning training phase, the resulting model typically requires huge amounts of processing and consumes lots of memory. Due to the significant amount of memory and processing requirements, today’s deep learning deployments are limited mostly to the cloud.
  • 36
    IBM Watson Machine Learning Accelerator
    Accelerate your deep learning workload. Speed your time to value with AI model training and inference. With advancements in compute, algorithm and data access, enterprises are adopting deep learning more widely to extract and scale insight through speech recognition, natural language processing and image classification. Deep learning can interpret text, images, audio and video at scale, generating patterns for recommendation engines, sentiment analysis, financial risk modeling and anomaly detection. High computational power has been required to process neural networks due to the number of layers and the volumes of data to train the networks. Furthermore, businesses are struggling to show results from deep learning experiments implemented in silos.
  • 37
    AForge.NET

    AForge.NET

    AForge.NET

    AForge.NET is an open source C# framework designed for developers and researchers in the fields of Computer Vision and Artificial Intelligence: image processing, neural networks, genetic algorithms, fuzzy logic, machine learning, robotics, etc. Work on improving the framework is in constant progress, which means that new features and namespaces are added regularly. To follow its progress, you can track the source repository's log or visit the project discussion group for the latest information. The framework comes not only with different libraries and their sources, but also with many sample applications that demonstrate its use, and with documentation help files in HTML Help format.
  • 38
    InferKit

    InferKit

    InferKit

    InferKit offers a web interface and API for AI-based text generators. Whether you're a novelist looking for inspiration, or an app developer, there's something for you. InferKit's text generation tool takes text you provide and generates what it thinks comes next, using a state-of-the-art neural network. It's configurable and can produce any length of text on practically any topic. The tool can be used through either the web interface or the developer API. Get started by creating an account. Creative and fun uses of the network include writing stories or poetry. Other use cases might be marketing or auto-completion. The generator can only comprehend a certain amount of text at a time (currently at most 3000 characters), so if you give it a longer prompt, it won't use the beginning. The network is already trained and does not learn from the inputs you give it. Each request counts for a minimum of 100 characters.
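    The two limits in the description above (a 3000-character context and a 100-character minimum per request) can be illustrated with a minimal sketch. This is not InferKit's API: the function names are hypothetical, and exactly which characters the service counts per request is an assumption made only for illustration.

```python
# Limits quoted in the description; everything else here is hypothetical.
MAX_CONTEXT_CHARS = 3000   # "currently at most 3000 characters"
MIN_COUNTED_CHARS = 100    # "Each request counts for a minimum of 100 characters"


def effective_prompt(prompt: str) -> str:
    """Return the tail of the prompt that fits in the context.

    A longer prompt loses its beginning, as the description states.
    """
    return prompt[-MAX_CONTEXT_CHARS:]


def counted_chars(request_chars: int) -> int:
    """Apply the per-request minimum to a character count.

    Assumption: `request_chars` is whatever count the service meters.
    """
    return max(request_chars, MIN_COUNTED_CHARS)


if __name__ == "__main__":
    long_prompt = "intro " * 1000          # 6000 characters
    print(len(effective_prompt(long_prompt)))
    print(counted_chars(20), counted_chars(250))
```

    In practice this suggests placing the most important context at the end of a long prompt, since only the tail would be read.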
  • 39
    MaiaOS

    MaiaOS

    Zyphra Technologies

    Zyphra is an artificial intelligence company based in Palo Alto with a growing presence in Montreal and London. We’re building MaiaOS, a multimodal agent system combining advanced research in next-gen neural network architectures (SSM hybrids), long-term memory & reinforcement learning. We believe the future of AGI will involve a combination of cloud and on-device deployment strategies with an increasing shift toward local inference. MaiaOS is built around a deployment framework that maximizes inference efficiency for real-time intelligence. Our AI & product teams come from leading organizations and institutions including Google DeepMind, Anthropic, StabilityAI, Qualcomm, Neuralink, Nvidia, and Apple. We have deep expertise across AI models, learning algorithms, and systems/infrastructure with a focus on inference efficiency and AI silicon performance. Zyphra's team is committed to democratizing advanced AI systems.
  • 40
    Neuri

    Neuri

    Neuri

    We conduct and implement cutting-edge research on artificial intelligence to create real advantage in financial investment. Illuminating the financial market with ground-breaking neuro-prediction. We combine novel deep reinforcement learning algorithms and graph-based learning with artificial neural networks for modeling and predicting time series. Neuri strives to generate synthetic data emulating the global financial markets, testing it with complex simulations of trading behavior. We bet on the future of quantum optimization in enabling our simulations to surpass the limits of classical supercomputing. Financial markets are highly fluid, with dynamics evolving over time. As such we build AI algorithms that adapt and learn continuously, in order to uncover the connections between different financial assets, classes and markets. The application of neuroscience-inspired models, quantum algorithms and machine learning to systematic trading at this point is underexplored.
  • 41
    bigjpg

    bigjpg

    bigjpg

    Using the latest deep convolutional neural networks, bigjpg intelligently reduces noise and serration in images. This allows the images to be enlarged without losing quality. With other software and tools, such as PS, an enlarged image can still look fuzzy and have visible blur as well as noise. Our product uses neural networks with a special algorithm adjusted specifically for the images' lines and color. This makes the resulting image much higher quality. The colors are well kept, and there is almost no 'glitter' or doubling visible. More importantly, the noise, which seriously influences quality, cannot be seen in the resulting images. Anime images and illustrations are nearly perfectly processed, and colors, details, and edges are all well-kept. Regular photos are supported as well. The estimated remaining time will be shown once the process starts. Based on the original size & enlarging configurations, the time needed is different.
  • 42
    Synaptic

    Synaptic

    Synaptic

    Neurons are the basic unit of the neural network. They can be connected to another neuron or gate connections between other neurons. This allows you to create complex and flexible architectures. Trainers can take any given network regardless of its architecture and use any training set. It includes built-in tasks to test networks, like learning an XOR, completing a Discrete Sequence Recall task or an Embedded Reber Grammar test. Networks can be imported/exported to JSON, converted to workers or standalone functions. They can be connected to other networks or gate connections. The Architect includes built-in useful architectures such as multilayer perceptrons, multilayer long short-term memory networks (LSTM), liquid state machines and Hopfield networks. Networks can also be optimized, extended, exported to JSON, converted to Workers or standalone Functions, and cloned. A network can project a connection to another, or gate a connection between two other networks.
  • 43
    Predictive Suite

    Predictive Suite

    Predictive Dynamix

    Automated variable selection identifies key variables & variable interactions. Insightful visualization of data and model dynamics. Execution of batch commands. SQL queries and dataset browsing. Pre & post-processing for creating variables, constraining outputs, etc. Models easily deployed via ActiveX (i.e., OCX) controls or DLLs. Powerful modeling algorithms include regression, neural networks, self-organizing maps, dynamic clustering, decision trees, fuzzy logic, and genetic algorithms. Predictive Dynamix provides computational intelligence software for forecasting, predictive modeling, pattern recognition, classification, and optimization applications, across all industries. Modern neural networks are powerful computational structures for solving difficult problems involving forecasting and pattern recognition. Multi-layer perceptron neural networks have an architecture that allows multiple coefficients per input variable.
  • 44
    RODIN

    RODIN

    Microsoft

    This 3D avatar diffusion model is an AI system that automatically produces highly detailed 3D digital avatars. The generated avatars can be freely viewed in 360 degrees with unprecedented quality. The model significantly accelerates the traditionally sophisticated 3D modeling process and opens new opportunities for 3D artists. This 3D avatar diffusion model is trained to generate 3D digital avatars represented as neural radiance fields. We build on the state-of-the-art generative technique (diffusion models) for 3D modeling. We use tri-plane representation to factorize the neural radiance field of avatars, which can be explicitly modeled by diffusion models and rendered to images via volumetric rendering. The proposed 3D-aware convolution brings the much-needed computational efficiency while preserving the integrity of diffusion modeling in 3D. The whole generation is a hierarchical process with cascaded diffusion models for multi-scale modeling.
  • 45
    EXAONE Deep
    EXAONE Deep is a series of reasoning-enhanced language models developed by LG AI Research, featuring parameter sizes of 2.4 billion, 7.8 billion, and 32 billion. These models demonstrate superior capabilities in various reasoning tasks, including math and coding benchmarks. Notably, EXAONE Deep 2.4B outperforms other models of comparable size, EXAONE Deep 7.8B surpasses both open-weight models of similar scale and the proprietary reasoning model OpenAI o1-mini, and EXAONE Deep 32B shows competitive performance against leading open-weight models. The repository provides comprehensive documentation covering performance evaluations, quickstart guides for using EXAONE Deep models with Transformers, explanations of quantized EXAONE Deep weights in AWQ and GGUF formats, and instructions for running EXAONE Deep models locally using frameworks like llama.cpp and Ollama.
  • 46
    Halcyon.ai

    Halcyon.ai

    Halcyon

    Threats like ransomware are designed to evade modern security tools, and just one miss can have a catastrophic impact on your organization. Halcyon is the first anti-ransomware and cyber resilience platform with automated encryption key capture and autonomous decryption capabilities to keep your operations running 24/7/365. Most security vendors are quick to update their solutions once a threat is seen in the real world. Without a dedicated anti-ransomware engine, the protection gap can range from 24 hours to several days or even weeks. Traditional rules-based EDR and other endpoint protection products rely on convolutional neural network AI models for detection that are generally too complex to quickly train on emerging threats.
  • 47
    DeepRecs

    DeepRecs

    Algonomy

    DeepRecs makes recommendations for 'Similar Products' and ‘Complete the Look’ using product images and without manual merchandising. It leverages convolutional neural networks to detect and extract feature vectors and graph visual similarities between products. Further, DeepRecs helps shoppers discover new, seasonal, niche, and long-tail products—that otherwise remain buried due to lack of historical data—using NLP algorithms that leverage catalog descriptions and other textual data.
  • 48
    Amazon SageMaker Feature Store
    Amazon SageMaker Feature Store is a fully managed, purpose-built repository to store, share, and manage features for machine learning (ML) models. Features are inputs to ML models used during training and inference. For example, in an application that recommends a music playlist, features could include song ratings, listening duration, and listener demographics. Features are used repeatedly by multiple teams and feature quality is critical to ensure a highly accurate model. Also, when features used to train models offline in batch are made available for real-time inference, it’s hard to keep the two feature stores synchronized. SageMaker Feature Store provides a secured and unified store for feature use across the ML lifecycle. Store, share, and manage ML model features for training and inference to promote feature reuse across ML applications. Ingest features from any data source including streaming and batch such as application logs, service logs, clickstreams, sensors, etc.
  • 49
    ChatGLM

    ChatGLM

    Zhipu AI

    ChatGLM-6B is an open-source, Chinese-English bilingual dialogue language model based on the General Language Model (GLM) architecture with 6.2 billion parameters. Combined with model quantization technology, users can deploy it locally on consumer-grade graphics cards (only 6GB of video memory is required at the INT4 quantization level). ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese Q&A and dialogue. After training on about 1T tokens of Chinese and English text, supplemented by supervised fine-tuning, feedback bootstrapping, reinforcement learning from human feedback, and other techniques, the 6.2-billion-parameter ChatGLM-6B is able to generate answers that are quite in line with human preferences.
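    The INT4 memory figure above can be sanity-checked with rough arithmetic. The sketch below is an estimate of weight storage only, under the assumption that every one of the 6.2 billion parameters is stored at the given bit width; it ignores activations, caches, and quantization metadata, which is presumably why the quoted requirement (6GB) is higher than the raw weight size. The function name is hypothetical.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    PARAMS = 6.2e9  # parameter count quoted in the description
    for bits in (16, 8, 4):
        gib = weight_memory_gib(PARAMS, bits)
        print(f"{bits:>2}-bit weights: about {gib:.1f} GiB")
```

    Halving the bit width halves the weight footprint, so 4-bit weights take a quarter of the space of 16-bit weights and leave headroom within a 6GB card for runtime overhead.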
  • 50
    EdgeCortix

    EdgeCortix

    EdgeCortix

    Breaking the limits in AI processors and edge AI inference acceleration. Where AI inference acceleration needs it all, more TOPS, lower latency, better area and power efficiency, and scalability, EdgeCortix AI processor cores make it happen. General-purpose processing cores, CPUs, and GPUs, provide developers with flexibility for most applications. However, these general-purpose cores don’t match up well with workloads found in deep neural networks. EdgeCortix began with a mission in mind: redefining edge AI processing from the ground up. With EdgeCortix technology including a full-stack AI inference software development environment, run-time reconfigurable edge AI inference IP, and edge AI chips for boards and systems, designers can deploy near-cloud-level AI performance at the edge. Think about what that can do for these and other applications. Finding threats, raising situational awareness, and making vehicles smarter.