Business Software for Hugging Face - Page 2

Top Software that integrates with Hugging Face as of August 2025 - Page 2

  • 1
    Promptly

    Promptly

    Promptly

    Depending on the use case, pick an appropriate app type or template. Provide input and configuration required for the app, and choose how the output is rendered. Add your existing data from sources like files, website URLs, sitemaps, YouTube links, Google Drive, and Notion exports. Attach these data sources to your app in the app builder. Save and publish your app. Access it from the dedicated app page or embed it in your website using the embed code. Use our APIs to run the app from your applications. Promptly provides embeddable widgets that you can easily integrate into your website. Use these widgets to build conversational AI applications or to add a chatbot to your website. Customize the chatbot look and feel to match your website, including a logo for the bot.
    Starting Price: $99.99 per month
  • 2
    Lilac

    Lilac

    Lilac

    Lilac is an open source tool that enables data and AI practitioners to improve their products by improving their data. Understand your data with powerful search and filtering. Collaborate with your team on a single, centralized dataset. Apply best practices for data curation, like removing duplicates and PII to reduce dataset size and lower training cost and time. See how your pipeline impacts your data using our diff viewer. Clustering is a technique that automatically assigns categories to each document by analyzing the text content and putting similar documents in the same category. This reveals the overarching structure of your dataset. Lilac uses state-of-the-art algorithms and LLMs to cluster the dataset and assign informative, descriptive titles. Before we do advanced searching, like concept or semantic search, we can immediately use keyword search by typing a keyword in the search box.
    Starting Price: Free
  • 3
    Defense Unicorns

    Defense Unicorns

    Defense Unicorns

    Defense Unicorns provides continuous delivery for national security systems, spanning cloud, on-prem, and edge environments in land, sea, air, and space domains. UDS removes the complexity and inefficiencies of government secure technology stacks. Seamlessly integrating mission capabilities with an open source 800-53 compliant platform (70% NIST compliant) and Infrastructure as Code (IaC) for your target environment, UDS creates a secure, declarative software bundle repeatedly deployable to all your environments without vendor locking your mission. The Defense Unicorns platform is open source, accessible, avoids vendor lock-in, and provides data independence. Defense Unicorns accelerates ATO timelines by leveraging a declarative cybersecurity stack that can be deployed in any cloud, on-prem, or edge environment. Defense Unicorns specializes in platform technologies that enable rapid and reliable delivery of capabilities across a wide range of mission and technology areas.
    Starting Price: Free
  • 4
    Glowbom

    Glowbom

    Glowbom

    Create amazing apps with AI without knowing how to code. Export code to multiple platforms by sketching your ideas to life. Glowbom allows you to build games and apps for the most popular platforms out there. We are actively developing support for all the most commonly used platforms. With Glowbom, you can access and modify your app's source code, and write code with AI assistance to make your app truly unique. The desktop app guides you through the code and answers any coding questions. Glowbom can also turn everyday language or other programming languages into the one that Glowbom supports, making coding easier by providing multiplatform code generation.
    Starting Price: $40 per month
  • 5
    AIxBlock

    AIxBlock

    AIxBlock

    AIxBlock: The first unified and decentralized platform for end-to-end AI development and workflow automation - built natively on MCP. AIxBlock is a MCP-based, decentralized end-to-end AI development and workflow automation platform purpose-built for AI engineer teams. It empowers users to build, train, deploy AI models and build AI automation workflows using those models through a unified environment that integrates decentralized compute, models, datasets, and labeling resources - all at a fraction of the traditional cost. AIxBlock is the modular AI ecosystem - purpose-built for custom model creation, workflow automation, and open interoperability across MCP client tools like Cursor, Claude, WindSurf, etc.
    Starting Price: $19 per month
  • 6
    Lamini

    Lamini

    Lamini

    Lamini makes it possible for enterprises to turn proprietary data into the next generation of LLM capabilities, by offering a platform for in-house software teams to uplevel to OpenAI-level AI teams and to build within the security of their existing infrastructure. Guaranteed structured output with optimized JSON decoding. Photographic memory through retrieval-augmented fine-tuning. Improve accuracy, and dramatically reduce hallucinations. Highly parallelized inference for large batch inference. Parameter-efficient finetuning that scales to millions of production adapters. Lamini is the only company that enables enterprise companies to safely and quickly develop and control their own LLMs anywhere. It brings several of the latest technologies and research to bear that was able to make ChatGPT from GPT-3, as well as Github Copilot from Codex. These include, among others, fine-tuning, RLHF, retrieval-augmented training, data augmentation, and GPU optimization.
    Starting Price: $99 per month
  • 7
    CodeQwen

    CodeQwen

    Alibaba

    CodeQwen is the code version of Qwen, the large language model series developed by the Qwen team, Alibaba Cloud. It is a transformer-based decoder-only language model pre-trained on a large amount of data of codes. Strong code generation capabilities and competitive performance across a series of benchmarks. Supporting long context understanding and generation with the context length of 64K tokens. CodeQwen supports 92 coding languages and provides excellent performance in text-to-SQL, bug fixes, etc. You can just write several lines of code with transformers to chat with CodeQwen. Essentially, we build the tokenizer and the model from pre-trained methods, and we use the generate method to perform chatting with the help of the chat template provided by the tokenizer. We apply the ChatML template for chat models following our previous practice. The model completes the code snippets according to the given prompts, without any additional formatting.
    Starting Price: Free
  • 8
    Agenta

    Agenta

    Agenta

    Collaborate on prompts, evaluate, and monitor LLM apps with confidence. Agenta is a comprehensive platform that enables teams to quickly build robust LLM apps. Create a playground connected to your code where the whole team can experiment and collaborate. Systematically compare different prompts, models, and embeddings before going to production. Share a link to gather human feedback from the rest of the team. Agenta works out of the box with all frameworks (Langchain, Lama Index, etc.) and model providers (OpenAI, Cohere, Huggingface, self-hosted models, etc.). Gain visibility into your LLM app's costs, latency, and chain of calls. You have the option to create simple LLM apps directly from the UI. However, if you would like to write customized applications, you need to write code with Python. Agenta is model agnostic and works with all model providers and frameworks. The only limitation at present is that our SDK is available only in Python.
    Starting Price: Free
  • 9
    OpenLIT

    OpenLIT

    OpenLIT

    OpenLIT is an OpenTelemetry-native application observability tool. It's designed to make the integration process of observability into AI projects with just a single line of code. Whether you're working with popular LLM libraries such as OpenAI and HuggingFace. OpenLIT's native support makes adding it to your projects feel effortless and intuitive. Analyze LLM and GPU performance, and costs to achieve maximum efficiency and scalability. Streams data to let you visualize your data and make quick decisions and modifications. Ensures that data is processed quickly without affecting the performance of your application. OpenLIT UI helps you explore LLM costs, token consumption, performance indicators, and user interactions in a straightforward interface. Connect to popular observability systems with ease, including Datadog and Grafana Cloud, to export data automatically. OpenLIT ensures your applications are monitored seamlessly.
    Starting Price: Free
  • 10
    IllumiDesk

    IllumiDesk

    IllumiDesk

    Automate grading tasks and provide inline feedback for students. Collaborate effectively with your learners using discussion boards, mentions, and inline comments. real-time collaboration is coming soon. IllumiDesk's AI will help you create, maintain, and structure your content into interactive lessons. IllumiDesk's flexible integration options using the RESTful API and/or LTI v1.3 help leverage existing content and flows. Crafting remarkable courses can now be achievable in just minutes, not months. Mobilize your audiences with captivating content without devoting endless hours of work. Create a course effortlessly with AI in just a few clicks, tailoring it to your specific context, or start from scratch and build a course from the ground up. IllumiDesk's flexible integration options using the RESTful API and/or LTI v1.3 help leverage existing content and flows. IllumiDesk is the AI platform for instructors and content developers that helps teams create and monetize content.
    Starting Price: $240 per year
  • 11
    Qwen2

    Qwen2

    Alibaba

    Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud. Qwen2 is a series of large language models developed by the Qwen team at Alibaba Cloud. It includes both base language models and instruction-tuned models, ranging from 0.5 billion to 72 billion parameters, and features both dense models and a Mixture-of-Experts model. The Qwen2 series is designed to surpass most previous open-weight models, including its predecessor Qwen1.5, and to compete with proprietary models across a broad spectrum of benchmarks in language understanding, generation, multilingual capabilities, coding, mathematics, and reasoning.
    Starting Price: Free
  • 12
    Msty

    Msty

    Msty

    Chat with any AI model in a single click. No prior model setup experience is needed. Msty is designed to function seamlessly offline, ensuring reliability and privacy. For added flexibility, it also supports popular online model vendors, giving you the best of both worlds. Revolutionize your research with split chats. Compare and contrast multiple AI models' responses in real time, streamlining your workflow and uncovering new insights. Msty puts you in the driver's seat. Take your conversations wherever you want, and stop whenever you're satisfied. Replace an existing answer or create and iterate through several conversation branches. Delete branches that don't sound quite right. With delve mode, every response becomes a gateway to new knowledge, waiting to be discovered. Click on a keyword, and embark on a journey of discovery. Leverage Msty's split chat feature to move your desired conversation branches into a new split chat or a new chat session.
    Starting Price: $50 per year
  • 13
    Qwen2-VL

    Qwen2-VL

    Alibaba

    Qwen2-VL is the latest version of the vision language models based on Qwen2 in the Qwen model familities. Compared with Qwen-VL, Qwen2-VL has the capabilities of: SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. Understanding videos of 20 min+: Qwen2-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc. Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions. Multilingual Support: to serve global users, besides English and Chinese, Qwen2-VL now supports the understanding of texts in different languages inside images
    Starting Price: Free
  • 14
    Toddle

    Toddle

    Toddle

    You don’t have to reinvent the wheel. Install native packages with rich functionality and zero need to manage dependencies, no more broken plugins. Install pre-built packages built by a world-class developer community and accelerate your time to market, with no coding required. Design cool components, actions, or formulas, package them, and use them everywhere, no need to mess around with complicated formulas. Toddle is a visual web app builder that rivals custom code but is accessible to your entire team. Professional no-code web app editor, built-in live collaboration, version control, and hosting. Toddle is a visual programming platform for building modern web applications without writing code. You may want to host your Toddle creations yourself instead of relying on Toddle's infrastructure. We offer code export via web components. Live updates when you edit your component in the editor.
    Starting Price: $29 per month
  • 15
    OpenHands

    OpenHands

    All Hands AI

    We believe agentic technology is too important to be controlled by a few corporations. So we're building all our agents in the open on GitHub, under the MIT license. Our agents can do anything a human developer can, they write code, run commands, and use the web. We're partnering with AI safety experts like Invariant Labs to balance innovation with security. Thousands of developers are working together to build the AI-powered future they want to see. Our agents are compatible with any large language model provider.
    Starting Price: Free
  • 16
    Beeceptor

    Beeceptor

    Beeceptor

    Discover how Beeceptor can streamline your development workflow, and speed up API integrations and software delivery. Explore the range of use cases Beeceptor can solve for you. Host your API contracts on Beeceptor and unblock your teams. With Beeceptor, a mock API server is up and running in seconds, with no coding required. Don't wait for backend APIs to be developed or deployed. Connect to an API mock server and start integrating your code immediately. Reduce dependency on backend/API teams with Beeceptor. Get a named sub-domain and send an HTTP request. You can inspect and debug the req/res payloads, make them look pretty, and share them with your peers as API contracts. Define an entity path and let Beeceptor automatically set up six essential JSON REST APIs for your CRUD operations. This JSONPlaceholder alternative has a flexible schema, storage, and super easy setup. It's like patching unavailable APIs on an existing API server, helping you integrate faster.
    Starting Price: $10 per month
  • 17
    LLMWare.ai

    LLMWare.ai

    LLMWare.ai

    Our open source research efforts are focused both on the new "ware" ("middleware" and "software" that will wrap and integrate LLMs), as well as building high-quality, automation-focused enterprise models available in Hugging Face. LLMWare also provides a coherent, high-quality, integrated, and organized framework for development in an open system that provides the foundation for building LLM-applications for AI Agent workflows, Retrieval Augmented Generation (RAG), and other use cases, which include many of the core objects for developers to get started instantly. Our LLM framework is built from the ground up to handle the complex needs of data-sensitive enterprise use cases. Use our pre-built specialized LLMs for your industry or we can customize and fine-tune an LLM for specific use cases and domains. From a robust, integrated AI framework to specialized models and implementation, we provide an end-to-end solution.
    Starting Price: Free
  • 18
    ID Privacy AI

    ID Privacy AI

    ID Privacy AI

    At ID Privacy, we are shaping the future of AI with a focus on privacy-first solutions. Our mission is simple, to deliver cutting-edge AI technologies that empower businesses to innovate without compromising the security and trust of their users. ID Privacy AI delivers secure, adaptable AI models built with privacy at the core. We empower businesses across industries to harness advanced AI, whether optimizing workflows, enhancing customer AI chat experiences, or driving insights, while safeguarding data. Built under a cloak of stealth, the team at ID Privacy began meeting and formulating the plan for our AI as a service solution. Launched with multi-modal, multi-lingual capabilities and the deepest knowledge base on ad tech currently available anywhere. ID Privacy AI is focused on privacy-first AI development for businesses and enterprises. Empowering businesses with a flexible AI framework that protects data while solving complex challenges across any vertical.
    Starting Price: $15 per month
  • 19
    Maxim

    Maxim

    Maxim

    Maxim is an agent simulation, evaluation, and observability platform that empowers modern AI teams to deploy agents with quality, reliability, and speed. Maxim's end-to-end evaluation and data management stack covers every stage of the AI lifecycle, from prompt engineering to pre & post release testing and observability, data-set creation & management, and fine-tuning. Use Maxim to simulate and test your multi-turn workflows on a wide variety of scenarios and across different user personas before taking your application to production. Features: Agent Simulation Agent Evaluation Prompt Playground Logging/Tracing Workflows Custom Evaluators- AI, Programmatic and Statistical Dataset Curation Human-in-the-loop Use Case: Simulate and test AI agents Evals for agentic workflows: pre and post-release Tracing and debugging multi-agent workflows Real-time alerts on performance and quality Creating robust datasets for evals and fine-tuning Human-in-the-loop workflows
    Starting Price: $29/seat/month
  • 20
    Lunary

    Lunary

    Lunary

    Lunary is an AI developer platform designed to help AI teams manage, improve, and protect Large Language Model (LLM) chatbots. It offers features such as conversation and feedback tracking, analytics on costs and performance, debugging tools, and a prompt directory for versioning and team collaboration. Lunary supports integration with various LLMs and frameworks, including OpenAI and LangChain, and provides SDKs for Python and JavaScript. Guardrails to deflect malicious prompts and sensitive data leaks. Deploy in your VPC with Kubernetes or Docker. Allow your team to judge responses from your LLMs. Understand what languages your users are speaking. Experiment with prompts and LLM models. Search and filter anything in milliseconds. Receive notifications when agents are not performing as expected. Lunary's core platform is 100% open-source. Self-host or in the cloud, get started in minutes.
    Starting Price: $20 per month
  • 21
    DeepEval

    DeepEval

    Confident AI

    DeepEval is a simple-to-use, open source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence. The framework supports synthetic dataset generation with advanced evolution techniques and integrates seamlessly with popular frameworks, allowing for efficient benchmarking and optimization of LLM systems.
    Starting Price: Free
  • 22
    Marco-o1

    Marco-o1

    AIDC-AI

    Marco-o1 is a robust, next-generation AI model tailored for high-performance natural language processing and real-time problem-solving. It is engineered to deliver precise and contextually rich responses, combining deep language comprehension with a streamlined architecture for speed and efficiency. Marco-o1 excels in a variety of applications, including conversational AI, content creation, technical support, and decision-making tasks, adapting seamlessly to diverse user needs. With a focus on intuitive interactions, reliability, and ethical AI principles, Marco-o1 stands out as a cutting-edge solution for individuals and organizations seeking intelligent, adaptive, and scalable AI-driven tools. MCTS allows the exploration of multiple reasoning paths using confidence scores derived from softmax-applied log probabilities of the top-k alternative tokens, guiding the model to optimal solutions.
    Starting Price: Free
  • 23
    Teuken 7B

    Teuken 7B

    OpenGPT-X

    Teuken-7B is a multilingual, open source language model developed under the OpenGPT-X initiative, specifically designed to cater to Europe's diverse linguistic landscape. It has been trained on a dataset comprising over 50% non-English texts, encompassing all 24 official languages of the European Union, ensuring robust performance across these languages. A key innovation in Teuken-7B is its custom multilingual tokenizer, optimized for European languages, which enhances training efficiency and reduces inference costs compared to standard monolingual tokenizers. The model is available in two versions, Teuken-7B-Base, the foundational pre-trained model, and Teuken-7B-Instruct, which has undergone instruction tuning for improved performance in following user prompts. Both versions are accessible on Hugging Face, promoting transparency and collaboration within the AI community. The development of Teuken-7B underscores a commitment to creating AI models that reflect Europe's diversity.
    Starting Price: Free
  • 24
    Qwen2.5-Coder
    Qwen2.5-Coder-32B-Instruct has become the current SOTA open source code model, matching the coding capabilities of GPT-4o. While demonstrating strong and comprehensive coding abilities, it also possesses good general and mathematical skills. As of now, Qwen2.5-Coder has covered six mainstream model sizes to meet the needs of different developers. We explore the practicality of Qwen2.5-Coder in two scenarios, including code assistants and artifacts, with some examples showcasing the potential applications of Qwen2.5-Coder in real-world scenarios. Qwen2.5-Coder-32B-Instruct, as the flagship model of this open source release, has achieved the best performance among open source models on multiple popular code generation benchmarks and has competitive performance with GPT-4o. Code repair is an important programming skill. Qwen2.5-Coder-32B-Instruct can help users fix errors in their code, making programming more efficient.
    Starting Price: Free
  • 25
    NVIDIA TensorRT
    NVIDIA TensorRT is an ecosystem of APIs for high-performance deep learning inference, encompassing an inference runtime and model optimizations that deliver low latency and high throughput for production applications. Built on the CUDA parallel programming model, TensorRT optimizes neural network models trained on all major frameworks, calibrating them for lower precision with high accuracy, and deploying them across hyperscale data centers, workstations, laptops, and edge devices. It employs techniques such as quantization, layer and tensor fusion, and kernel tuning on all types of NVIDIA GPUs, from edge devices to PCs to data centers. The ecosystem includes TensorRT-LLM, an open source library that accelerates and optimizes inference performance of recent large language models on the NVIDIA AI platform, enabling developers to experiment with new LLMs for high performance and quick customization through a simplified Python API.
    Starting Price: Free
  • 26
    SmythOS

    SmythOS

    SmythOS

    Say goodbye to manual coding and build agents faster than ever. Describe what you need, and SmythOS builds it from your chat or image, using the best AI models and APIs for your task. Use any AI model or API. Integrate with OpenAI, Hugging Face, Amazon Bedrock, and hundreds of vendors without a line of code. A pre-built agent template library gives you agents that already work out of the box for dozens of use cases. Just hit the button and connect with your own API keys. Because your marketing team should not have access to agents that work with your code. We got you covered. Create a space for each client, team, and project with full user and permission management. Deploy on-prem or to AWS. Integrate with Bedrock, Vertex, Adobe, Salesforce, etc. Explainable AI with full control over data flows, audit logs, encryption, and auth. Chat with your agents, give them bulk work, inspect their work logs, assign them work schedules, and more.
    Starting Price: $30 per month
  • 27
    Bakery

    Bakery

    Bakery

    Easily fine-tune & monetize your AI models with one click. For AI startups, ML engineers, and researchers. Bakery is a platform that enables AI startups, machine learning engineers, and researchers to fine-tune and monetize AI models with ease. Users can create or upload datasets, adjust model settings, and publish their models on the marketplace. The platform supports various model types and provides access to community-driven datasets for project development. Bakery's fine-tuning process is streamlined, allowing users to build, test, and deploy models efficiently. The platform integrates with tools like Hugging Face and supports decentralized storage solutions, ensuring flexibility and scalability for diverse AI projects. The bakery empowers contributors to collaboratively build AI models without exposing model parameters or data to one another. It ensures proper attribution and fair revenue distribution to all contributors.
    Starting Price: Free
  • 28
    Weave

    Weave

    Chasm

    Weave is a no-code AI workflow builder that enables users to automate tasks by implementing multiple Large Language Models (LLMs) and connecting prompts without the need for coding. With an intuitive interface, users can select templates, personalize them, and transform workflows into automated solutions. Weave supports various AI models, including those from OpenAI, Meta, Hugging Face, and Mistral AI, allowing for seamless integration and fine-tuning to achieve industry-specific results. Key features include intuitive dataflow management, app-ready APIs for easy integration, AI hosting, cost-effective AI models, effortless personalization, and user-friendly modules. Weave is ideal for applications such as generating character dialogue and backstories, developing intelligent chatbots, and automating written content.
    Starting Price: $10
  • 29
    FauxPilot

    FauxPilot

    FauxPilot

    FauxPilot is an open source, self-hosted alternative to GitHub Copilot. It utilizes the SalesForce CodeGen models on NVIDIA's Triton Inference Server with the FasterTransformer backend for local code generation. It requires Docker, an NVIDIA GPU with sufficient VRAM, and the ability to split the model across multiple GPUs if needed. The setup involves downloading models from Hugging Face and converting them for FasterTransformer compatibility.
    Starting Price: Free
  • 30
    Qwen2.5-Max
    Qwen2.5-Max is a large-scale Mixture-of-Experts (MoE) model developed by the Qwen team, pretrained on over 20 trillion tokens and further refined through Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). In evaluations, it outperforms models like DeepSeek V3 in benchmarks such as Arena-Hard, LiveBench, LiveCodeBench, and GPQA-Diamond, while also demonstrating competitive results in other assessments, including MMLU-Pro. Qwen2.5-Max is accessible via API through Alibaba Cloud and can be explored interactively on Qwen Chat.
    Starting Price: Free