Alternatives to EvalFlow
Compare EvalFlow alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to EvalFlow in 2026. Compare features, ratings, user reviews, pricing, and more from EvalFlow competitors and alternatives in order to make an informed decision for your business.
-
1
CCM Platform
Azuba Corporation
Napersoft CCM Document Platform 8 is our latest distributed platform solution for Microsoft® Windows and Linux servers. It was developed for today's wired world and offers browser-based computing with many new and advanced features. A platform for medium to very large enterprises that enables Batch, Interactive and OnDemand composition, formatting and delivery of relevant and personalized customer communications via multiple channel support including print, email, text and many more.. And, it supports communications in over 100 different languages including English, Spanish, Chinese, German, French and many more.Starting Price: $40,000 USD -
2
MultiRater Surveys
Peoplogica
MultiRater Surveys delivers MyMentor Insights, a structured leadership feedback and development program designed for consultants, executive coaches, and HR teams who want feedback to drive real change. MyMentor Insights combines multi-source feedback, interactive online debriefs, personalised development planning, and ongoing progress tracking in one integrated platform. Users can launch 180° reviews, 360° leadership surveys, employee engagement, wellbeing, and customer pulse surveys — all configurable to align with your leadership framework or organisational language. Once surveys close, results are presented through interactive debriefs that allow leaders and coaches to explore competency insights and question-level data with clarity. Insights flow directly into tailored development plans, supported by progress surveys and AI coaching to reinforce sustained behaviour change. Start your 14-day free trial, no payment details required. -
3
Lattice
Lattice
Lattice is the AI-powered people platform that strategic HR teams use to make managers more effective, simplify people operations, and leverage workforce insights to drive business impact. More than 5,000 companies worldwide leverage Lattice's performance management, engagement, compensation, growth, goals, and analytics apps to build highly productive and efficient teams. Lattice is listed as an industry leader on G2’s 2024 Best Software Awards in multiple categories, including Highest Customer Satisfaction and HR Products.Starting Price: $9/month/user -
4
Leapsome
Leapsome
CEOs & HR teams in forward-thinking companies such as Spotify, Trivago and Babbel use Leapsome to create a continuous cycle of performance management and personalized learning that powers employee engagement and the success of their business. As a people management platform, Leapsome combines tools for Goals & OKRs Management, Performance Reviews & 360s, Employee Learning & Onboarding, Employee Engagement Surveys, Feedback & Praise, and Meetings.Starting Price: $7 per user per month -
5
20 Dollar Eval
SVI
With its user-friendly interface, 20 Dollar Eval provides easy-to-follow prompts and automated features, requiring no technical expertise to operate. 20 Dollar Eval is powered by SVI, an organizational development company that focuses on creating irresistible companies and extraordinary people. Over the years, SVI has launched thousands of performance reviews within some of the world’s largest and most complex organizations. You can rest comfortably knowing that, while the price is low, the system and industry expertise supporting it are proven to be best-in-class.Starting Price: $20 per review -
6
BiG EVAL
BiG EVAL
The BiG EVAL solution platform provides powerful software tools needed to assure and improve data quality during the whole lifecycle of information. BiG EVAL's data quality management and data testing software tools are based on the BiG EVAL platform - a comprehensive code base aimed for high performance and high flexibility data validation. All features provided were built by practical experience based on the cooperation with our customers. Assuring a high data quality during the whole life cycle of your data is a crucial part of your data governance and is very important to get the most business value out of your data. This is where the automation solution BiG EVAL DQM comes in and supports you in all tasks regarding data quality management. Ongoing quality checks validate your enterprise data continuously, provide a quality metric and supports you in solving the quality issues. BiG EVAL DTA lets you automate testing tasks in your data oriented project. -
7
SnapEval 2.0
SnapEval
Instantly capture and share feedback ‘snapshots’ using smartphones and computers. Automatically incorporate feedback snapshots into a Performance Summary. Nominate a feedback snapshot for public recognition of performance excellence within the organization. Drag and drop to establish relationships. Explore organization structure ‘what ifs.’ Live access and file export sharing. Instantly create and send custom rich push notification messages to smartphones. Align employees with the organization’s values and goals. Gain comprehensive visibility into performance levels and trends across the firm. Automatically create professional evaluations using Continuous Feedback. Universal support of employee performance feedback for all job functions across all industries. Feedback is captured and shared in intuitive snapshots called ‘Evals’.Starting Price: $2.25 per user per month -
8
Orbit Eval
Turning Point HR Solutions Ltd
Orbit Eval is part of the Orbit Software Suite and is analytical job evaluation software. Job evaluation is a consistent & systematic process for defining the relative size or ranking of jobs within an organisation, by applying a consistent set of criteria to job roles. Analytical schemes offer a higher degree of rigour and objectivity. They enable a systematic approach to be applied providing a rationale as to why jobs are ranked differently. Application of the same method throughout the evaluation ensures consistency while minimising subjectivity and gender bias Orbit Eval is easy to use, very transparent and ensures consistency. The tool has been designed to be ‘owned’ by the organisation & requires minimal amounts of training. . It is hosted in the cloud with access permission levels. You can also input your current paper based scheme into the web-based data storage facility in Orbit Eval© to accommodate various systems including: NJC, GLPC & others. -
9
eVal
eVal
eVal's free data and peer company analysis tools include historic valuation multiples, historical share price data, company financial information, and Valuation Multiples by Industry sector reports, for use in investment and business valuations. In addition to the provision of financial data and peer company analysis tools, eVal provides investment and company valuations. eVal offers expert business, investment, and company valuations based on our proprietary data-driven valuation software and platform. Our investment and business valuation service is tailored for valuation professionals, business owners, investors, and investment advisors. If you're a business owner and require a business valuation; or if you're an investor and require a private company valuation for your portfolio, please contact us directly regarding our business valuation service. Our outlier detection tool provides an overview of the peer group valuation multiples.Starting Price: Free -
10
viEval
viGlobal
Evaluate every professional's performance with ease, efficiency & precision. Your annual review process doesn't have to be time-consuming. With our help, simplify any number of evaluations into one easy annual workflow. We understand the results your professional services firm needs to capture, including performance on projects and client work. viEval is the best-in-class tool for performance evaluation of professional work. All client work and hours are automatically pulled in from billing systems, so evaluations can be completed quickly and easily. We build high-performance cultures with 360-degree annual evaluation and integration with real-time feedback for continuous improvement. Our system can be easily customized for any role, department, or practice area. Create a performance management process of any complexity with our intelligent process builder. Use our pre-built templates for professional services firms or design your own process to capture precise feedback. -
11
DeepEval
Confident AI
DeepEval is a simple-to-use, open source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence. The framework supports synthetic dataset generation with advanced evolution techniques and integrates seamlessly with popular frameworks, allowing for efficient benchmarking and optimization of LLM systems.Starting Price: Free -
12
EvalExpert
AlgoDriven
EvalExpert empowers dealerships by giving them the vehicle appraisal tools to make data-driven decisions about used cars. We offer a fully automated, single platform for vehicle appraisal, price guidance and analysis. Our industry leading data, partnered with proprietary algorithms; help reduce paperwork, eliminate mistakes of manual entry, improve productivity & provide great service to your customers. Using our propriety algorithms and industry leading data, EvalExpert streamlines the appraisal process with our easy to use, 3 step appraisal process - scan the vehicles registration or VIN, take photos, enter current information & condition details - done! EvalExpert’s Web Dashboard instantly syncs all your dealerships evaluations from any device. It provides overview statistics for the dealership and sales team with the most advanced reporting tools available in the market. -
13
Valid Eval
Valid Eval
Complex group deliberations don't have to be painful. Whether you're tasked with ranking hundreds of competing proposals, judging a dozen live pitches, or managing a multi-phase innovation program, there's an easier way. A better way. Valid Eval is an online evaluation system for organizations that make and defend tough decisions. It's a secure SaaS platform that works efficiently at virtually any scale so you can involve as many applicants, subjects, domain experts, and judges as it takes to do the job right. Combining best practices from the learning sciences and systems engineering, Valid Eval delivers defensible, data driven results and provides robust reporting tools that help you measure and monitor performance and demonstrate mission alignment. Best of all, it provides an unprecedented degree of transparency that promotes accountability and builds trust in the process. -
14
Maxim
Maxim
Maxim is an agent simulation, evaluation, and observability platform that empowers modern AI teams to deploy agents with quality, reliability, and speed. Maxim's end-to-end evaluation and data management stack covers every stage of the AI lifecycle, from prompt engineering to pre & post release testing and observability, data-set creation & management, and fine-tuning. Use Maxim to simulate and test your multi-turn workflows on a wide variety of scenarios and across different user personas before taking your application to production. Features: Agent Simulation Agent Evaluation Prompt Playground Logging/Tracing Workflows Custom Evaluators- AI, Programmatic and Statistical Dataset Curation Human-in-the-loop Use Case: Simulate and test AI agents Evals for agentic workflows: pre and post-release Tracing and debugging multi-agent workflows Real-time alerts on performance and quality Creating robust datasets for evals and fine-tuning Human-in-the-loop workflowsStarting Price: $29/seat/month -
15
EvalsOne
EvalsOne
An intuitive yet comprehensive evaluation platform to iteratively optimize your AI-driven products. Streamline LLMOps workflow, build confidence, and gain a competitive edge. EvalsOne is your all-in-one toolbox for optimizing your application evaluation process. Imagine a Swiss Army knife for AI, equipped to tackle any evaluation scenario you throw its way. Suitable for crafting LLM prompts, fine-tuning RAG processes, and evaluating AI agents. Choose from rule-based or LLM-based approaches to automate the evaluation process. Integrate human evaluation seamlessly, leveraging the power of expert judgment. Applicable to all LLMOps stages from development to production environments. EvalsOne provides an intuitive process and interface, that empowers teams across the AI lifecycle, from developers to researchers and domain experts. Easily create evaluation runs and organize them in levels. Quickly iterate and perform in-depth analysis through forked runs. -
16
Martian
Martian
By using the best-performing model for each request, we can achieve higher performance than any single model. Martian outperforms GPT-4 across OpenAI's evals (open/evals). We turn opaque black boxes into interpretable representations. Our router is the first tool built on top of our model mapping method. We are developing many other applications of model mapping including turning transformers from indecipherable matrices into human-readable programs. If a company experiences an outage or high latency period, automatically reroute to other providers so your customers never experience any issues. Determine how much you could save by using the Martian Model Router with our interactive cost calculator. Input your number of users, tokens per session, and sessions per month, and specify your cost/quality tradeoff. -
17
Tapt Health
Tapt Health
Tapt Health completes your documentation while you treat. Leverage AI to better engage patients, expedite evals, and minimize after-hours documentation.Starting Price: $91/month/user -
18
Revolution FTO
Wayne Enterprises
Documenting the training of new officers is serious business. Liability is generally determined by training or the lack of it. Our police and sheriff FTO evaluation software was created by sworn officers having over 23 years of experience in managing FTOs and training new officers. This software is web-based and allows your training officers to document all daily and monthly activities of your newer officers. Through an annual contract with your agency, we can provide 24/7 phone, web, and onsite technical support. You will get direct assistance from a developer of the software. Create evaluations in half the time. FTO's can only change the evals they create. Finalization prevents changes in evaluations. Use from any computer inside the department. Use dailies to create monthlies, trainees can log on and sign evals without FTO. Chronological one-button approval of evaluations. Create statistical reports and track the effectiveness of police academies. -
19
Trusys AI
Trusys
Trusys.ai is a unified AI assurance platform that helps organizations evaluate, secure, monitor, and govern artificial intelligence systems across their full lifecycle, from early testing to production deployment. It offers a suite of tools: TRU SCOUT for automated security and compliance scanning against global standards and adversarial vulnerabilities, TRU EVAL for comprehensive functional evaluation of AI applications (text, voice, image, and agent) assessing accuracy, bias, and safety, and TRU PULSE for real-time production monitoring with alerts for drift, performance degradation, policy violations, and anomalies. It provides end-to-end observability and performance tracking, enabling teams to catch unreliable output, compliance gaps, and production issues early. Trusys supports model-agnostic evaluation with a no-code, intuitive interface and integrates human-in-the-loop reviews and custom scoring metrics to blend expert judgment with automated metrics.Starting Price: Free -
20
ProdEval
Texas Computer Works
There is no such thing as a typical user of this system. Users include; independent reservoir engineers doing reserve reports, production engineers working up AFE’s and monitoring daily production, bank engineers tracking petroleum loan packages, CFOs tracking their borrowing base, property tax professionals assessing ad-valorem value, plus investors buying and selling producing properties. TCW’s ProdEval software is a quick and comprehensive Economic Evaluation system for both reserve reporting and prospect analysis. ProdEval has a very easy-to-use and straightforward approach to economic analysis and this methodology serves the user well. For example, the projecting of future production using sophisticated curve fitting techniques that allow the user to simply adjust the curves is one of the big factors that new users find attractive. The system is a rather open-ended system in that it accepts data from many sources; excel worksheets, commercial data sources. -
21
Vizcab Eval
Vizcab
Vizcab Eval is the solution to allow you to produce reliable, robust building ACV studies and percussive in one minimum time. Import your DPGF-type measurements and your RSET in a few clicks. Complete your entry using our research panel by keyword. Automatically associate your components and make simple corrections with our alert system. View results globally or in batches in real-time in the form of tables and graphs and validate compliance with thresholds. Identify at a glance the most impactful cards of your project, and bring efficient optimizations. Choose the most virtuous products with our scoring system of FDES. Work together and exchange easily with our fashion collaborative. Export your results in the form of graphs, and study reports according to your needs. Recover one RSEE export from your study to Excel format. You import your data directly into Vizcab Eval, and your components are automatically associated with plugs. -
22
Small Improvements
Small Improvements
Small Improvements is a performance management platform for small and mid-sized teams that want to build a culture of continuous feedback. The software combines simplicity and flexibility, helping companies run structured performance reviews, 360-degree feedback, 1:1 meetings, lightweight goals and objectives, and real-time employee recognition. HR teams can easily customize review cycles, create shared or private 1:1 agendas, and enable employees to request and give feedback anytime. The platform supports continuous development without the complexity of traditional HR software. Built-in reporting tools help track progress, send reminders, and keep teams aligned. Trusted by over 1,000 teams worldwide, Small Improvements is ideal for organizations looking for a lean, easy-to-use solution to strengthen performance conversations and employee engagement.Starting Price: $3.00 p/user p/month -
23
Ayanza
Ayanza
Move faster with productivity platform built for entrepreneurs and their teams. Vision, strategy, and core beliefs are essential to guide productive teams toward success. These and related docs need to have a home. Execution is equally important: goals, tasks, updates feed, and chat. Having them together works like a charm: The strategy influences execution, and well-organized teams move faster. We think better when we write. Our thoughts are clearer, communication is better, and written notes are easy to share. What to write about? Team objectives? Task planning? Progress reflection? Performance eval? It's your team's key know-how. A great schedule allows the team to stay in sync and increases confidence in team results. Build your team rhythms in Ayanza. Let everyone contribute regularly, and the team productivity increases while saving time on meetings.Starting Price: $6/user/month -
24
LastRecord
LastRecord.com
Employee training & skills progression software for Fire Departments. Manage employee skill sheets, task books, succession progress and meet training deadlines all from one central platform. Record live video of task book and skill sheet completion. LastRecord is software for managing Agency Task Books, Performance Reviews, Competencies, Crewmember Observations & More. We believe in building exceptional software at an affordable price, and we've been doing so since 2012. We always put customer satisfaction before anything else. Ditch the outdated paper forms and excel spreadsheets - with LastRecord, it is easy to manage an Observation Reporting / Performance Evaluation program. Effortlessly build, maintain and complete Daily Observations (DORs), Tourly Observations (TORs), FTO, Annual Evals and more. Search, View and Include relevant documents like Skill Competencies , Task Book Task completions, User Engagements and more in employee Performance Reviews.Starting Price: $1,899 per year -
25
Light Table
Light Table
Connects you to your creation with instant feedback and showing data values flow through your code. Easily customizable from keybinds to extensions to be completely tailored to your specific project. Try new ideas quickly and easily. Ask questions about your software, to give you a more profound understanding of your code. Embed anything you want, from graphs to games to running visualizations. Everything from eval and debugging to a fuzzy finder for files and commands to fit seamlessly into your workflow. An elegant, lightweight, beautifully designed layout so your IDE is no longer cluttered. No more printing to the console in order to view your results. Simply evaluate your code and the results will be displayed inline. Developer tools should be open source. Every bit of Light Table's code is available to the community because none of us are as smart as all of us. -
26
Confident AI
Confident AI
Confident AI offers an open-source package called DeepEval that enables engineers to evaluate or "unit test" their LLM applications' outputs. Confident AI is our commercial offering and it allows you to log and share evaluation results within your org, centralize your datasets used for evaluation, debug unsatisfactory evaluation results, and run evaluations in production throughout the lifetime of your LLM application. We offer 10+ default metrics for engineers to plug and use.Starting Price: $39/month -
27
Pipehire
Pipehire
The All-in-One Hiring and Management Cleaning Business Software Pipehire gives you applicant tracking, HR software onboarding and performance tracking all in one platform. Why hiring is so hard? Because only you can do it. Time consuming/read through applications. Complicated/long job applications forms. Applicants don’t tell you all the truth. Not enough information on them. Lack of company culture. Help you visualize the process easily. Free up your mind with organized applicants in the right stage. Reduce no shows with SMS reminders to those scheduled for in person interviews. Professional high converting job applications form. Score candidates via custom questions. Bilingual: English and spanish forms. Capture SSN and DL info securely for background checks. Employee data, birthdays and anniversary reminders. Complaints and compliments tracker with alert. Attendance Tracker. -
28
Taito.ai
Taito.ai
Taito.ai is an AI-powered performance management platform built to help teams perform better without heavy review cycles. It transforms traditional performance reviews into continuous performance enablement through automated expectations, feedback, and coaching. Taito.ai operates directly inside Slack and Google Calendar, keeping performance conversations where work already happens. The platform uses AI to personalize feedback, coaching insights, and goal alignment with minimal manual effort. Managers can set clear goals, run smarter 1:1s, and encourage ongoing feedback through Slack-native workflows. Leadership teams gain bias-resistant summaries and performance insights to support fair talent decisions. Taito.ai helps organizations build a high-performance culture without adding administrative overhead. -
29
PERKÜL
Adisa
PERKÜL is a customizable Performance Culture Management System that enables organizations to systematically disseminate their goals and transparently manage competency and performance data in a digital environment. The goal is not only to measure, but also to create a participatory, traceable, and fair corporate culture that encourages development. Why PERKÜL? *Strategic goals are easily disseminated throughout the organization. *Competency and performance evaluation processes are digitized and simplified. *Feedback becomes systematic, recorded, and easily tracked. *Performance data is no longer manual; it is dynamic, integrated, and analyzable. *Role-based authorization ensures a flexible and controlled organizational structure. *Goal and evaluation archives can be securely stored for many years. *Flexible system configurations tailored to each organization's structure are supported. *Development trends and potential analyses can be reported -
30
Solar Pro 2
Upstage AI
Solar Pro 2 is Upstage’s latest frontier‑scale large language model, designed to power complex tasks and agent‑like workflows across domains such as finance, healthcare, and legal. Packaged in a compact 31 billion‑parameter architecture, it delivers top‑tier multilingual performance, especially in Korean, where it outperforms much larger models on benchmarks like Ko‑MMLU, Hae‑Rae, and Ko‑IFEval, while also excelling in English and Japanese. Beyond superior language understanding and generation, Solar Pro 2 offers next‑level intelligence through an advanced Reasoning Mode that significantly boosts multi‑step task accuracy on challenges ranging from general reasoning (MMLU, MMLU‑Pro, HumanEval) to complex mathematics (Math500, AIME) and software engineering (SWE‑Bench Agentless), achieving problem‑solving efficiency comparable to or exceeding that of models twice its size. Enhanced tool‑use capabilities enable the model to interact seamlessly with external APIs and data sources.Starting Price: $0.1 per 1M tokens -
31
LatPro
LatPro
Looking for Hispanic or bilingual professionals who speak Spanish or Portuguese? LatPro has the best and brightest candidates in the U.S. and Latin America. Whether you’re looking for language skills, multicultural insight or Latin American expertise, LatPro has provided consistent, award-winning results since 1997. The largest resume database of qualified Hispanic and bilingual job candidates. Over 90 exclusive partnerships with prestigious Hispanic organizations give you access to a greater number of Hispanic professionals than any other diversity recruiting source can provide. Post your job ad and receive responses from targeted, qualified candidates specifically interested in your opportunity. We provide powerful electronic filters that automatically screen out unwanted responses. -
32
EVALS
EVALS
EVALS is the most dynamic mobile skills assessment and tracking solution for public safety, providing students and instructors with powerful tools to enhance learning and performance. Record, stream, upload and review videos to reinforce the knowledge, skills, attitudes and beliefs associated with the proper process. Design realistic scenarios and situational evaluations that help students develop the specialized skills needed to be effective in the real world. Track on-the-job training hours and performance requirements using our unique Digital Taskbook and Time Tracking modules. Select the components you need to streamline and simplify your training evaluations, including Digital Taskbook, an embedded events calendar, attendance, and time tracking, private message boards, academic testing, and more. Access the platform from anywhere via a web-enabled device and use the iOS app to perform field and video assessments without an internet connection. -
33
Adaline
Adaline
Iterate quickly and ship confidently. Confidently ship by evaluating your prompts with a suite of evals like context recall, llm-rubric (LLM as a judge), latency, and more. Let us handle intelligent caching and complex implementations to save you time and money. Quickly iterate on your prompts in a collaborative playground that supports all the major providers, variables, automatic versioning, and more. Easily build datasets from real data using Logs, upload your own as a CSV, or collaboratively build and edit within your Adaline workspace. Track usage, latency, and other metrics to monitor the health of your LLMs and the performance of your prompts using our APIs. Continuously evaluate your completions in production, see how your users are using your prompts, and create datasets by sending logs using our APIs. The single platform to iterate, evaluate, and monitor LLMs. Easily rollbacks if your performance regresses in production, and see how your team iterated the prompt. -
34
Amalia
Amalia
Amalia is a compensation management software designed to streamline sales compensation processes for finance, human resources, and operations teams. The platform offers an intuitive user experience, empowering admin teams with autonomy to create and customize complex compensation plans using a user-friendly plan designer. Sales representatives benefit from a seamless interface available in multiple languages, including English, French, German, Spanish, Italian, and Portuguese. Amalia's agile and flexible platform is equipped with features such as complete audibility, forecasting, what-if scenarios, advanced reporting, team hierarchy, multi-currency management, and commission agreements, covering all use cases and handling even the most advanced compensation plans. The platform ensures enterprise-level security with SOC 2 Type II certification, GCP hosting in Brussels, and an SSO-only policy, providing peace of mind that valuable data remains protected and confidential. -
35
Eval&GO
Eval&Go
Easily create, publish and analyze your online survey, questionnaire, and quiz. Our powerful online survey maker allows you to use plenty of question types, insert images and videos, and add your logo and your brand’s colors to create a spectacular online survey, form, and questionnaire. Distribute your online survey by email or publish a link on a website, on your blog, use QR codes, Facebook, Twitter, and more, and get your results in real-time! Generate an advanced and professional survey report in 1 click. Customize it and publish it in a variety of formats (online, Word, PDF, Excel, PowerPoint). Use the PRO+ Team account to collaborate on creating and analyzing your online forms and get the best of your survey maker. Organize your PRO+ Team account based on your goals. Become successful together. Many companies are already using Eval&GO survey makers to create their surveys, forms, and questionnaires. Follow their lead, and create your own online survey!Starting Price: $29 per month -
36
Langfuse
Langfuse
Langfuse is an open source LLM engineering platform to help teams collaboratively debug, analyze and iterate on their LLM Applications. Observability: Instrument your app and start ingesting traces to Langfuse Langfuse UI: Inspect and debug complex logs and user sessions Prompts: Manage, version and deploy prompts from within Langfuse Analytics: Track metrics (LLM cost, latency, quality) and gain insights from dashboards & data exports Evals: Collect and calculate scores for your LLM completions Experiments: Track and test app behavior before deploying a new version Why Langfuse? - Open source - Model and framework agnostic - Built for production - Incrementally adoptable - start with a single LLM call or integration, then expand to full tracing of complex chains/agents - Use GET API to build downstream use cases and export dataStarting Price: $29/month -
37
PolySpeak
CLOUD WHALE INTERACTIVE TECHNOLOGY
Learn a language like Spanish and French for free - with our AI-powered app through immersive chat and conversation. Meet our revolutionary language learning app today! Our language training tool uses cutting-edge AI technology to help you learn Spanish, French, German, Chinese, and English for free! Everything you need to improve your language skills quickly and effortlessly is here. With our app, you'll never run out of things to talk about! Choose from a wide range of interesting topics, and talk directly to your favorite character from all over the world, including Spanish, French, German, and Chinese speaking characters. Our AI-powered chat and conversation feature allows you to practice speaking with hold-to-talk functionality, making language learning more interactive and engaging.Starting Price: Free -
38
Weavel
Weavel
Meet Ape, the first AI prompt engineer. Equipped with tracing, dataset curation, batch testing, and evals. Ape achieves an impressive 93% on the GSM8K benchmark, surpassing both DSPy (86%) and base LLMs (70%). Continuously optimize prompts using real-world data. Prevent performance regression with CI/CD integration. Human-in-the-loop with scoring and feedback. Ape works with the Weavel SDK to automatically log and add LLM generations to your dataset as you use your application. This enables seamless integration and continuous improvement specific to your use case. Ape auto-generates evaluation code and uses LLMs as impartial judges for complex tasks, streamlining your assessment process and ensuring accurate, nuanced performance metrics. Ape is reliable, as it works with your guidance and feedback. Feed in scores and tips to help Ape improve. Equipped with logging, testing, and evaluation for LLM applications.Starting Price: Free -
39
SpeechPulse
AV BEAM
SpeechPulse uses your computer’s microphone for real-time speech recognition. It can type into your favorite apps, including text editors, web browsers, and office applications. SpeechPulse works fully offline and doesn’t require any internet connectivity. It supports speech recognition in multiple languages, including English, French, Spanish, Italian, German, Japanese, Chinese, and Russian (a total of 100 languages). SpeechPulse supports both auto punctuation and manual punctuation for the English language. It supports auto punctuation for all other languages. SpeechPulse can also generate subtitles for your audio and video files with accurate timestamps. It supports SRT and VTT subtitle formats. You can also customize the width of a subtitle line to include only a limited number of characters. SpeechPulse has a one-time payment. You can pay for the product once and use it forever.Starting Price: $59.95/one-time payment -
40
FeedbackPulse
FeedbackPulse
FeedbackPulse is an employee engagement and performance platform built for growing teams and SMBs. It brings together engagement surveys, peer recognition, 360-degree feedback, and performance reviews in a single automated workflow, helping HR managers and People Ops teams build a continuous feedback culture. Employees complete self-assessments, peers contribute structured reviews, and managers provide feedback on a schedule the team sets. Assessments remain private until both parties submit, keeping evaluations fair and transparent. Engagement analytics track team sentiment over time and surface disengagement signals early, giving leaders the insight to act before problems affect retention. Key features include customizable pulse surveys, 360-degree peer and manager reviews, performance review cycles, an employee directory, engagement analytics and benchmarks, and ready-to-use survey templates. The platform supports monthly, quarterly, bi-annual, or annual review frequencies.Starting Price: $4/month -
41
AgentOps
AgentOps
Industry-leading developer platform to test and debug AI agents. We built the tools so you don't have to. Visually track events such as LLM calls, tools, and multi-agent interactions. Rewind and replay agent runs with point-in-time precision. Keep a full data trail of logs, errors, and prompt injection attacks from prototype to production. Native integrations with the top agent frameworks. Track, save, and monitor every token your agent sees. Manage and visualize agent spending with up-to-date price monitoring. Fine-tune specialized LLMs up to 25x cheaper on saved completions. Build your next agent with evals, observability, and replays. With just two lines of code, you can free yourself from the chains of the terminal and instead visualize your agents’ behavior in your AgentOps dashboard. After setting up AgentOps, each execution of your program is recorded as a session and the data is automatically recorded for you.Starting Price: $40 per month -
42
Entry Point AI
Entry Point AI
Entry Point AI is the modern AI optimization platform for proprietary and open source language models. Manage prompts, fine-tunes, and evals all in one place. When you reach the limits of prompt engineering, it’s time to fine-tune a model, and we make it easy. Fine-tuning is showing a model how to behave, not telling. It works together with prompt engineering and retrieval-augmented generation (RAG) to leverage the full potential of AI models. Fine-tuning can help you to get better quality from your prompts. Think of it like an upgrade to few-shot learning that bakes the examples into the model itself. For simpler tasks, you can train a lighter model to perform at or above the level of a higher-quality model, greatly reducing latency and cost. Train your model not to respond in certain ways to users, for safety, to protect your brand, and to get the formatting right. Cover edge cases and steer model behavior by adding examples to your dataset.Starting Price: $49 per month -
43
Quantum Workplace
Quantum Workplace
Quantum Workplace provides everything your managers need to cultivate a people-first culture that gets results. Measure, monitor, and improve employee engagement, and coach managers to better lead their teams. The software’s intuitive interface makes it easy for managers to adopt into their everyday workflow. It adapts to your company’s structure and way of working, so it’s always a good fit. Limitless resources and intelligent alerts give managers a personal, in-tool coach, guiding them on how to better serve their teams and even improve their own coaching skills. Increased visibility among employees and teams transforms the over-siloed, transactional workplace into something more conversational, inspirational, and human. Feedback leads to growth, and growth leads to high-performing organizations. The platform gives managers the insight they need to engage their teams in cycles of continuous improvement. -
44
Llama 3
Meta
We’ve integrated Llama 3 into Meta AI, our intelligent assistant, that expands the ways people can get things done, create and connect with Meta AI. You can see first-hand the performance of Llama 3 by using Meta AI for coding tasks and problem solving. Whether you're developing agents, or other AI-powered applications, Llama 3 in both 8B and 70B will offer the capabilities and flexibility you need to develop your ideas. With the release of Llama 3, we’ve updated the Responsible Use Guide (RUG) to provide the most comprehensive information on responsible development with LLMs. Our system-centric approach includes updates to our trust and safety tools with Llama Guard 2, optimized to support the newly announced taxonomy published by MLCommons expanding its coverage to a more comprehensive set of safety categories, code shield, and Cybersec Eval 2.Starting Price: Free -
45
buzon
Diverza
Choose a plan and invoice immediately! Efficiency, security and guarantee of continuity. A solution to take you by the hand and step by step in the billing process, from the simple manual capture, to the import of a flat file, in the same way that today you carry out the task of generating your receipts. Issuance and delivery. Import and export. Business addenda. Storage and backup. Certified custody. Translate. Translation types translate Text. Documents. Text translation. Spanish - detected. English Spanish. French English. Swedish Spanish Source. Translation results risk / tax. Module for the validation of validity of CFDIs and the presence of taxpayers in the "black lists" of the SAT (art. 69 and 69-B CFF). conciliation / tax Module for reconciliation, processing and issuance of Electronic Payment Receipts (REPs) of operations carried out on credit. vault / fiscal Module for synchronization with the SAT (for massive download and storage) and management of receipts issued -
46
WorkMeter
WorkMeter
WorkMeter is a Spanish company specializing in SaaS software for the automatic measurement of time and workload. Its technology provides accurate metrics on work activity, time tracking, calendars, absences, application usage, and project costs, which can be integrated into business dashboards to enhance productivity and support data-driven decision-making. Its solution ensures compliance with labor regulations such as time tracking, remote work, and digital disconnection, promoting transparency, flexibility, and employee well-being, always respecting individual privacy. Additionally, it contributes to the digitalization of HR, optimizing processes and reducing costs. WorkMeter offers solutions for time management, performance measurement, and project tracking, helping companies improve efficiency and regulatory compliance. More than 50,000 users in Spain and Latin America trust WorkMeter to streamline their workforce management. -
47
Tembo
Tembo
Want to read bilingual stories and improve your Spanish language skills? Then our app "Tembo: Bilingual Stories" is perfect for you! This app is specially designed to help you learn Spanish and other languages through engaging stories. Whether you are a beginner or already have some experience with the Spanish language, this app is suitable for all levels. • Spanish, French, German and other stories: Discover a wide selection of engaging and fun stories. These stories range in difficulty from easy to advanced, so you can read and learn at your own level. • Bilingual display: Want to understand the meaning of words or phrases? Use the built-in bilingual display for instant translations into English. • Pronunciation: Improve your pronunciation by listening to the high-quality audio recordings of each story. This way you can strengthen your listening and speaking skills in Spanish. • Interactive Exercises: Test your knowledgeStarting Price: $0 -
48
Claude Sonnet 3.5
Anthropic
Claude Sonnet 3.5 sets new industry benchmarks for graduate-level reasoning (GPQA), undergraduate-level knowledge (MMLU), and coding proficiency (HumanEval). It shows marked improvement in grasping nuance, humor, and complex instructions, and is exceptional at writing high-quality content with a natural, relatable tone. Claude Sonnet 3.5 operates at twice the speed of Claude Opus 3. This performance boost, combined with cost-effective pricing, makes Claude Sonnet 3.5 ideal for complex tasks such as context-sensitive customer support and orchestrating multi-step workflows. Claude Sonnet 3.5 is now available for free on Claude.ai and the Claude iOS app, while Claude Pro and Team plan subscribers can access it with significantly higher rate limits. It is also available via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. The model costs $3 per million input tokens and $15 per million output tokens, with a 200K token context window.Starting Price: Free -
49
Profit.co
Profit Apps Inc
Profit.co is an AI-powered strategy execution platform that bridges the gap between high-level strategy and daily execution. Founded in Silicon Valley and headquartered in Texas, the company serves customers across 70 countries, ranging from agile startups to more than 50 Fortune 500 giants. The platform is architected around three critical organizational pillars to drive measurable results: 𝐏𝐥𝐚𝐧: Defines strategy through advanced OKR Software, Balanced Scorecards, Hoshin Kanri and Strategy Roadmaps to ensure total organizational alignment. 𝐏𝐫𝐨𝐜𝐞𝐬𝐬: Drives execution via Project Portfolio Management (PPM), Task Management, Timesheets, and structured Meeting tools. 𝐏𝐞𝐨𝐩𝐥𝐞: Aligning the human side of performance through Employee Engagement, Recognition, Pulse Surveys, 360-degree Feedback, and continuous Performance Management.Starting Price: Custom pricing -
50
Claude Opus 3
Anthropic
Opus, our most intelligent model, outperforms its peers on most of the common evaluation benchmarks for AI systems, including undergraduate level expert knowledge (MMLU), graduate level expert reasoning (GPQA), basic mathematics (GSM8K), and more. It exhibits near-human levels of comprehension and fluency on complex tasks, leading the frontier of general intelligence. All Claude 3 models show increased capabilities in analysis and forecasting, nuanced content creation, code generation, and conversing in non-English languages like Spanish, Japanese, and French.Starting Price: Free