Literal AI
Literal AI is a collaborative platform designed to assist engineering and product teams in developing production-grade Large Language Model (LLM) applications. It offers a suite of tools for observability, evaluation, and analytics, enabling efficient tracking, optimization, and integration of prompt versions. Key features include multimodal logging, encompassing vision, audio, and video, prompt management with versioning and AB testing capabilities, and a prompt playground for testing multiple LLM providers and configurations. Literal AI integrates seamlessly with various LLM providers and AI frameworks, such as OpenAI, LangChain, and LlamaIndex, and provides SDKs in Python and TypeScript for easy instrumentation of code. The platform also supports the creation of experiments against datasets, facilitating continuous improvement and preventing regressions in LLM applications.
Learn more
Langfuse
Langfuse is an open source LLM engineering platform to help teams collaboratively debug, analyze and iterate on their LLM Applications.
Observability: Instrument your app and start ingesting traces to Langfuse
Langfuse UI: Inspect and debug complex logs and user sessions
Prompts: Manage, version and deploy prompts from within Langfuse
Analytics: Track metrics (LLM cost, latency, quality) and gain insights from dashboards & data exports
Evals: Collect and calculate scores for your LLM completions
Experiments: Track and test app behavior before deploying a new version
Why Langfuse?
- Open source
- Model and framework agnostic
- Built for production
- Incrementally adoptable - start with a single LLM call or integration, then expand to full tracing of complex chains/agents
- Use GET API to build downstream use cases and export data
Learn more
PromptHub
Test, collaborate, version, and deploy prompts, from a single place, with PromptHub. Put an end to continuous copy and pasting and utilize variables to simplify prompt creation. Say goodbye to spreadsheets, and easily compare outputs side-by-side when tweaking prompts. Bring your datasets and test prompts at scale with batch testing. Make sure your prompts are consistent by testing with different models, variables, and parameters. Stream two conversations and test different models, system messages, or chat templates. Commit prompts, create branches, and collaborate seamlessly. We detect prompt changes, so you can focus on outputs. Review changes as a team, approve new versions, and keep everyone on the same page. Easily monitor requests, costs, and latencies. PromptHub makes it easy to test, version, and collaborate on prompts with your team. Our GitHub-style versioning and collaboration makes it easy to iterate your prompts with your team, and store them in one place.
Learn more
Prompteams
Develop and version control your prompts. Auto-generated API to retrieve prompts. Automatically run end-to-end LLM testing before making updates to your prompts on production. Let your industry specialists and engineers collaborate on the same platform. Let your industry specialists and prompt engineers test and iterate on the same platform without any programming knowledge. With our testing suite, you can create and run unlimited test cases to ensure the quality of your your your your your prompt. Check for hallucinations, issues, edge cases, and more. Our suite is the most complex of prompts. Use Git-like features to manage your prompts. Create a repository for each project, and create multiple branches to iterate on your prompts. Commit your changes and test them in a separate environment. Easily revert back to a previous version. With our real-time APIs, one single click, and your prompt is updated and live.
Learn more