Alice
Alice (formerly ActiveFence) is a security, safety, and trust platform built to protect AI systems and online platforms in the GenAI era. Powered by the world’s largest adversarial intelligence dataset, Alice safeguards over 3 billion users across more than 120 languages. Its Rabbit Hole intelligence engine continuously analyzes billions of toxic and manipulative data samples to detect emerging threats in real time. The WonderSuite platform includes tools like WonderBuild for pre-launch stress testing, WonderFence for runtime guardrails, and WonderCheck for automated red-teaming. By defending against prompt injection, jailbreaks, governance gaps, and harmful AI behavior, Alice enables enterprises and foundation model labs to innovate with confidence.
Learn more
OpenAI Moderation
The OpenAI Moderation API provides developers with a dedicated endpoint to automatically evaluate whether text or images contain potentially harmful or policy-violating content, enabling safer AI applications through real-time filtering and classification. It works by analyzing inputs (and optionally outputs) and returning structured results that indicate whether the content is flagged, along with detailed category labels such as hate, harassment, self-harm, sexual content, or violence. It is designed to be integrated directly into application workflows, allowing developers to take immediate action, such as blocking, filtering, or escalating content, before it reaches end users. Moderation models like “omni-moderation-latest” are optimized for speed and accuracy, supporting scalable use across high-volume applications while maintaining consistent safety standards.
Learn more
WebOrion Protector Plus
WebOrion Protector Plus is a GPU-powered GenAI firewall engineered to provide mission-critical protection for generative AI applications. It offers real-time defenses against evolving threats such as prompt injection attacks, sensitive data leakage, and content hallucinations. Key features include prompt injection attack protection, safeguarding intellectual property and personally identifiable information (PII) from exposure, content moderation and validation to ensure accurate and on-topic LLM responses, and user input rate limiting to mitigate risks of security vulnerability exploitation and unbounded consumption. At the core of its capabilities is ShieldPrompt, a multi-layered defense system that utilizes context evaluation through LLM analysis of user prompts, canary checks by embedding fake prompts to detect potential data leaks, pand revention of jailbreaks using Byte Pair Encoding (BPE) tokenization with adaptive dropout.
Learn more
CrowdStrike Falcon AIDR
CrowdStrike Falcon AI Detection and Response (AIDR) is an enterprise security platform designed to protect the rapidly expanding AI attack surface by delivering real-time visibility, detection, and response across AI systems, users, and interactions. It provides unified visibility into how employees and AI agents use generative AI by mapping relationships between users, prompts, models, agents, and supporting infrastructure, while capturing detailed runtime logs for monitoring, compliance, and investigation. It continuously monitors AI activity across endpoints, cloud environments, and applications, enabling organizations to understand how data flows through AI systems and how agents operate within defined boundaries. AIDR detects and blocks AI-specific threats such as prompt injection, jailbreak attempts, malicious entities, harmful outputs, and unauthorized interactions, using behavioral analysis and integrated threat intelligence.
Learn more