| Name | Modified | Size |
|---|---|---|
| README.md | 2026-03-01 | 902 Bytes |
| v1.4.0 -- Batched NLI + ONNX Runtime source code.tar.gz | 2026-03-01 | 1.4 MB |
| v1.4.0 -- Batched NLI + ONNX Runtime source code.zip | 2026-03-01 | 1.5 MB |
## What's New
### Batched NLI Inference (3-5x faster)

`score_batch()` and `score_chunked()` now run a single padded forward pass instead of sequential per-pair calls, making chunked document scoring 3-5x faster.
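The idea behind the batched path can be sketched without the real model: pad every tokenized premise/hypothesis pair to the longest sequence in the batch, build an attention mask, and hand the whole rectangle to one forward pass instead of looping. A minimal pure-Python sketch; the token ids, `PAD` value, and `pad_batch` helper here are illustrative stand-ins, not the library's internals (the real path uses the Hugging Face tokenizer's padding):

```python
# Stand-in pad token id; the real value comes from the tokenizer.
PAD = 0

def pad_batch(token_id_seqs):
    """Pad variable-length token sequences into one rectangular batch.

    Returns (input_ids, attention_mask), each a list of equal-length rows,
    ready for a single batched forward pass.
    """
    max_len = max(len(seq) for seq in token_id_seqs)
    input_ids, attention_mask = [], []
    for seq in token_id_seqs:
        pad_n = max_len - len(seq)
        input_ids.append(seq + [PAD] * pad_n)          # right-pad with PAD
        attention_mask.append([1] * len(seq) + [0] * pad_n)  # mask out padding
    return input_ids, attention_mask

# Three pairs of different lengths become one 3 x 6 batch.
seqs = [[101, 7, 8, 102], [101, 7, 8, 9, 10, 102], [101, 5, 102]]
ids, mask = pad_batch(seqs)
```

The speedup comes from amortizing per-call overhead and letting the GPU process all rows in parallel; the mask keeps padding tokens from influencing the scores.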
### ONNX Export + Runtime (~30-50 ms/chunk on GPU)
- `export_onnx()` converts the FactCG model to ONNX via `optimum` (handles DeBERTa's disentangled attention)
- `NLIScorer(backend="onnx", onnx_path=...)` runs inference via ONNX Runtime with auto-CUDA detection
- New optional dep: `pip install "director-ai[onnx]"`
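"Auto-CUDA detection" in ONNX Runtime typically amounts to preferring the CUDA execution provider when the runtime reports it as available and falling back to CPU otherwise. A hedged sketch of that selection logic; the helper name `pick_providers` is illustrative, not the library's API, and in real use you would pass it `onnxruntime.get_available_providers()`:

```python
def pick_providers(available):
    """Prefer CUDA when ONNX Runtime reports it; always keep CPU as fallback.

    `available` is the list a call to onnxruntime.get_available_providers()
    would return; the hard-coded lists below are examples, not real probes.
    """
    if "CUDAExecutionProvider" in available:
        return ["CUDAExecutionProvider", "CPUExecutionProvider"]
    return ["CPUExecutionProvider"]

gpu_choice = pick_providers(["CUDAExecutionProvider", "CPUExecutionProvider"])
cpu_choice = pick_providers(["CPUExecutionProvider"])
```

The chosen list would then be passed as the `providers` argument when constructing an `onnxruntime.InferenceSession`, with order expressing preference.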
### Other
- `ascore_batch()` async helper for batched scoring
- AggreFact benchmark predictor now batches SummaC source chunks
- GPU device handling fix in `_model_score()`: inputs now move to the model's device
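A common pattern for exposing a synchronous batch scorer as an async helper is to offload the blocking call to a worker thread so the event loop stays responsive during inference. A sketch under that assumption; the dummy `score_batch` and its scores are placeholders, and the real `ascore_batch()` may be implemented differently:

```python
import asyncio

def score_batch(pairs):
    """Stand-in for the synchronous batched NLI scorer."""
    return [0.5 for _ in pairs]  # dummy entailment scores

async def ascore_batch(pairs):
    # Run the blocking batched call in a worker thread so awaiting
    # callers do not block the event loop during model inference.
    return await asyncio.to_thread(score_batch, pairs)

scores = asyncio.run(ascore_batch([("premise", "hypothesis")] * 3))
```

`asyncio.to_thread` (Python 3.9+) keeps the scorer itself untouched, which is why this wrapper style is popular for adding async entry points to CPU/GPU-bound libraries.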
### Stats
- 680 tests passing (Python 3.10/3.11/3.12)
- Rust crates: all green
- Lint + type check: clean
**Full Changelog**: https://github.com/anulum/director-ai/compare/v1.3.0...v1.4.0