| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| 0.2.4 source code.tar.gz | 2025-07-08 | 8.3 MB | |
| 0.2.4 source code.zip | 2025-07-08 | 8.4 MB | |
| README.md | 2025-07-08 | 1.6 kB | |
| Totals: 3 Items | | 16.7 MB | 0 |
# Guidance 0.2.4
Better sampling, better metrics, llama-cpp-python fixes (please update to the latest!), and countless visualization fixes.
## Added
- Allow changing `sampling_params` (`top_p`, `top_k`, `min_p`, `repetition_penalty`) on the fly via `Model.with_sampling_params(...)` (see the sketch after this list)
- Add `top_k` tokens back into vis after their temporary removal in the previous refactor
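A minimal sketch of the on-the-fly sampling change, assuming a local `LlamaCpp` backend and that `with_sampling_params` accepts the parameters above as keyword arguments (the release notes only confirm the method name and parameter list, so check the API for the exact signature):

```python
from guidance import gen
from guidance.models import LlamaCpp

# Placeholder path; any llama-cpp-compatible GGUF model works here.
lm = LlamaCpp("path/to/model.gguf")

# Derive a handle with different sampling behaviour without reloading weights.
# The keyword-argument form here is an assumption, not confirmed API.
creative = lm.with_sampling_params(
    top_p=0.95, top_k=50, min_p=0.05, repetition_penalty=1.1
)

result = creative + "Write a one-line tagline for a chess club: " + gen(
    "tagline", max_tokens=20
)
print(result["tagline"])
```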
## Removed
- `Model.token_count` removed in favor of the (currently private) `Model._get_usage().output_tokens`
## Changed
- Bookkeeping of metrics such as `input_tokens`, `output_tokens`, `ff_tokens`, `token_savings`, and `avg_latency_ms` has been added to State and is now accessible via the (private for now) `Model._get_usage()`. This replaces bookkeeping that was previously attached to `Engine` instances.
- Factory function `create_azure_openai_model()` for accessing models hosted in Azure AI (see the sketch after this list)
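A sketch tying the two Changed items together; the factory function's import path and parameter names are assumptions, and `_get_usage()` is private in 0.2.4, so treat this as illustrative rather than stable API:

```python
from guidance import gen
from guidance.models import create_azure_openai_model  # import path assumed

# Parameter names below are assumptions for illustration only.
lm = create_azure_openai_model(
    model_name="gpt-4o-mini",
    azure_endpoint="https://example.openai.azure.com",
    api_key="<your-key>",
)

lm += "Q: What is 2 + 2?\nA: " + gen("answer", max_tokens=5)

# Metrics now live on State rather than on Engine instances.
usage = lm._get_usage()  # private for now, per the note above
print(usage.output_tokens)  # replaces the removed Model.token_count
print(usage.input_tokens, usage.ff_tokens, usage.token_savings, usage.avg_latency_ms)
```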
## Fixed
- Intermittent double widget render fixed.
- Widget not always completing its run fixed.
- Widget backtracking bug fixed.
- Widget now always shows both inputs and outputs; previously it would sometimes fail to.
- `TraceHandler` forests stripped of extra trace nodes, which sometimes caused render glitches.
- Widget latency displays now render.
- Widget early race condition resolved (the widget was sometimes ready only after the backend had started firing messages).
- Various linting and build improvements.
- Tokens generated with OpenAI now correctly tagged as generated for vis.
- Fix compatibility with `llama-cpp-python` 0.3.12, bumping the dependency from 0.3.9 to 0.3.12 (first contrib: @jovemexausto).
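If you pin dependencies yourself, a quick runtime check along these lines (a generic sketch, not part of guidance) confirms the bumped minimum is met:

```python
from importlib.metadata import version
from packaging.version import Version

installed = Version(version("llama-cpp-python"))
# guidance 0.2.4 requires llama-cpp-python of at least 0.3.12 (see entry above).
if installed < Version("0.3.12"):
    raise RuntimeError(
        f"llama-cpp-python {installed} found; guidance 0.2.4 needs >= 0.3.12"
    )
```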