format-factory free download

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

LLaMA-Factory is a fine-tuning and training framework for Meta's LLaMA language models. It enables researchers and developers to train and customize LLaMA models efficiently using advanced optimization techniques.

Downloads: 5 This Week

Last Update: 2025-12-31

See Project

KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

KVCache-Factory is an open-source research framework designed to explore and implement unified key-value cache compression techniques for autoregressive transformer models. In large language models, the key-value cache stores intermediate attention states that enable efficient token generation during inference, but these caches can consume large amounts of GPU memory when handling long contexts.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

Guardrails

Adding guardrails to large language models

Guardrails is a Python package that lets a user add structure, type and quality guarantees to the outputs of large language models (LLMs). At the heart of Guardrails is the rail spec. rail is intended to be a language-agnostic, human-readable format for specifying structure and type information, validators and corrective actions over LLM outputs. We create a RAIL spec to describe the expected structure and types of the LLM output, the quality criteria for the output to be considered valid, and corrective actions to be taken if the output is invalid.

Downloads: 1 This Week

Last Update: 2026-04-03

See Project

LiteLLM

lightweight package to simplify LLM API calls

Call all LLM APIs using the OpenAI format [Anthropic, Huggingface, Cohere, Azure OpenAI etc.] liteLLM supports streaming the model response back, pass stream=True to get a streaming iterator in response. Streaming is supported for OpenAI, Azure, Anthropic, and Huggingface models.

Downloads: 4 This Week

Last Update: 20 hours ago

See Project

Qwen3-Coder

Qwen3-Coder is the code version of Qwen3

...It is capable of handling 358 programming languages, from common to niche, making it versatile for a wide range of development environments. The model integrates a specially designed function call format and supports popular platforms such as Qwen Code and CLINE for agentic coding workflows.

1 Review

Downloads: 30 This Week

Last Update: 2026-03-24

See Project

CodeLlama

Inference code for CodeLlama models

...Typical usage includes prompt-driven generation, function or class completion, and zero-shot adherence to natural-language instructions about code changes. The ecosystem provides multiple distributions (e.g., HF format) so developers can integrate with standard toolchains and serving stacks. As part of the broader Llama effort, Code Llama complements instruction-tuned chat models by focusing on code-centric tasks and editor integrations.

Downloads: 5 This Week

Last Update: 2025-10-08

See Project

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat

...It keeps the series’ smooth dialog and low deployment cost while adding native tool use (function calling), a built-in code interpreter, and agent-style workflows. The family includes base and long-context variants (8K/32K/128K). The repo ships Python APIs, CLI and web demos (Gradio/Streamlit), an OpenAI-format API server, and a compact fine-tuning kit. Quantization (4/8-bit), CPU/MPS support, and accelerator backends (TensorRT-LLM, OpenVINO, chatglm.cpp) enable lightweight local or edge deployment.

Downloads: 5 This Week

Last Update: 4 days ago

See Project

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

ChatGLM2-6B is the second-gen Chinese-English conversational LLM from ZhipuAI/Tsinghua. It upgrades the base model with GLM’s hybrid pretraining objective, 1.4 TB bilingual data, and preference alignment—delivering big gains on MMLU, CEval, GSM8K, and BBH. The context window extends up to 32K (FlashAttention), and Multi-Query Attention improves speed and memory use. The repo includes Python APIs, CLI & web demos, OpenAI-style/FASTAPI servers, and quantized checkpoints for lightweight local...

Downloads: 3 This Week

Last Update: 4 days ago

See Project

files-to-prompt

Concatenate a directory full of files into a single prompt

...It includes rich filtering controls, letting you limit by extension, include or skip hidden files, and ignore paths that match glob patterns or .gitignore rules. The output format is flexible: you can emit plain text, Markdown with fenced code blocks, or a Claude-XML style format designed for structured multi-file prompts. It can read file paths from stdin (including NUL-separated paths), which makes it easy to combine with find, rg, or other shell tools.

Downloads: 0 This Week

Last Update: 2025-11-27

See Project

Paper2Slides

From Paper to Presentation in One Click

Paper2Slides is an automation tool that converts research papers, reports, and other documents into polished slide decks and posters with minimal manual effort. It is designed to replace the repetitive work of turning dense technical documents into presentation-friendly structure by extracting key points, figures, and data into a coherent visual narrative. The system supports multiple input formats, so you can process PDFs and common office documents rather than being locked to a single file...

Downloads: 3 This Week

Last Update: 2026-03-15

See Project

Deep Lake

Data Lake for Deep Learning. Build, manage, and query datasets

Deep Lake (formerly known as Activeloop Hub) is a data lake for deep learning applications. Our open-source dataset format is optimized for rapid streaming and querying of data while training models at scale, and it includes a simple API for creating, storing, and collaborating on AI datasets of any size. It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos.

Downloads: 2 This Week

Last Update: 2026-02-12

See Project

Tencent-Hunyuan-Large

Open-source large language model family from Tencent Hunyuan

Tencent-Hunyuan-Large is the flagship open-source large language model family from Tencent Hunyuan, offering both pre-trained and instruct (fine-tuned) variants. It is designed with long-context capabilities, quantization support, and high performance on benchmarks across general reasoning, mathematics, language understanding, and Chinese / multilingual tasks. It aims to provide competitive capability with efficient deployment and inference. FP8 quantization support to reduce memory usage...

Downloads: 1 This Week

Last Update: 2025-09-24

See Project

LlamaIndex

Central interface to connect your LLM's with external data

LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. LlamaIndex is a simple, flexible interface between your external data and LLMs. It provides the following tools in an easy-to-use fashion. Provides indices over your unstructured and structured data for use with LLM's. These indices help to abstract away common boilerplate and pain points for in-context learning. Dealing with prompt limitations (e.g. 4096 tokens for Davinci) when...

Downloads: 1 This Week

Last Update: 2026-04-21

See Project

LLMSurvey

A Survey of Large Language Models

...The repository organizes hundreds of research papers into thematic sections that reflect the main areas of LLM research, including model architectures, training strategies, evaluation benchmarks, alignment techniques, and downstream applications. By structuring the literature in a navigable format, LLMSurvey allows researchers and practitioners to quickly explore important publications in the field without manually searching through multiple databases.

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

OneFileLLM

Specify a github or local repo, github pull request

...Instead, the entire runtime environment, model interface, and application logic are bundled together into a single executable artifact. This design allows developers to share AI tools in a format that can be easily distributed and executed across different machines without complicated installation procedures. Such packaging strategies help make AI software easier to use in educational settings, demonstrations, and lightweight deployments.

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

LongBench

LongBench v2 and LongBench (ACL 25'&24')

LongBench is a comprehensive benchmark designed to evaluate the ability of large language models to understand and reason over very long textual contexts. Traditional language model benchmarks typically evaluate tasks involving relatively short inputs, which does not reflect many real-world applications such as analyzing large documents or entire code repositories. LongBench addresses this gap by providing datasets that require models to process and reason over long sequences of text across...

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

StarVector

StarVector is a foundation model for SVG generation

StarVector is a multimodal foundation model designed for generating Scalable Vector Graphics (SVG) from images or textual descriptions. The system treats vector graphics creation as a code generation problem, producing SVG code that can render detailed vector images. Its architecture combines computer vision techniques with language modeling capabilities so it can understand visual inputs and textual prompts simultaneously. The model converts raster images or text instructions into...

Downloads: 0 This Week

Last Update: 2026-03-05

See Project

LLaMA-Mesh

Unifying 3D Mesh Generation with Language Models

LLaMA-Mesh is a research framework that extends large language models so they can understand and generate 3D mesh data alongside text. The system introduces a method for representing 3D meshes in a textual format by encoding vertex coordinates and face definitions as sequences that can be processed by a language model. By serializing 3D geometry into text tokens, the approach allows existing transformer architectures to generate and interpret 3D models without requiring specialized visual tokenizers. The project includes a supervised fine-tuning dataset composed of interleaved text and mesh data, allowing the model to learn relationships between textual descriptions and 3D structures. ...

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

AIConfig

AIConfig is a config-based framework to build generative AI apps

...AIConfig supports multiple model providers and modalities, enabling developers to experiment with different models without rewriting application logic. The configuration format is JSON-serializable and integrates with tools such as Python and Node SDKs, allowing the same configuration file to be used across multiple environments.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring

...Tasks are intentionally heterogeneous: some are multiple-choice with exact scoring, others are free-form generation judged by model-based or human evaluation. The suite provides a common JSON task format and an evaluation harness so research groups can contribute new tasks and reproduce results consistently. It emphasizes robustness analysis—looking at scale trends, calibration, and areas where models systematically fail—to guide model development beyond raw accuracy. BIG-bench is as much a community process as a dataset, encouraging open sharing of tasks and findings to keep evaluations fresh and comprehensive.

Downloads: 0 This Week

Last Update: 2025-10-09

See Project

Functionary

Chat language model that can use tools and interpret the results

...The model extends traditional chat-based language models by enabling them to determine when external functions should be called and how to extract the necessary parameters from natural language input. Function definitions are typically provided in JSON schema format, allowing the model to generate structured function calls compatible with modern tool-calling interfaces used in AI applications. Functionary can decide whether to execute tools sequentially or in parallel and can analyze the outputs of those tools to produce context-aware responses. This capability allows AI systems to interact with external services, APIs, or computation engines rather than relying solely on knowledge embedded in the model.

Downloads: 0 This Week

Last Update: 2026-03-07

See Project

Chinese Llama 2 7B

The first Chinese LLaMA2 model in the open source community

...In addition to the model weights, the repository also includes supervised fine-tuning datasets and training resources that help developers build chat-optimized versions of the model. The project follows the input format used by the LLaMA-2 chat architecture, ensuring compatibility with existing optimization techniques and tools built for the LLaMA-2 ecosystem. By releasing both the model and associated datasets, the project allows researchers and developers to experiment with Chinese language models in a fully open environment.

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

$Grade School Math$

Grade School Math

8.5K high quality grade school math problems

The grade-school-math repository (sometimes called GSM8K) is a curated dataset of 8,500 high-quality grade school math word problems intended for evaluating mathematical reasoning capabilities of language models. It is structured into 7,500 training problems and 1,000 test problems. These aren’t trivial exercises — many require multi-step reasoning, combining arithmetic operations, and handling intermediate steps (e.g. “If she sold half as many in May… how many in total?”). The problems are...

Downloads: 0 This Week

Last Update: 2025-10-03

See Project

Search Results for "format-factory"

Showing 23 open source projects for "format-factory"

LLaMA-Factory

KVCache-Factory

Guardrails

LiteLLM

Qwen3-Coder

CodeLlama

ChatGLM3

ChatGLM2-6B

files-to-prompt

Paper2Slides

Deep Lake

Tencent-Hunyuan-Large

LlamaIndex

LLMSurvey

OneFileLLM

LongBench

StarVector

LLaMA-Mesh

AIConfig

BIG-bench

Functionary

Chinese Llama 2 7B

Grade School Math

Search Results for "format-factory"

Showing 23 open source projects for "format-factory"

LLaMA-Factory

KVCache-Factory

Guardrails

LiteLLM

Qwen3-Coder

CodeLlama

ChatGLM3

ChatGLM2-6B

files-to-prompt

Paper2Slides

Deep Lake

Tencent-Hunyuan-Large

LlamaIndex

LLMSurvey

OneFileLLM

LongBench

StarVector

LLaMA-Mesh

AIConfig

BIG-bench

Functionary

Chinese Llama 2 7B

Grade School Math

Related Searches

Related Categories