ChatGLM.cpp is a C++ implementation of the ChatGLM-6B model, enabling efficient local inference without requiring a Python environment. It is optimized for running on consumer hardware.

Features

  • Provides a C++ implementation of ChatGLM-6B
  • Supports running models on CPU and GPU
  • Optimized for low-memory hardware and edge devices
  • Allows quantization for reduced resource consumption
  • Works as a lightweight alternative to Python-based inference
  • Offers real-time chatbot capabilities

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow ChatGLM.cpp

ChatGLM.cpp Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of ChatGLM.cpp!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

C++

Related Categories

C++ Large Language Models (LLM), C++ Natural Language Processing (NLP) Tool, C++ AI Models, C++ LLM Inference Tool

Registered

2025-01-21