Intel Extension for Transformers Files

Build your chatbot within minutes on your favorite device

This is an exact mirror of the Intel Extension for Transformers project, hosted at https://github.com/intel/intel-extension-for-transformers. SourceForge is not affiliated with Intel Extension for Transformers.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
Intel(r) Extension for Transformers v1.4.1 Release source code.tar.gz	2024-04-21	103.4 MB	0
Intel(r) Extension for Transformers v1.4.1 Release source code.zip	2024-04-21	106.3 MB	0
README.md	2024-04-21	2.2 kB	0
Totals: 3 Items		209.7 MB	0

Highlights Improvements Examples Bug Fixing

Highlights

Support Weight-only Quantization on MTL iGPU
Upgrade lm-eval to 0.4.2
Support Llama3

Improvements

Support TPP for Xeon Tensor Parallel (5f0430f )
Refine Model from_pretrained When use_neural_speed (39ecf38e )

Examples

Add vision front-end demo (1c6550 )
Add example for table extraction, and enabled multi-page table handling pipeline (db9e6fb )
Adapted textual inversion distillation for quantization example to latest transformers and diffusers packages (0ec83b1 )
Update NeuralChat Notebooks (83bb65a, 629b9d4 )

Bug Fixing

Fix QBits actshuf buf overflow under large batch (a6f3ab3 )
Fix TPP support for single socket (a690072 )
Fix retrieval dependency (281b0a3 )
Fix loading issue of woq model with parameters (37f9db25 )

Validated Configurations

Python 3.10
Ubuntu 22.04
PyTorch 2.2.0+cpu
Intel® Extension for Torch 2.2.0+cpu

Source: README.md, updated 2024-04-21

Other Useful Business Software

Full-stack observability with actually useful AI | Grafana Cloud Icon

Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account

Train ML Models With SQL You Already Know Icon

Train ML Models With SQL You Already Know

BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free

Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account

Recommended Projects

Intel LLM Library for PyTorch
Accelerate local LLM inference and finetuning
Curated Transformers
PyTorch library of curated Transformer models and their components
Transformer Engine
A library for accelerating Transformer models on NVIDIA GPUs
Transformers-Interpret
Model explainability that works seamlessly with Hugging Face
CTranslate2
Fast inference engine for Transformer models