Search Results for "python programming language" - Page 2

Showing 57 open source projects for "python programming language"

1

Janus-Pro

Janus-Series: Unified Multimodal Understanding and Generation Models

Janus is a unified multimodal model for both multimodal understanding and generation. It decouples visual encoding so that visual understanding is handled separately from generation, which improves flexibility and performance. Built on a single transformer architecture, Janus surpasses previous models, including specialized task-specific ones, in its ability to handle diverse multimodal inputs and generate high-quality...

1 Review

Downloads: 3 This Week

Last Update: 2025-03-04
See Project
2

ChatGLM2-6B

An Open Bilingual Chat LLM

ChatGLM2-6B is an advanced open-source bilingual dialogue model developed by THUDM. It is the second iteration of the ChatGLM series, designed to offer enhanced performance while maintaining the strengths of its predecessor, including smooth conversation flow and low deployment barriers. The model is fine-tuned for both Chinese and English languages, making it a versatile tool for various multilingual applications. ChatGLM2-6B aims to push the boundaries of natural language understanding...

Downloads: 1 This Week

Last Update: 2025-03-03
See Project
3

GLM-4-32B-0414

Open Multilingual Multimodal Chat LMs

GLM-4-32B-0414 is a powerful open-source large language model featuring 32 billion parameters, designed to deliver performance comparable to leading models like OpenAI’s GPT series. It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced...

Downloads: 1 This Week

Last Update: 2025-06-27
See Project
4

LaMDA-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch

An open-source implementation of the pre-training architecture from Google's LaMDA research paper, written in PyTorch. The totally not sentient AI. This repository will cover the 2B-parameter implementation, as that is likely the largest size most people can afford to train. Google's 2022 blog post describes LaMDA in detail, and an earlier 2021 blog post also covers the model.

Downloads: 0 This Week

Last Update: 2023-03-25
See Project
5

GPT Neo

An implementation of model parallel GPT-2 and GPT-3-style models

An implementation of model- and data-parallel GPT-3-like models using the mesh-tensorflow library. If you're just here to play with our pre-trained models, we strongly recommend you try out the HuggingFace Transformer integration. Training and inference are officially supported on TPU and should work on GPU as well. This repository will be (mostly) archived as we move focus to our GPU-specific repo, GPT-NeoX. NB, while neo can technically run a training step at 200B+ parameters, it is very...

Downloads: 8 This Week

Last Update: 2023-03-24
See Project
6

starcoder

Code generation model trained on 80+ languages with FIM support

StarCoder is a 15.5B parameter language model developed by BigCode for code generation tasks across more than 80 programming languages. It is trained on 1 trillion tokens from the permissively licensed dataset The Stack v1.2, using the Fill-in-the-Middle (FIM) objective and Multi-Query Attention to enhance performance. With an extended context window of 8192 tokens and pretraining in bfloat16, StarCoder can generate, complete, or refactor code in various languages, with English as the primary...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
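
The Fill-in-the-Middle objective mentioned in the StarCoder entry can be illustrated with a small prompt builder. This is a minimal sketch, not the project's own code: the three sentinel strings are assumed to match the StarCoder tokenizer and should be checked against the model card, and `build_fim_prompt` is a hypothetical helper name.

```python
# Sentinel strings assumed to match the StarCoder tokenizer (verify on the model card).
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange the code before and after a gap in prefix-suffix-middle order.

    The model is then expected to generate the missing middle after FIM_MIDDLE.
    """
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


# Hypothetical example: ask the model to fill in the body of a return statement.
prompt = build_fim_prompt(
    prefix="def add(a, b):\n    return ",
    suffix="\n\nprint(add(1, 2))\n",
)
```

The resulting string would be passed to the model as an ordinary prompt; whatever it generates is the candidate text for the gap.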
7

bloom

Multilingual 176B language model for text and code generation tasks

BLOOM (BigScience Large Open-science Open-access Multilingual Language Model) is a 176-billion parameter autoregressive language model developed by the BigScience Workshop. It generates coherent text in 46 natural languages and 13 programming languages, making it one of the most multilingual LLMs publicly available. BLOOM was trained on 366 billion tokens using Megatron-DeepSpeed and large-scale computational resources. It can perform various tasks via prompt-based learning, even without task...

Downloads: 0 This Week

Last Update: 2025-06-26
See Project
8

phi-2

Small, high-performing language model for QA, chat, and code tasks

Phi-2 is a 2.7 billion parameter Transformer model developed by Microsoft, designed for natural language processing and code generation tasks. It was trained on a filtered dataset of high-quality web content and synthetic NLP texts created by GPT-3.5, totaling 1.4 trillion tokens. Phi-2 excels in benchmarks for common sense, language understanding, and logical reasoning, outperforming most models under 13B parameters despite not being instruction-tuned or aligned via RLHF. It performs best...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
9

GPT-2

GPT-2 is a 124M parameter English language model for text generation

GPT-2 is a pretrained transformer-based language model developed by OpenAI for generating natural language text. Trained on 40GB of internet data from outbound Reddit links (excluding Wikipedia), it uses causal language modeling to predict the next token in a sequence. The model was trained without human labels and learns representations of English that support text generation, feature extraction, and fine-tuning. GPT-2 uses a byte-level BPE tokenizer with a vocabulary of 50,257 and handles...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
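
The 50,257-entry vocabulary quoted in the GPT-2 entry can be broken down with a little arithmetic. The sketch below assumes the commonly documented split of 256 base byte symbols, 50,000 learned merges, and one end-of-text token; that split is not stated in the listing itself. `to_byte_symbols` is a hypothetical helper that shows only the byte-level starting point, not the merge step.

```python
def to_byte_symbols(text: str) -> list[int]:
    """Return the UTF-8 byte values that a byte-level BPE starts from."""
    return list(text.encode("utf-8"))


# Assumed breakdown of the GPT-2 vocabulary (not given in the listing).
BASE_BYTE_SYMBOLS = 256   # every possible byte value, so no input is out of vocabulary
LEARNED_MERGES = 50_000   # merged byte sequences learned during tokenizer training
SPECIAL_TOKENS = 1        # the end-of-text marker

vocab_size = BASE_BYTE_SYMBOLS + LEARNED_MERGES + SPECIAL_TOKENS

# A non-ASCII character expands to several bytes before any merges apply.
symbols = to_byte_symbols("h\u00e9llo")
```

Because the base alphabet is bytes rather than characters, any Unicode string can be encoded without an unknown-token fallback.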
10

OpenVLA 7B

Vision-language-action model for robot control via images and text

OpenVLA 7B is a multimodal vision-language-action model trained on 970,000 robot manipulation episodes from the Open X-Embodiment dataset. It takes camera images and natural language instructions as input and outputs normalized 7-DoF robot actions, enabling control of multiple robot types across various domains. Built on top of LLaMA-2 and DINOv2/SigLIP visual backbones, it allows both zero-shot inference for known robot setups and parameter-efficient fine-tuning for new domains. The model...

Downloads: 0 This Week

Last Update: 2025-07-23
See Project
11

DeepSeek-R1-0528

DeepSeek-R1-0528 is a powerful reasoning-focused LLM with 64K context

DeepSeek-R1-0528 is an upgraded large language model developed by DeepSeek AI, designed to improve deep reasoning, inference, and programming capabilities. With a context length of up to 64K tokens and 685 billion parameters, it introduces enhanced algorithmic optimizations and expanded token usage per task. Compared to previous versions, it significantly improves benchmark scores in math (e.g., AIME 2025: 87.5%), logic, and coding tasks like LiveCodeBench and SWE Verified. It supports system...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
12

ERNIE-4.5-300B-A47B-FP8-Paddle

ERNIE 4.5 MoE model in FP8 for efficient high-performance inference

ERNIE-4.5-300B-A47B-FP8-Paddle is a quantized version of Baidu’s MoE large language model, post-trained for text generation tasks and optimized for FP8 precision. This variant retains the original’s 300 billion total parameters with 47 billion active per token, enabling powerful language understanding while dramatically improving inference efficiency. Built using PaddlePaddle, it supports multi-GPU distributed deployment and leverages advanced routing strategies and expert parallelism...

Downloads: 0 This Week

Last Update: 2025-06-30
See Project
13

Devstral

Agentic 24B LLM optimized for coding tasks with 128k context support

..., compatible with frameworks like vLLM, Transformers, llama.cpp, and Ollama. It is licensed under Apache 2.0 and is fully open for commercial and non-commercial use. Its Tekken tokenizer allows a 131k vocabulary size for high flexibility in programming languages and natural language inputs. Devstral is the preferred backend for OpenHands, where it acts as the reasoning engine for autonomous code agents.

Downloads: 0 This Week

Last Update: 2025-07-14
See Project
14

DeepSeek-V3-0324

Advanced multilingual LLM with enhanced reasoning and code generation

DeepSeek-V3-0324 is a powerful large language model by DeepSeek AI that significantly enhances performance over its predecessor, especially in reasoning, programming, and Chinese language tasks. It achieves major benchmark improvements, such as +5.3 on MMLU-Pro and +19.8 on AIME, and delivers more executable, aesthetically improved front-end code. Its Chinese writing and search-answering capabilities have also been refined, generating more fluent, contextually aware long-form outputs. Key...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
15

stable-diffusion-v1-4

Text-to-image diffusion model for high-quality image generation

stable-diffusion-v1-4 is a high-performance text-to-image latent diffusion model developed by CompVis. It generates photo-realistic images from natural language prompts using a pretrained CLIP ViT-L/14 text encoder and a UNet-based denoising architecture. This version builds on v1-2, fine-tuned over 225,000 steps at 512×512 resolution on the “laion-aesthetics v2 5+” dataset, with 10% text-conditioning dropout for improved classifier-free guidance. It is optimized for use with Hugging Face’s...

Downloads: 0 This Week

Last Update: 2025-06-26
See Project
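
The 10% text-conditioning dropout in the stable-diffusion-v1-4 entry exists to support classifier-free guidance. The sketch below uses the standard guidance formulation rather than anything taken from the listing. Noise predictions are stood in for by plain Python lists, and the function names are hypothetical.

```python
import random


def maybe_drop_caption(caption: str, drop_prob: float = 0.10, rng=random) -> str:
    """During training, replace the caption with an empty prompt with probability drop_prob."""
    return "" if rng.random() < drop_prob else caption


def guided_noise(eps_uncond, eps_cond, guidance_scale):
    """Combine unconditional and conditional noise predictions at sampling time.

    Standard classifier-free guidance: eps = eps_uncond + s * (eps_cond - eps_uncond).
    """
    return [u + guidance_scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


# Toy vectors standing in for the UNet's two noise predictions.
combined = guided_noise([0.0, 1.0], [1.0, 3.0], guidance_scale=2.0)
```

A scale of 1.0 reproduces the conditional prediction, and larger values push the sample further toward the text prompt.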
16

Llama-2-7b-chat-hf

Dialogue-optimized 7B language model for safe and helpful chatting

Llama-2-7b-chat-hf is a fine-tuned large language model developed by Meta, designed specifically for dialogue use cases. With 7 billion parameters and built on an optimized transformer architecture, it uses supervised fine-tuning and reinforcement learning with human feedback (RLHF) to enhance helpfulness, coherence, and safety. It outperforms most open-source chat models and rivals proprietary systems like ChatGPT in human evaluations. Trained on 2 trillion tokens of public text and over 1...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
17

Llama-2-7b

7B-parameter foundational LLM by Meta for text generation tasks

Llama-2-7B is a foundational large language model developed by Meta as part of the Llama 2 family, designed for general-purpose text generation in English. It has 7 billion parameters and uses an optimized transformer-based, autoregressive architecture. Trained on 2 trillion tokens of publicly available data, it serves as the base for fine-tuned models like Llama-2-Chat. The model is pretrained only, meaning it is not optimized for dialogue but can be adapted for various natural language...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
18

chatglm-6b

Bilingual 6.2B parameter chatbot optimized for Chinese and English

ChatGLM-6B is a 6.2 billion parameter bilingual language model developed by THUDM, based on the General Language Model (GLM) framework. It is optimized for natural and fluent dialogue in both Chinese and English, supporting applications in conversational AI, question answering, and assistance. Trained on approximately 1 trillion tokens, the model benefits from supervised fine-tuning, feedback self-training, and reinforcement learning with human feedback to align its outputs with human...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
19

gpt-oss-20b

OpenAI’s compact 20B open model for fast, agentic, and local use

GPT-OSS-20B is OpenAI’s smaller, open-weight language model optimized for low-latency, agentic tasks, and local deployment. With 21B total parameters and 3.6B active parameters (MoE), it fits within 16GB of memory thanks to native MXFP4 quantization. Designed for high-performance reasoning, it supports Harmony response format, function calling, web browsing, and code execution. Like its larger sibling (gpt-oss-120b), it offers adjustable reasoning depth and full chain-of-thought visibility...

Downloads: 0 This Week

Last Update: 2025-08-05
See Project
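
The claim in the gpt-oss-20b entry that 21B parameters fit within 16GB can be sanity-checked with a rough estimate. This sketch makes two simplifying assumptions that the listing does not state: every parameter is stored in MXFP4, and MXFP4 costs 4 bits per element plus one shared 8-bit scale per 32-element block. Activations and the KV cache are ignored.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes (10**9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumed MXFP4 cost: 4-bit elements plus one 8-bit scale shared by 32 elements.
MXFP4_BITS = 4 + 8 / 32   # 4.25 bits per parameter
BF16_BITS = 16

TOTAL_PARAMS = 21e9       # total parameter count quoted in the listing

mxfp4_gb = weight_memory_gb(TOTAL_PARAMS, MXFP4_BITS)
bf16_gb = weight_memory_gb(TOTAL_PARAMS, BF16_BITS)
```

Under these assumptions the quantized weights come to roughly 11 GB, against 42 GB in bfloat16, which is consistent with the 16GB figure while leaving headroom for runtime state.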
20

ERNIE-4.5-0.3B-Base-PT

Compact 360M text model with high efficiency and fine-tuning support

ERNIE-4.5-0.3B-Base-PT is a compact, fully dense transformer model with 360 million parameters, optimized for general-purpose text generation tasks. It belongs to the ERNIE 4.5 series by Baidu and leverages advanced pretraining techniques without relying on a Mixture-of-Experts (MoE) structure. The model features 18 transformer layers, 16 attention heads, and a maximum context length of 131,072 tokens, offering strong language understanding for its size. It can be fine-tuned using ERNIEKit...

Downloads: 0 This Week

Last Update: 2025-06-30
See Project
21

Nanonets-OCR-s

State-of-the-art image-to-markdown OCR model

Nanonets-OCR-s is an advanced image-to-markdown OCR model that transforms documents into structured and semantically rich markdown. It goes beyond basic text extraction by intelligently recognizing content types and applying meaningful tags, making the output ideal for Large Language Models (LLMs) and automated workflows. The model expertly converts mathematical equations into LaTeX syntax, distinguishing between inline and display modes for accuracy. It also generates descriptive <img> tags...

Downloads: 0 This Week

Last Update: 2025-06-26
See Project
22

whisper-large-v3

High-accuracy multilingual speech recognition and translation model

... input and better support for Cantonese, achieving up to 20% error reduction over Whisper-large-v2. It handles zero-shot transcription and translation, performs language detection automatically, and supports features like word-level timestamps and long-form audio processing. The model integrates well with Hugging Face Transformers and supports optimizations such as batching, SDPA, and Flash Attention 2.

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
23

Llama-3.1-8B-Instruct

Multilingual 8B-parameter chat-optimized LLM fine-tuned by Meta

Llama-3.1-8B-Instruct is a multilingual, instruction-tuned language model developed by Meta, designed for high-quality dialogue generation across eight languages, including English, Spanish, French, German, Italian, Portuguese, Hindi, and Thai. It uses a transformer-based, autoregressive architecture with Grouped-Query Attention and supports a 128k token context window. The model was fine-tuned using a combination of supervised fine-tuning (SFT), reinforcement learning with human feedback (RLHF...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
24

Meta-Llama-3-8B-Instruct

Instruction-tuned 8B LLM by Meta for helpful, safe English dialogue

Meta-Llama-3-8B-Instruct is an instruction-tuned large language model from Meta’s Llama 3 family, optimized for safe and helpful English dialogue. It uses an autoregressive transformer architecture with Grouped-Query Attention (GQA) and supports an 8k token context length. Fine-tuned using supervised learning and reinforcement learning with human feedback (RLHF), the model achieves strong results on benchmarks like MMLU, GSM8K, and HumanEval. Trained on over 15 trillion tokens of publicly...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
25

Mistral-7B-Instruct-v0.2

Instruction-tuned 7B model for chat and task-oriented text generation

Mistral-7B-Instruct-v0.2 is a fine-tuned version of the Mistral-7B-v0.2 language model, designed specifically for following instructions in a conversational format. It supports a 32k token context window, enabling more detailed and longer interactions compared to its predecessor. The model is trained to respond to user prompts formatted with [INST] and [/INST] tags, and it performs well in instruction-following tasks like Q&A, summarization, and explanations. It can be used via the official...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
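
The [INST] and [/INST] prompt format described in the Mistral-7B-Instruct-v0.2 entry can be sketched as a small string builder. The exact whitespace and the `<s>` and `</s>` markers are assumptions here, and `build_instruct_prompt` is a hypothetical helper. In practice the tokenizer's own chat template is the authoritative source.

```python
def build_instruct_prompt(turns):
    """Format (user, assistant) turns with [INST] tags.

    `turns` is a list of (user_message, assistant_reply) pairs; the last
    reply may be None to leave the prompt open for the model to answer.
    """
    parts = []
    for user, assistant in turns:
        parts.append(f"[INST] {user} [/INST]")
        if assistant is not None:
            # Assumed convention: each completed reply ends with an end-of-sequence marker.
            parts.append(f" {assistant}</s>")
    return "<s>" + "".join(parts)


single_turn = build_instruct_prompt([("Summarize this README.", None)])
multi_turn = build_instruct_prompt([
    ("What is BPE?", "A subword tokenization method."),
    ("Give one example.", None),
])
```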