Page 28 | ai model free download

Seq2seq Chatbot for Keras

A generative chatbot model based on seq2seq modeling

This repository contains a new generative chatbot model based on seq2seq modeling. The trained model available here was trained on a small dataset of ~8K pairs, each consisting of a context (the last two utterances of the dialogue up to the current point) and the corresponding response. The data were collected from dialogues in online English courses. The trained model can be fine-tuned on a closed-domain dataset for real-world applications. The canonical seq2seq model became popular in neural machine...

Downloads: 0 This Week

Last Update: 2023-03-21


Retrieval-Based Conversational Model

Dual LSTM Encoder for Dialog Response Generation

Retrieval-Based Conversational Model in Tensorflow is a project implementing a retrieval-based conversational model using a dual LSTM encoder architecture in TensorFlow, illustrating how neural networks can be trained to select appropriate responses from a fixed set of candidate replies rather than generate them from scratch. The core idea is to embed both the conversation context and potential replies into vector representations, then score how well each candidate fits the current dialogue,...
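The scoring step described above can be illustrated with a minimal sketch. It assumes the context and each candidate reply have already been encoded into fixed-size vectors (in the project this would be done by the LSTM encoders) and uses a bilinear score passed through a sigmoid, which is one common choice for dual-encoder models and not necessarily what this project does. The names `score`, `rank_candidates`, and the matrix `M` are hypothetical.

```python
import math

def score(context_vec, reply_vec, M):
    """Bilinear match score sigmoid(c^T M r); M is an assumed learned matrix."""
    # Project the reply vector with M, then take the dot product with the context.
    projected = [
        sum(M[i][j] * reply_vec[j] for j in range(len(reply_vec)))
        for i in range(len(context_vec))
    ]
    logit = sum(c * p for c, p in zip(context_vec, projected))
    return 1.0 / (1.0 + math.exp(-logit))

def rank_candidates(context_vec, candidates, M):
    """Return (score, text) pairs sorted from best to worst match."""
    scored = [(score(context_vec, vec, M), text) for text, vec in candidates]
    return sorted(scored, reverse=True)

# Toy example with 2-dimensional vectors and an identity matrix (illustrative values only).
identity = [[1.0, 0.0], [0.0, 1.0]]
context = [1.0, 0.0]
candidates = [("reply A", [1.0, 0.0]), ("reply B", [-1.0, 0.0])]
ranking = rank_candidates(context, candidates, identity)
```

Here `ranking` lists "reply A" first, because its vector points in the same direction as the context vector. In a trained model, the encoders and `M` would be learned jointly so that true responses receive high scores.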

Downloads: 0 This Week

Last Update: 2026-02-13


Malware Analysis Network in Taiwan

MAN in Taiwan, MiT

Malware Analysis Network in Taiwan (MAN in Taiwan, MiT). Please contact us (TonTon@TWMAN.ORG) if you are interested in collaborating with us. This project is open source and distributed under the GNU General Public License version 3. Please feel free to add to or modify this source and propose changes or new converters. Developed and copyrighted by: TonTon Hsien-De Huang | Prompter: Jazz Yao-Tsung Wang, Figaro Chen-Ho Yang | Logo Designer: Temaki Guo Community on...

Downloads: 0 This Week

Last Update: 2015-03-02


Leanstral

Open-source code agent designed for Lean 4

Leanstral is an open-weight large language model developed by Mistral AI and specifically designed as a code agent for the Lean 4 proof assistant, enabling advanced interaction with formal mathematics and program verification systems. The model is built to understand and generate Lean 4 code, which is used to express complex mathematical constructs as well as formal software specifications.

Downloads: 0 This Week

Last Update: 2026-03-17


Nemotron 3 Super

Open language model developed by NVIDIA as part of the Nemotron-3 family

NVIDIA-Nemotron-3-Super-120B-A12B-FP8 is a large-scale open language model developed by NVIDIA as part of the Nemotron-3 family of generative AI systems designed for advanced reasoning, conversational interaction, and agent-based workflows. The model contains approximately 120 billion parameters, but employs a Mixture-of-Experts architecture that activates only a smaller subset of parameters during inference, improving computational efficiency while maintaining high capability. ...
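The idea of activating only a subset of parameters can be sketched with generic top-k expert routing. This is an illustration of the Mixture-of-Experts technique in general, not NVIDIA's implementation: the function names, the choice of `k`, and the toy experts are all assumptions made for the example.

```python
import math

def top_k_route(gate_logits, k):
    """Pick the k experts with the highest gate logits and softmax-normalize their weights."""
    top = sorted(range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True)[:k]
    m = max(gate_logits[i] for i in top)  # subtract the max for numerical stability
    exps = {i: math.exp(gate_logits[i] - m) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}

def moe_forward(x, experts, gate_logits, k=2):
    """Evaluate only the selected experts and combine their outputs by gate weight."""
    weights = top_k_route(gate_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())

# Toy experts (hypothetical): each is a simple scalar function.
experts = [lambda x: x, lambda x: 2 * x, lambda x: 100 * x]
output = moe_forward(2.0, experts, gate_logits=[0.0, 0.0, -5.0], k=2)
```

With these toy values only the first two experts run; the third is never evaluated. That is how a model can hold many parameters in total while using only a fraction of them for each input.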

Downloads: 0 This Week

Last Update: 2026-03-13


Nemotron 3 Nano

LLM providing reasoning and conversational capabilities

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 is a mid-sized open large language model created by NVIDIA to provide strong reasoning and conversational capabilities while maintaining efficient deployment requirements. The model contains roughly 30 billion parameters and is designed to balance performance and computational efficiency, making it suitable for developers building AI applications that cannot run extremely large models.

Downloads: 0 This Week

Last Update: 2026-03-13


Mistral Small 4

Model that fuses instruct, reasoning and agentic skills

The Mistral Small 4 collection is a set of open-weight large language models developed by Mistral AI that aim to unify multiple capabilities, including instruction following, reasoning, and coding, within a single efficient architecture. These models are part of the broader Mistral Small family, which is designed to deliver strong performance across a wide range of everyday AI tasks while maintaining relatively low latency and efficient deployment requirements. The collection reflects an...

Downloads: 0 This Week

Last Update: 2026-03-17


Hunyuan-MT-7B

Tencent’s 36-language state-of-the-art translation model

Hunyuan-MT-7B is a large-scale multilingual translation model developed by Tencent, designed to deliver state-of-the-art translation quality across 36 languages, including several Chinese ethnic minority languages. It forms part of the Hunyuan Translation Model family, alongside Hunyuan-MT-Chimera, which ensembles outputs for even higher accuracy. Trained with a comprehensive framework spanning pretraining, cross-lingual pretraining, supervised fine-tuning, enhancement, and ensemble...

Downloads: 0 This Week

Last Update: 2025-09-03


DeepSeek-V3.2

High-efficiency reasoning and agentic intelligence model

...The model was notably used in competitive AI challenges such as the 2025 International Mathematical Olympiad (IMO) and IOI, achieving top-tier results. DeepSeek-V3.2 also features a large-scale agentic task synthesis pipeline, which generates training data to enhance tool-use intelligence and multi-step reasoning. It introduces a new “thinking with tools” chat template, allowing it to reason and decide when to invoke specific tools during problem solving.

Downloads: 0 This Week

Last Update: 2025-12-01


Mellum-4b-base

JetBrains’ 4B parameter code model for completions

Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs....

Downloads: 0 This Week

Last Update: 2025-09-11


QSO-Graph

Ham radio MCP servers for AI Agents — 71 tools, 11 packages

QSO-Graph is a suite of 11 MCP (Model Context Protocol) servers for amateur radio operators. It provides AI-powered access to QRZ, eQSL, LoTW, HamQTH, POTA, SOTA, IOTA, WSPR, solar weather, ADIF parsing, and HF propagation analytics. Native installers are available for Windows (InnoSetup) and Linux (RPM). All servers are also available via pip from PyPI. Source code is at github.com/qso-graph.

Downloads: 0 This Week

Last Update: 2026-03-07


gpt-oss-20b

OpenAI’s compact 20B open model for fast, agentic, and local use

...It’s released under a permissive Apache 2.0 license, allowing unrestricted commercial and research use. GPT-OSS-20B is compatible with Transformers, vLLM, Ollama, PyTorch, and other tools. It is ideal for developers building lightweight AI agents or experimenting with fine-tuning on consumer-grade hardware.

Downloads: 0 This Week

Last Update: 2025-08-05


DeepSeek-V3.2-Speciale

High-compute ultra-reasoning model surpassing GPT-5

DeepSeek-V3.2-Speciale is the high-compute, ultra-reasoning variant of DeepSeek-V3.2, designed specifically to push the boundaries of mathematical, logical, and algorithmic intelligence. It builds on the DeepSeek Sparse Attention (DSA) framework, delivering dramatically improved long-context efficiency while preserving full model quality. Unlike the standard version, Speciale is tuned exclusively for deep reasoning and therefore does not support tool-calling, focusing its full capacity on...

Downloads: 0 This Week

Last Update: 2025-12-01


Dia-1.6B

Dia-1.6B generates lifelike English dialogue and vocal expressions

Dia-1.6B is a 1.6 billion parameter text-to-speech model by Nari Labs that generates high-fidelity dialogue directly from transcripts. Designed for realistic vocal performance, Dia supports expressive features like emotion, tone control, and non-verbal cues such as laughter, coughing, or sighs. The model accepts speaker conditioning through audio prompts, allowing limited voice cloning and speaker consistency across generations. It is optimized for English and built for real-time performance...

Downloads: 0 This Week

Last Update: 2025-06-27


Search Results for "ai model" - Page 28

Showing 689 open source projects for "ai model"
