multiple choice question free download

TigerBot

TigerBot: A multi-language multi-task LLM

...The project focuses on building high-performance models capable of handling both English and Chinese tasks while maintaining strong reasoning and conversational abilities. TigerBot models are based on modern transformer architectures and are trained on large datasets that cover multiple domains and languages. The project provides both base models and chat-optimized variants that can be used for dialogue systems, question answering, and general language understanding tasks. In addition to model weights, the repository includes training scripts, inference tools, and configuration files that allow researchers and developers to reproduce experiments or fine-tune the models for specific applications.

Downloads: 1 This Week

Last Update: 2026-03-06

See Project

Bespoke Curator

Synthetic data curation for post-training and data extraction

...The system helps developers generate, transform, and curate high-quality datasets by combining automated generation with structured validation and filtering. It supports workflows where models are used to produce synthetic examples that can later be refined into reliable training datasets for reasoning, question answering, or structured information extraction tasks. Curator includes tools for monitoring data generation processes and managing dataset quality while large batches of examples are being created. The framework also integrates with multiple inference systems and APIs, allowing users to generate data using different model providers or open-source inference engines.

Downloads: 1 This Week

Last Update: 2026-03-14

See Project

LongBench

LongBench v2 and LongBench (ACL 25'&24')

...Traditional language model benchmarks typically evaluate tasks involving relatively short inputs, which does not reflect many real-world applications such as analyzing large documents or entire code repositories. LongBench addresses this gap by providing datasets that require models to process and reason over long sequences of text across multiple tasks. The benchmark includes multiple categories such as single-document question answering, multi-document reasoning, summarization, long dialogue understanding, and code analysis. It supports bilingual evaluation in English and Chinese to assess multilingual capabilities across extended contexts. Newer versions of the benchmark introduce extremely long context windows ranging from thousands to millions of tokens, enabling researchers to test the limits of modern long-context models.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

Tongyi DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

...The model is about 30.5 billion parameters in size, though at any given token only ~3.3B parameters are active. It uses a mix of synthetic data generation, fine-tuning and reinforcement learning; supports benchmarks like web search, document understanding, question answering, “agentic” tasks; provides inference tools, evaluation scripts, and “web agent” style interfaces. The aim is to enable more autonomous, agentic models that can perform sustained knowledge gathering, reasoning, and synthesis across multiple modalities (web, files, etc.).

Downloads: 0 This Week

Last Update: 2026-02-27

See Project

RAPTOR

The official implementation of RAPTOR

...During inference, the system can navigate this hierarchical representation to retrieve information that best matches the user’s query while preserving broader contextual understanding. This approach improves question-answering performance on complex tasks that require reasoning across long documents or multiple sources.

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

Qwen-VL

Chat & pretrained large vision language model

Qwen-VL is Alibaba Cloud’s vision-language large model family, designed to integrate visual and linguistic modalities. It accepts image inputs (with optional bounding boxes) and text, and produces text (and sometimes bounding boxes) as output. The model variants (VL-Plus, VL-Max, etc.) have been upgraded for better visual reasoning, text recognition from images, fine-grained understanding, and support for high image resolutions / extreme aspect ratios. Qwen-VL supports multilingual inputs...

Downloads: 0 This Week

Last Update: 2025-09-23

See Project

YAYI

Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM

YAYI is an open-source large language model project developed to provide a multilingual conversational AI system capable of performing a wide variety of natural language processing tasks. The model is trained on diverse datasets covering multiple languages and domains so that it can support applications ranging from dialogue systems to text analysis and knowledge retrieval. The architecture is based on transformer-style language models optimized for conversational understanding and generation. In addition to producing coherent responses, the system is designed to handle tasks such as summarization, translation, question answering, and text classification. ...

Downloads: 0 This Week

Last Update: 2026-03-05

See Project

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring

...Rather than focusing on a single metric or domain, it aggregates many hand-authored tasks that test reasoning, commonsense, math, linguistics, ethics, and creativity. Tasks are intentionally heterogeneous: some are multiple-choice with exact scoring, others are free-form generation judged by model-based or human evaluation. The suite provides a common JSON task format and an evaluation harness so research groups can contribute new tasks and reproduce results consistently. It emphasizes robustness analysis—looking at scale trends, calibration, and areas where models systematically fail—to guide model development beyond raw accuracy. ...

Downloads: 0 This Week

Last Update: 2025-10-09

See Project

EvaDB

Database system for building simpler and faster AI-powered application

Over the last decade, AI models have radically changed the world of natural language processing and computer vision. They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, Open AI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task. This programming complexity prevents people who are experts in other domains from benefiting from these models. ...

Downloads: 0 This Week

Last Update: 2023-11-19

See Project

Language Models

Explore large language models in 512MB of RAM

...It is particularly useful for educational purposes, as it demonstrates the fundamental mechanics of language model inference and prompt-based applications. The repository includes multiple example applications such as chatbots, document question answering systems, and information retrieval tools.

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Search Results for "multiple choice question"

Showing 10 open source projects for "multiple choice question"

TigerBot

Bespoke Curator

LongBench

Tongyi DeepResearch

RAPTOR

Qwen-VL

YAYI

BIG-bench

EvaDB

Language Models

Search Results for "multiple choice question"

Showing 10 open source projects for "multiple choice question"

TigerBot

Bespoke Curator

LongBench

Tongyi DeepResearch

RAPTOR

Qwen-VL

YAYI

BIG-bench

EvaDB

Language Models

Related Categories