Search Results for "python data analysis" - Page 28

Sort By:

Showing 1214 open source projects for "python data analysis"

View related business solutions

Artificial Intelligence Mac Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Earn up to 16% annual interest with Nexo.
More flexibility. More control.

Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
1

Self-hosted AI Package

Run all your local AI together in one package

Self-hosted AI Package is an open-source Docker Compose-based starter kit that makes it easy to bootstrap a full local AI and low-code development environment with commonly used open tools, empowering developers to run LLMs and AI workflows entirely on their infrastructure. The stack typically includes Ollama for running local large language models, n8n as a low-code workflow automation platform, Supabase for database and vector storage, Open WebUI for interacting with models, Flowise for...

Downloads: 1 This Week

Last Update: 2026-02-01
See Project
2

Agents Towards Production

Code-first tutorials covering every layer of GenAI agents

Agents Towards Production is an opinionated, code-first playbook for taking AI agents from prototype to production-ready systems. Instead of focusing only on toy examples, it dives into every layer of an agent stack: orchestration, memory, RAG, tool and API integration, security, observability, deployment, evaluation, and UI. The repository is built around runnable tutorials, each in its own folder, often sponsored by or built in collaboration with infrastructure providers like LangChain,...

Downloads: 0 This Week

Last Update: 1 day ago
See Project
3

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper

WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS:...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
4

NeuroMatch Academy (NMA)

NMA Computational Neuroscience course

...You will learn how to code in Python from scratch using a simple neural model, the leaky integrate-and-fire model, as a motivation. Then, you will cover linear algebra, calculus and probability & statistics. The topics covered on these days were carefully chosen based on what you need for the comp neuro course.

Downloads: 3 This Week

Last Update: 2026-03-20
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

Sa2VA

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA

Sa2VA is a cutting-edge open-source multi-modal large language model (MLLM) developed by ByteDance that unifies dense segmentation, visual understanding, and language-based reasoning across both images and videos. It merges the segmentation power of a state-of-the-art video segmentation model (based on SAM‑2) with the vision-language reasoning capabilities of a strong LLM backbone (derived from models like InternVL2.5 / Qwen-VL series), yielding a system that can answer questions about...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
6

Trae Agent

LLM-based agent for general purpose software engineering tasks

Trae Agent is an open-source, LLM-based agent system also developed by ByteDance, focused primarily on automating software engineering workflows. It provides a command-line interface (CLI) that accepts natural-language instructions (e.g. “refactor this module,” “write a unit test,” “generate a REST API skeleton”), and then orchestrates tool-based workflows — such as file editing, shell/batch commands, code generation, code formatting or refactoring — to carry out complex engineering tasks....

Downloads: 0 This Week

Last Update: 2026-02-05
See Project
7

MCP Server Chart

A visualization mcp contains 25+ visual charts

mcp-server-chart is a TypeScript Model Context Protocol (MCP) server that turns AntV’s visualization stack into agent-callable tools for automatic chart generation and lightweight data analysis. Out of the box it exposes more than 20–25 chart generators—covering staples like bar, line, area, histogram and pie, plus advanced visuals such as dual-axes, heatmaps, radar, flow and fishbone diagrams—so an AI client can request a chart and receive an image URL in return. The server can run over stdio for desktop IDEs or via SSE/“streamable” HTTP transport, making it easy to plug into MCP-capable clients and platforms (including Dify) without custom glue code. ...

Downloads: 5 This Week

Last Update: 2026-02-25
See Project
8

ChatGPT Retrieval Plugin

The ChatGPT Retrieval Plugin lets you easily find personal documents

The chatgpt-retrieval-plugin repository implements a semantic retrieval backend that lets ChatGPT (or GPT-powered tools) access private or organizational documents in natural language by combining vector search, embedding models, and plugin infrastructure. It can serve as a custom GPT plugin or function-calling backend so that a chat session can “look up” relevant documents based on user queries, inject those results into context, and respond more knowledgeably about a private knowledge...

Downloads: 1 This Week

Last Update: 2025-10-02
See Project
9

TADA

Open Source Speech Language Model

TADA is an open-source speech-language modeling framework designed to unify spoken audio and text representations within a single generative architecture. The system focuses on aligning speech and text streams using a dual-alignment mechanism that synchronizes the acoustic signal with its textual representation. By modeling both modalities together, the framework allows developers to build systems capable of generating, understanding, and transforming speech and language simultaneously. This...

Downloads: 0 This Week

Last Update: 2026-03-24
See Project
Application Monitoring That Won't Slow Your App Down
AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.

Start Free
10

Jina-Serve

Build multimodal AI applications with cloud-native stack

Jina Serve is an open-source framework designed for building, deploying, and scaling AI services and machine learning pipelines in production environments. The framework allows developers to create microservices that expose machine learning models through APIs that communicate using protocols such as HTTP, gRPC, and WebSockets. It is built with a cloud-native architecture that supports deployment on local machines, containerized environments, or large orchestration platforms such as...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
11

VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs)

VLMEvalKit is an open-source evaluation toolkit designed for benchmarking large vision-language models that combine visual understanding with natural language reasoning. The toolkit provides a unified framework that allows researchers and developers to evaluate multimodal models across a wide range of datasets and standardized benchmarks with minimal setup. Instead of requiring complex data preparation pipelines or multiple repositories for each benchmark, the system enables evaluation...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
12

video2robot

End-to-end pipeline converting generative videos

video2robot is an end-to-end open-source pipeline that converts generative video or prompt-driven motion content into executable humanoid robot motion sequences, enabling researchers and developers to go from high-level action descriptions or videos to robot-ready motion data. The pipeline supports both prompt-to-video generation using models like Veo/Sora and video upload processing, followed by human pose extraction through a 3D pose model and retargeting of that motion to robot joints...

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
13

MobileCLIP

Implementation of "MobileCLIP" CVPR 2024

MobileCLIP is a family of efficient image-text embedding models designed for real-time, on-device retrieval and zero-shot classification. The repo provides training, inference, and evaluation code for MobileCLIP models trained on DataCompDR, and for newer MobileCLIP2 models trained on DFNDR. It includes an iOS demo app and Core ML artifacts to showcase practical, offline photo search and classification on iPhone-class hardware. Project notes highlight latency/accuracy trade-offs, with...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
14

CLIP

CLIP, Predict the most relevant text snippet given an image

CLIP (Contrastive Language-Image Pretraining) is a neural model that links images and text in a shared embedding space, allowing zero-shot image classification, similarity search, and multimodal alignment. It was trained on large sets of (image, caption) pairs using a contrastive objective: images and their matching text are pulled together in embedding space, while mismatches are pushed apart. Once trained, you can give it any text labels and ask it to pick which label best matches a given...

Downloads: 0 This Week

Last Update: 2026-03-25
See Project
15

GitMCP

Turn any GitHub repository into an MCP documentation server for AI

...Its architecture retrieves documentation, analyzes code, and provides searchable access to repository information through semantic search and code analysis capabilities.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
16

Transformers in Time Series

A professionally curated list of awesome resources

Transformers in Time Series is a curated research repository that collects academic papers, code implementations, datasets, and learning resources related to transformer models for time series analysis. The project was created to systematically organize the rapidly growing research field that applies transformer architectures to time series modeling tasks. It compiles literature from major conferences and journals and categorizes them by application domains such as forecasting, anomaly detection, and classification. The repository also provides a taxonomy that helps researchers understand different architectural variations of transformers designed for time series data. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
17

Extractous

Fast and efficient unstructured data extraction

Extractous is a Rust-based unstructured data extraction library focused on fast local parsing of documents and other content-heavy files. Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
18

hls4ml

Machine learning on FPGAs using HLS

hls4ml is an open-source framework that enables machine learning models to be implemented directly on hardware such as FPGAs and ASICs using high-level synthesis techniques. The system converts trained neural network models from common machine learning frameworks into hardware description code suitable for ultra-low-latency inference. This approach allows machine learning algorithms to run directly on specialized hardware, making them suitable for applications that require extremely fast...

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
19

MuseGAN

An AI for Music Generation

MuseGAN is a deep learning research project designed to generate symbolic music using generative adversarial networks. The system focuses specifically on generating multi-track polyphonic music, meaning that it can simultaneously produce multiple instrument parts such as drums, bass, piano, guitar, and strings. Instead of generating raw audio, the model operates on piano-roll representations of music, which encode notes as time-pitch matrices for each instrument track. This representation...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
20

Advanced NLP with spaCy

Advanced NLP with spaCy: A free online course

Advanced NLP with spaCy is an open-source educational repository that provides the materials for an interactive course on advanced natural language processing using the spaCy library. The course is designed to teach developers how to build real-world NLP systems by combining rule-based techniques with machine learning models. The repository includes lessons, exercises, and examples that guide learners through tasks such as tokenization, named entity recognition, text classification, and...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
21

LangChain Extract

Did you say you like data?

LangChain Extract is an open-source reference application designed to demonstrate how large language models can be used to extract structured data from unstructured text and document files. The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents. Built using FastAPI and the LangChain framework, the application exposes a REST API that can process documents and return...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
22

DriveLM

Driving with Graph Visual Question Answering

DriveLM is a research-oriented framework and dataset designed to explore how vision-language models can be integrated into autonomous driving systems. The project introduces a new paradigm called graph visual question answering that structures reasoning about driving scenes through interconnected tasks such as perception, prediction, planning, and motion control. Instead of treating autonomous driving as a purely sensor-driven pipeline, DriveLM frames it as a reasoning problem where models...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
23

PKU Beaver

Constrained Value Alignment via Safe Reinforcement Learning

PKU Beaver is an open-source research project focused on improving the safety alignment of large language models through reinforcement learning from human feedback under explicit safety constraints. The framework introduces techniques that separate helpfulness and harmlessness signals during training, allowing models to optimize for useful responses while minimizing harmful behavior. To support this process, the project provides datasets containing human-labeled examples that encode both...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
24

WebGLM

An Efficient Web-enhanced Question Answering System

WebGLM is a web-enhanced question-answering system that combines a large language model with web search and retrieval capabilities to produce more accurate answers. The system is based on the General Language Model architecture and was designed to enable language models to interact directly with web information during the question-answering process. Instead of relying solely on knowledge stored in the model’s training data, the system retrieves relevant web content and integrates it into the...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
25

PRIME

Scalable RL solution for advanced reasoning of language models

PRIME is an open-source reinforcement learning framework designed to improve the reasoning capabilities of large language models through process-level rewards rather than relying only on final outputs. The system introduces the concept of process reinforcement through implicit rewards, allowing models to receive feedback on intermediate reasoning steps instead of evaluating only the final answer. This approach helps models learn better reasoning strategies and encourages them to generate...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project