Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "ai data analyst" - Page 13

x

Sort By:

Relevance

Clear All Filters

OS

Windows 432
Linux 429
Mac 419
More...
BSD 194
ChromeOS 191
Mobile Operating Systems 8
Desktop Operating Systems 1

Category

Artificial Intelligence 401
Software Development 43
Business 34
Scientific/Engineering 11
System 11
Internet 9
Education 8
Multimedia 7
Database 6
Formats and Protocols 6
Blockchain 5
Security 5
Games 2
Productivity 1

License

OSI-Approved Open Source 411
Creative Commons Attribution License 5
Other License 5
GNU Free Documentation License 1

Translations

English 13
Chinese (Simplified) 4
Spanish 4
French 3
More...
German 3
Chinese (Traditional) 2
Czech 2
Croatian 1
Dutch 1
Japanese 1
Russian 1

Programming Language

Python 464
TypeScript 24
JavaScript 23
Unix Shell 17
C++ 10
More...
C 7
Go 7
Java 4
Rust 3
PL/SQL 2
PowerShell 2
Ruby 2
C# 1
Julia 1
Lua 1
Perl 1
PHP 1
R 1
Yacc 1
Zope 1

Status

Production/Stable 13
Beta 8
Alpha 2
Mature 2
More...
Pre-Alpha 1

Showing 464 open source projects for "ai data analyst"

View related business solutions

Python Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

video2robot

End-to-end pipeline converting generative videos

video2robot is an end-to-end open-source pipeline that converts generative video or prompt-driven motion content into executable humanoid robot motion sequences, enabling researchers and developers to go from high-level action descriptions or videos to robot-ready motion data. The pipeline supports both prompt-to-video generation using models like Veo/Sora and video upload processing, followed by human pose extraction through a 3D pose model and retargeting of that motion to robot joints...

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
2

MobileCLIP

Implementation of "MobileCLIP" CVPR 2024

MobileCLIP is a family of efficient image-text embedding models designed for real-time, on-device retrieval and zero-shot classification. The repo provides training, inference, and evaluation code for MobileCLIP models trained on DataCompDR, and for newer MobileCLIP2 models trained on DFNDR. It includes an iOS demo app and Core ML artifacts to showcase practical, offline photo search and classification on iPhone-class hardware. Project notes highlight latency/accuracy trade-offs, with...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
3

CLIP

CLIP, Predict the most relevant text snippet given an image

CLIP (Contrastive Language-Image Pretraining) is a neural model that links images and text in a shared embedding space, allowing zero-shot image classification, similarity search, and multimodal alignment. It was trained on large sets of (image, caption) pairs using a contrastive objective: images and their matching text are pulled together in embedding space, while mismatches are pushed apart. Once trained, you can give it any text labels and ask it to pick which label best matches a given...

Downloads: 0 This Week

Last Update: 2026-03-25
See Project
4
$Book3_Elements-of-Mathematics$

Book3_Elements-of-Mathematics

From Addition, Subtraction, Multiplication, and Division to ML

Book3_Elements-of-Mathematics is an open learning resource in the Visualize-ML collection that introduces core mathematical foundations required for modern data science and AI. The repository presents topics such as algebra, calculus fundamentals, and mathematical reasoning using a highly visual and beginner-friendly approach. Its goal is to reduce the intimidation barrier often associated with formal mathematics by combining diagrams, structured explanations, and applied examples. The content is organized progressively so learners can build confidence before moving into more advanced quantitative subjects. ...

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

RecAI

Bridging LLM and Recommender System

RecAI is an open-source research platform developed by Microsoft to explore how large language models can be integrated into modern recommender systems. Traditional recommender systems rely on structured behavioral data such as user interactions and item embeddings, while large language models excel at understanding language and reasoning about user preferences. RecAI aims to bridge these two domains by creating architectures and training methods that allow LLMs to function as intelligent...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
6

Perception Models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models

Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile,...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
7

CoreNet

CoreNet: A library for training deep neural networks

CoreNet is Apple’s internal deep learning framework for distributed neural network training, designed for high scalability, low-latency communication, and strong hardware efficiency. It focuses on enabling large-scale model training across clusters of GPUs and accelerators by optimizing data flow and parallelism strategies. CoreNet provides abstractions for data, tensor, and pipeline parallelism, allowing models to scale without code duplication or heavy manual configuration. Its distributed runtime manages synchronization, load balancing, and mixed-precision computation to maximize throughput while minimizing communication bottlenecks. ...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
8

ChatGPT Retrieval Plugin

The ChatGPT Retrieval Plugin lets you easily find personal documents

The chatgpt-retrieval-plugin repository implements a semantic retrieval backend that lets ChatGPT (or GPT-powered tools) access private or organizational documents in natural language by combining vector search, embedding models, and plugin infrastructure. It can serve as a custom GPT plugin or function-calling backend so that a chat session can “look up” relevant documents based on user queries, inject those results into context, and respond more knowledgeably about a private knowledge...

Downloads: 1 This Week

Last Update: 2025-10-02
See Project
9

Simple StyleGan2 for Pytorch

Simplest working implementation of Stylegan2

Simple Pytorch implementation of Stylegan2 that can be completely trained from the command-line, no coding needed. You will need a machine with a GPU and CUDA installed. You can also specify the location where intermediate results and model checkpoints should be stored. You can increase the network capacity (which defaults to 16) to improve generation results, at the cost of more memory. By default, if the training gets cut off, it will automatically resume from the last checkpointed file....

Downloads: 1 This Week

Last Update: 2025-01-12
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
10

CUDA Python

Performance meets Productivity

...It integrates tightly with the broader Python GPU ecosystem, including Numba for kernel compilation and CCCL for parallel primitives, allowing developers to write performant code without leaving Python. The toolkit also includes utilities for profiling, memory management, distributed computing, and numerical operations, making it suitable for scientific computing, AI, and data processing workloads.

Downloads: 5 This Week

Last Update: 4 days ago
See Project
11

Cosmos-RL

Cosmos-RL is a flexible and scalable Reinforcement Learning framework

Cosmos-RL is a scalable reinforcement learning framework designed specifically for physical AI systems such as robotics, autonomous agents, and multimodal models. It provides a distributed training architecture that separates policy learning and environment rollout processes, enabling efficient and asynchronous reinforcement learning at scale. The framework supports multiple parallelism strategies, including tensor, pipeline, and data parallelism, allowing it to leverage large GPU clusters effectively. ...

Downloads: 8 This Week

Last Update: 2026-03-31
See Project
12

TimesFM

Pretrained time-series foundation model developed by Google Research

TimesFM is a pretrained time-series foundation model from Google Research built for forecasting tasks, designed to generalize across many domains without requiring extensive per-dataset retraining. It provides a decoder-only model approach to forecasting, aiming for strong performance even in zero-shot or low-data settings where traditional models often struggle. The project includes code and an inference API intended to make it practical to run forecasts programmatically, with options to...

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
13

DINOv2

PyTorch code and models for the DINOv2 self-supervised learning

DINOv2 is a self-supervised vision learning framework that produces strong, general-purpose image representations without using human labels. It builds on the DINO idea of student–teacher distillation and adapts it to modern Vision Transformer backbones with a carefully tuned recipe for data augmentation, optimization, and multi-crop training. The core promise is that a single pretrained backbone can transfer well to many downstream tasks—from linear probing on classification to retrieval,...

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
14

claude-code-transcripts

Tools for publishing transcripts for Claude Code sessions

claude-code-transcripts is a command-line utility that takes session files exported from Claude Code (in JSON or JSONL format) and turns them into clean, navigable HTML transcripts that can be viewed in any modern web browser. It is designed to make the often dense and verbose outputs from AI coding sessions easier to read, share, and archive by breaking conversations into paginated, annotated pages with navigable timelines of prompts and responses. Users can run this tool locally or fetch...

Downloads: 10 This Week

Last Update: 2026-01-30
See Project
15

All-in-RAG

Big Model Application Development Practice 1

...The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. The project also explores advanced topics such as hybrid retrieval methods, query optimization, and evaluation techniques for improving system accuracy. Alongside theoretical explanations, the repository includes hands-on exercises and example projects that demonstrate how to build production-ready RAG systems. ...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
16

Agents 2.0

An Open-source Framework for Data-centric Language Agents

Agents is an open-source framework designed to build and train autonomous language agents through a data-centric and learning-oriented architecture. The project introduces a concept known as agent symbolic learning, which treats an agent pipeline similarly to a neural network computational graph. In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
17

OmAgent

Build multimodal language agents for fast prototype and production

OmAgent is an open-source Python framework designed to simplify the development of multimodal language agents that can reason, plan, and interact with different types of data sources. The framework provides abstractions and infrastructure for building AI agents that operate on text, images, video, and audio while maintaining a relatively simple interface for developers. Instead of forcing developers to implement complex orchestration logic manually, the system manages task scheduling, worker coordination, and node optimization behind the scenes. ...

Downloads: 8 This Week

Last Update: 2026-03-05
See Project
18

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
19

DeepSeed

Deep learning optimization library making distributed training easy

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. DeepSpeed delivers extreme-scale model training for everyone, from data scientists training on massive supercomputers to those training on low-end clusters or even on a single GPU. Using current generation of GPU clusters with hundreds of devices, 3D parallelism of DeepSpeed can efficiently train deep learning models with trillions of parameters. With just a single GPU,...

Downloads: 4 This Week

Last Update: 2026-03-30
See Project
20

iX

Autonomous GPT-4 agent platform

IX is a platform for designing and deploying autonomous and [semi]-autonomous LLM-powered agents and workflows. IX provides a flexible and scalable solution for delegating tasks to AI-powered agents. Agents created with the platform can automate a wide variety of tasks while running in parallel and communicating with each other.

Downloads: 5 This Week

Last Update: 2024-09-02
See Project
21

Agent Behavior Monitoring

The open source post-building layer for agents

Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance analysis. ...

Downloads: 5 This Week

Last Update: 3 days ago
See Project
22

Parallax

Parallax is a distributed model serving framework

Parallax is a decentralized inference framework designed to run large language models across distributed computing resources. Instead of relying on centralized GPU clusters in data centers, the system allows multiple heterogeneous machines to collaborate in serving AI inference workloads. Parallax divides model layers across different nodes and dynamically coordinates them to form a complete inference pipeline. A two-stage scheduling architecture determines how model layers are allocated to available hardware and how requests are routed across nodes during execution. ...

Downloads: 3 This Week

Last Update: 2026-03-09
See Project
23

MetricFlow

MetricFlow allows you to define, build, and maintain metrics in code

...Because metric definitions live centrally, you avoid duplication across teams and tools, reduce risk of inconsistent numbers, and make it easier to audit and evolve the logic over time. The project emphasizes explainability, performance and portability: you define metrics once and then they can be consumed in BI tools, notebooks, or even AI/agent-driven workflows.

Downloads: 0 This Week

Last Update: 2025-10-28
See Project
24

Jina

Build cross-modal and multimodal applications on the cloud

...Fast deployment to Kubernetes, Docker Compose and Jina Cloud. Improved engineering efficiency thanks to the Jina AI ecosystem, so you can focus on innovating with the data applications you build.

Downloads: 0 This Week

Last Update: 2024-11-12
See Project
25

Conversational Health Agents (CHA)

A Personalized LLM-powered Agent Frameworks

CHA, or Conversational Health Agents, is an open-source framework designed to build intelligent healthcare assistants powered by large language models and external data sources. The system enables developers to create personalized AI agents that can interact with users through natural language while performing multi-step reasoning and task execution. It integrates orchestration capabilities that allow the agent to gather information from APIs, knowledge bases, and external services in order to generate more accurate and context-aware responses. ...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project

Previous
9
10
11
12
You're on page 13
14
15
16
17
Next

Related Searches

chatgpt

ocr

ai

3d

video

self-learning ai

ai code

Related Categories

Artificial Intelligence

Software Development

Business

Scientific/Engineering

System

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise