Search Results for "python text" - Page 10

Sort By:

Showing 1480 open source projects for "python text"

View related business solutions

AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Transformers4Rec

Transformers4Rec is a flexible and efficient library

Transformers4Rec is an advanced recommendation system library that leverages Transformer models for sequential and session-based recommendations. The library works as a bridge between natural language processing (NLP) and recommender systems (RecSys) by integrating with one of the most popular NLP frameworks, Hugging Face Transformers (HF). Transformers4Rec makes state-of-the-art transformer architectures available for RecSys researchers and industry practitioners. Traditional recommendation...

Downloads: 0 This Week

Last Update: 2025-01-24
See Project
2

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Downloads: 0 This Week

Last Update: 2025-02-16
See Project
3

LLM Foundry

LLM training code for MosaicML foundation models

Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention at a cost of ~$200k. Large language models (LLMs) are changing the world, but for those outside well-resourced industry labs, it can be extremely difficult to train and deploy...

Downloads: 0 This Week

Last Update: 2025-07-29
See Project
4

Cleanlab

The standard data-centric AI package for data quality and ML

cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...

Downloads: 0 This Week

Last Update: 2025-02-27
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

spacy-transformers

Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

spaCy supports a number of transfer and multi-task learning workflows that can often help improve your pipeline’s efficiency or accuracy. Transfer learning refers to techniques such as word vector tables and language model pretraining. These techniques can be used to import knowledge from raw text into your pipeline, so that your models are able to generalize better from your annotated examples. You can convert word vectors from popular tools like FastText and Gensim, or you can load in any pre...

Downloads: 0 This Week

Last Update: 2025-05-26
See Project
6

Key-book

Proofs, cases, concept supplements, and reference explanations

The book "Introduction to Machine Learning Theory" (hereinafter referred to as "Introduction") written by Zhou Zhihua, Wang Wei, Gao Wei, and other teachers fills the regret of the lack of introductory works on machine learning theory in China. This book attempts to provide an introductory guide for readers interested in learning machine learning theory and researching machine learning theory in an easy-to-understand language. "Guide" mainly covers seven parts, corresponding to seven...

Downloads: 0 This Week

Last Update: 2025-03-26
See Project
7

Go support for Protocol Buffers

The Go support for Google's protocol buffers

Protocol buffers are Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data, think XML, but smaller, faster, and simpler. You define how you want your data to be structured once, then you can use special generated source code to easily write and read your structured data to and from a variety of data streams and using a variety of languages. Protocol buffers currently support generated code in Java, Python, Objective-C, and C++. With our new proto3...

Downloads: 0 This Week

Last Update: 2024-03-06
See Project
8

OpenFlamingo

An open-source framework for training large multimodal models

Welcome to our open source version of DeepMind's Flamingo model! In this repository, we provide a PyTorch implementation for training and evaluating OpenFlamingo models. We also provide an initial OpenFlamingo 9B model trained on a new Multimodal C4 dataset (coming soon). Please refer to our blog post for more details. This repo is still under development, and we hope to release better-performing and larger OpenFlamingo models soon. If you have any questions, please feel free to open an...

Downloads: 0 This Week

Last Update: 2023-08-15
See Project
9

doom-modeline

A fancy and fast mode-line inspired by minimalism design

A fancy and fast mode-line inspired by minimalism design. A fancy and fast mode-line inspired by minimalism design. It’s integrated into Centaur Emacs, Doom Emacs, and Spacemacs. A match count panel (for anzu, iedit, multiple-cursors, symbol-overlay, and evil-search, etc.) An indicator for recording a macro. Current environment version (e.g. python, ruby, go, etc.) in the major-mode. A customizable mode-line height (see doom-modeline-height). A minor modes segment that is compatible...

Downloads: 0 This Week

Last Update: 2023-08-15
See Project
Picsart Enterprise Background Removal API for Stunning eCommerce Visuals
Instantly remove the background from your images in just one click.

With our Remove Background API tool, you can access the transformative capabilities of automation , which will allow you to turn any photo asset into compelling product imagery. With elevated visuals quality on your digital platforms, you can captivate your audience, and therefore achieve higher engagement and sales.

Learn More
10

Sound to Text

Convert a sound file to text to analyze it

Convert a sound file to text you can analyze to spot hidden words for psychological or universal-philosophical interpretation. The program will convert a sound file to text, with the purpose of analyzing it to spot intelligible words. Use the program with songs, movie or TV dialogue, private recordings, to reveal the hidden text messages of the sound. The resulting text will be a long string of mostly repeating characters but, every so often, you will notice an intelligible word, either...

Downloads: 0 This Week

Last Update: 2023-12-09
See Project
11

Image to Text

Convert an image to text to spot intelligible words.

The program will convert to text an image, such as a photo , with the purpose of analyzing it to spot intelligible words. Use the program with photos of clouds, sea, soil, vegetation or any other photo of natural or man-made semi-homogeneous configuration, to reveal the hidden universal-philosophical messages of the image. You can also use it on photos of people or art pieces to have a psychological insight of the person portrayed or of the image author. The resulting text will be a long...

Downloads: 0 This Week

Last Update: 2024-11-21
See Project
12

UniEM

Unified embedding model

UniEM is a unified embedding model designed to create high-quality text embeddings for various natural language processing tasks.

Downloads: 0 This Week

Last Update: 2025-01-30
See Project
13

smolOS

A tiny and simple research operating system

smolOS is a minimalist 16-bit operating system written in pure Assembly for x86 real mode. Its goal is to demonstrate the fundamentals of how an OS boots, handles input, and displays output using only the most essential code. smolOS fits in a single boot sector and runs directly on BIOS-compatible hardware or emulators like QEMU. It’s ideal for those seeking to understand bootloaders, BIOS interrupts, and ultra-low-level system programming through a practical and tiny project.

Downloads: 0 This Week

Last Update: 2025-04-03
See Project
14

doccano

Open source annotation tool for machine learning practitioners

doccano is an open-source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequence-to-sequence tasks. So, you can create labeled data for sentiment analysis, named entity recognition, text summarization and so on. Just create a project, upload data and start annotating. You can build a dataset in hours.

Downloads: 0 This Week

Last Update: 2023-07-20
See Project
15

finetuner

Task-oriented finetuning for better embeddings on neural search

...-quality embeddings for semantic search, visual similarity search, cross-modal text image search, recommendation systems, clustering, duplication detection, anomaly detection, or other uses. Bring considerable improvements to model performance, making the most out of as little as a few hundred training samples, and finish fine-tuning in as little as an hour.

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
16

Chinese-LLaMA-Alpaca-2 v2.0

Chinese LLaMA & Alpaca large language model + local CPU/GPU training

This project has open-sourced the Chinese LLaMA model and the Alpaca large model with instruction fine-tuning to further promote the open research of large models in the Chinese NLP community. Based on the original LLaMA , these models expand the Chinese vocabulary and use Chinese data for secondary pre-training, which further improves the basic semantic understanding of Chinese. At the same time, the Chinese Alpaca model further uses Chinese instruction data for fine-tuning, which...

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
17

MMOCR

OpenMMLab Text Detection, Recognition and Understanding Toolbox

MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks including key information extraction. It is part of the OpenMMLab project. The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction. The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction. The modular...

Downloads: 1 This Week

Last Update: 2023-07-04
See Project
18

Repo of Tree of Thoughts (ToT)

Implementation of "Tree of Thoughts

... of Thought approach to prompting language models and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem-solving. ToT allows LMs to perform deliberate decision-making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices.

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
19

Tokenized Text Aligner

Aligns tokens in two versions of a text with differing tokenization.

This tool performs token-by-token alignment of two versions of a text with differing tokenization by interpreting the results of a file diff (https://docs.python.org/3/library/difflib.html). It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization...

Downloads: 0 This Week

Last Update: 2024-07-31
See Project
20

Milvus Bootcamp

Dealing with all unstructured data, such as reverse image search

Milvus Bootcamp is a collection of tutorials, examples, and best practices for using Milvus, an open-source vector database designed for AI-powered similarity search and retrieval applications.

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
21

Prime QA

State-of-the-art Multilingual Question Answering research

PrimeQA is a public open source repository that enables researchers and developers to train state-of-the-art models for question answering (QA). By using PrimeQA, a researcher can replicate the experiments outlined in a paper published in the latest NLP conference while also enjoying the capability to download pre-trained models (from an online repository) and run them on their own custom data. PrimeQA is built on top of the Transformers toolkit and uses datasets and models that are directly...

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
22

PromethAI

Open-source framework that gives you AI Agents

PromethAI-Backend is a backend framework for AI-driven automation and knowledge extraction. It is designed to integrate with large language models (LLMs) to provide AI-enhanced workflows, including content generation, summarization, and data analysis.

Downloads: 0 This Week

Last Update: 2025-02-25
See Project
23

Text to Chord

Turn words into chords

Convert words and sentences to 5 note chords you can use to inspire music creation. Have fun turning your name, your city name, your friends' names, your team's name, your pet's name into wild and original harmonies that go beyond serialism and classic jazz.

Downloads: 6 This Week

Last Update: 2024-09-19
See Project
24

doccano client

A simple client for doccano API

doccano-client is a simple client wrapper for the doccano API. We're introducing a newly revamped Doccano API Client that features more Pythonic interaction as well as more testing and documentation. It also adds more regulated compatibility with specific Doccano release versions.

Downloads: 0 This Week

Last Update: 2023-06-13
See Project
25

whatsapp-chatbot-python

This library helps you easily create a Python chatbot with WhatsApp API. https://green-api.com/en/

Downloads: 1 This Week

Last Update: 2024-03-13
See Project