Spark NLP

Experience the power of large language models like never before, unleashing the full potential of Natural Language Processing (NLP) with Spark NLP, the open source library that delivers scalable LLMs. The full code base is open under the Apache 2.0 license, including pre-trained models and pipelines. The only NLP library built natively on Apache Spark. The most widely used NLP library in the enterprise. Spark ML provides a set of machine learning applications that can be built using two main components, estimators and transformers. The estimators have a method that secures and trains a piece of data to such an application. The transformer is generally the result of a fitting process and applies changes to the target dataset. These components have been embedded to be applicable to Spark NLP. Pipelines are a mechanism for combining multiple estimators and transformers in a single workflow. They allow multiple chained transformations along a machine-learning task.

Features

Text Preprocessing
Parsing and Analysis
Sentiment and Classification
Classification and Question Answering
Machine Translation and Generation
Integration and Interoperability (ONNX, OpenVINO)
Pre-trained Models (36000+ in +200 languages)
Multi-lingual Support

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Spark NLP

Spark NLP Web Site

User Reviews

Be the first to post a review of Spark NLP!

Additional Project Details

Registered

3 days ago

Similar Business Software

Spark NLP

Experience the power of large language models like never before, unleashing the full potential of Natural Language Processing (NLP) with Spark NLP, the open source library that delivers scalable LLMs. The full code base is open under the Apache 2.0 license, including pre-trained models and...

See Software
ChatGPT

ChatGPT is a language model developed by OpenAI. It has been trained on a diverse range of internet text, allowing it to generate human-like responses to a variety of prompts. ChatGPT can be used for various natural language processing tasks, such as question answering, conversation, and text...

See Software
GPT-4

GPT-4 (Generative Pre-trained Transformer 4) is a large-scale unsupervised language model, yet to be released by OpenAI. GPT-4 is the successor to GPT-3 and part of the GPT-n series of natural language processing models, and was trained on a dataset of 45TB of text to produce human-like text...

See Software

Report inappropriate content

Spark NLP

State of the Art Natural Language Processing

Features

Project Samples

Project Activity

Categories

License

Follow Spark NLP

User Reviews

Additional Project Details

Operating Systems

Programming Language

Related Categories

Registered