BERT-base-uncased is a 110-million-parameter English language model developed by Google, pretrained with masked language modeling (MLM) and next sentence prediction (NSP) on BookCorpus and English Wikipedia. It is uncased (case-insensitive) and tokenizes text with WordPiece; the MLM objective, which randomly masks 15% of input tokens and predicts them from the surrounding context, lets the model learn deep bidirectional representations that capture both semantic and syntactic patterns.

The model is intended primarily as a feature extractor or fine-tuning base for downstream NLP tasks such as sentence classification, named entity recognition, and question answering. It has been widely used as a baseline and as a component of fine-tuned systems, achieving strong results on benchmarks like GLUE. Despite its success, BERT-base-uncased can reproduce social biases present in its training data and is not designed for factual generation or open-ended text production.
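As an illustration of the MLM objective described above, the sketch below queries the pretrained masked-language-modeling head through the Hugging Face `transformers` fill-mask pipeline. It assumes `transformers` is installed with a PyTorch (or TensorFlow) backend and is a minimal example, not a prescribed workflow; the example sentence is arbitrary.

```python
# Minimal sketch: filling in a masked token with bert-base-uncased.
# Assumes the `transformers` library and a backend such as PyTorch are installed.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="bert-base-uncased")

# The model predicts the [MASK] token using context from both directions.
for prediction in unmasker("The capital of France is [MASK]."):
    print(f"{prediction['token_str']:>10}  score={prediction['score']:.3f}")
```

Each returned prediction is a candidate token for the masked position together with its probability under the MLM head.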
## Features
- 110M parameters, uncased English version
- Pretrained with masked language modeling (MLM) and next sentence prediction (NSP)
- Trained on BookCorpus and English Wikipedia
- Outputs bidirectional contextual embeddings (768-dimensional per token; see the sketch after this list)
- Strong performance on GLUE and other NLP tasks
- Compatible with multiple frameworks (PyTorch, TensorFlow, etc.)
- WordPiece tokenization with a ~30k-token vocabulary
- Openly licensed under Apache 2.0
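The following sketch shows one way to inspect the WordPiece tokenizer and extract the contextual embeddings referenced in the list above. It assumes `transformers` and `torch` are installed; the sentence and variable names are illustrative only.

```python
# Minimal sketch: extracting per-token contextual embeddings with PyTorch.
# Assumes `transformers` and `torch` are installed; the example sentence is arbitrary.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

sentence = "BERT produces contextual embeddings."

# WordPiece lowercases the input (uncased model) and may split rare words
# into subword pieces prefixed with "##".
print(tokenizer.tokenize(sentence))

inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# last_hidden_state has shape (batch_size, sequence_length, 768):
# one 768-dimensional contextual vector per input token, including [CLS] and [SEP].
print(outputs.last_hidden_state.shape)
```

For sentence-level features, the [CLS] token's vector or a mean over the token vectors is commonly used as input to a downstream classifier.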