Bio_ClinicalBERT is a domain-specific language model tailored for clinical natural language processing (NLP), extending BioBERT with additional training on clinical notes. It was initialized from BioBERT-Base v1.0 and further pre-trained on all clinical notes from the MIMIC-III database (~880M words), which includes ICU patient records. The training focused on improving performance in tasks like named entity recognition and natural language inference within the healthcare domain. Notes were processed using rule-based sectioning and tokenized with SciSpacy. Training was done for 150,000 steps using a batch size of 32, max sequence length of 128, and a masked language modeling objective with a 0.15 mask probability. Bio_ClinicalBERT is available through Hugging Face's Transformers library for easy integration. It supports medical AI research and applications involving electronic health record understanding, clinical decision support, and biomedical information extraction.

Features

  • Pre-trained on all MIMIC-III clinical notes (~880M words)
  • Initialized from BioBERT, which was trained on PubMed and PMC data
  • Optimized for clinical NLP tasks like NER and NLI
  • Processes text using medical-specific sentence splitting (SciSpacy)
  • Compatible with Hugging Face Transformers (PyTorch, TensorFlow, JAX)
  • Masked language model with 0.15 masking probability
  • Trained with max sequence length of 128 for real-world clinical note length
  • Licensed under MIT, supporting open and flexible usage

Project Samples

Project Activity

See All Activity >

Categories

AI Models

Follow Bio_ClinicalBERT

Bio_ClinicalBERT Web Site

Other Useful Business Software
AI-powered service management for IT and enterprise teams Icon
AI-powered service management for IT and enterprise teams

Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Try it Free
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
0
0
0
1
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • Who can tell? Download does not work - cycles back to the same project page ad infinitum.
Read more reviews >

Additional Project Details

Registered

2025-07-02