Flexible Photo Recrafting While Preserving Your Identity
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input
Russian ASR model fine-tuned on Common Voice and CSS10 datasets
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
CTC-based forced aligner for audio-text in 158 languages