Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stanford NLP Python library for many human languages
Convert files like docx, xlsx, pptx, html, and more to MarkDown
csv2odf can convert csv data to formatted spreadsheets and documents.
Docx-2-PDF: The Converter [Improved.Simplified.Alternative]
fastNLP: A Modularized and Extensible NLP Framework