Newspaper3k
News, full-text, and article metadata extraction in Python 3
.... Although installing newspaper is simple with pip, you will run into fixable issues if you are trying to install on ubuntu. Source objects are an abstraction of online news media websites like CNN or ESPN. You can initialize them in two different ways. Building a Source will extract its categories, feeds, articles, brand, and description for you. You may also provide configuration parameters like language, browser_user_agent, and etc seamlessly.