Online Journalism Labservatory
Description and purpose of the Research Infrastructure
Collects articles from ten different newsportals. Specifically, ten background jobs are executed frequently and in parallel for indexing all the latest article's pages using web crawlers and RSS feeds. Then the content of the article is extracted, using artificial intelligence methods, and stored in a database.