Formats: CSV, XML, ZIP
Licenses: Creative Commons Attribution Share-Alike
Tags: news, SIGARRA, text mining, named entity recognition