Formats: CSV XML Licenses: Creative Commons Attribution Share-Alike Tags: named entity recognition news SIGARRA text mining portuguese