Formats: XML ZIP Licenses: Creative Commons Attribution Share-Alike Tags: portuguese SIGARRA named entity recognition text mining