SIGARRA News Corpus NER Models for OpenNLP, Stanford CoreNLP, spaCy, NLTK

Pre-trained models for named entity recognition in Portuguese, using the following entity classes: Hora (Hour), Evento (Event), Organizacao (Organization), Curso (Course), Pessoa (Person), Localizacao (Location), Data (Date) and UnidadeOrganica (Organic Unit).
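
For downstream code, it can help to keep the eight classes in one place. The following is a minimal sketch, not part of the distributed models: it assumes the taggers emit label strings spelled exactly as in the list above, and `ENTITY_CLASSES`, `count_by_class`, and the sample pairs are hypothetical names and data introduced only for illustration.

```python
from collections import Counter

# Entity classes as listed in the description, with the English glosses
# given there. Assumption: the models emit these exact label strings.
ENTITY_CLASSES = {
    "Hora": "Hour",
    "Evento": "Event",
    "Organizacao": "Organization",
    "Curso": "Course",
    "Pessoa": "Person",
    "Localizacao": "Location",
    "Data": "Date",
    "UnidadeOrganica": "Organic Unit",
}


def count_by_class(entities):
    """Count (text, label) pairs per known class; reject unknown labels."""
    counts = Counter()
    for _text, label in entities:
        if label not in ENTITY_CLASSES:
            raise ValueError(f"unknown entity class: {label!r}")
        counts[label] += 1
    return counts


# Hypothetical tagger output, for illustration only.
sample = [("Maria Silva", "Pessoa"), ("Porto", "Localizacao"), ("2017", "Data")]
print(count_by_class(sample))
```

Rejecting unknown labels early makes it obvious if a particular toolkit's model uses a different spelling (for example, accented forms) than the one assumed here.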

Additional Info

Field: Value
Author: André Pires
Last Updated: November 15, 2018, 10:38 (Europe/Lisbon)
Created: June 13, 2017, 17:16 (Europe/Lisbon)
dc.Coverage.Spatial: Portugal
dc.Coverage.Temporal: 2016-12-14 to 2017-03-01
dc.Date: 2017
dc.Format: *.zip
dc.Format.Extent: 80 MB
dc.Language: PT
dc.Publisher: INESC TEC
dc.Relation: Master's thesis: PIRES, André (2017). Named entity recognition on Portuguese web text. Porto: Faculdade de Engenharia da Universidade do Porto.
dc.Type: Serialized Named Entity Recognition Models