SIGARRA News Corpus NER Models for OpenNLP, Stanford CoreNLP, spaCy, NLTK

Pre-trained models for named entity recognition in Portuguese, using the following entity classes: Hora (Hour), Evento (Event), Organizacao (Organization), Curso (Course), Pessoa (Person), Localizacao (Location), Data (Date) and UnidadeOrganica (Organic Unit).
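
As a rough illustration of the label set above, the eight classes can be kept in a small lookup table when post-processing model output. This is only a sketch: the `ENTITY_CLASSES` name, the `gloss` helper, and the sample pairs are hypothetical and not part of the dataset, and the exact label strings emitted by each toolkit's model should be checked against the files in the archive.

```python
# Hypothetical lookup table: Portuguese class name -> English gloss,
# copied from the dataset description above.
ENTITY_CLASSES = {
    "Hora": "Hour",
    "Evento": "Event",
    "Organizacao": "Organization",
    "Curso": "Course",
    "Pessoa": "Person",
    "Localizacao": "Location",
    "Data": "Date",
    "UnidadeOrganica": "Organic Unit",
}


def gloss(entities):
    """Attach the English gloss to (text, label) pairs.

    Pairs whose label is not one of the eight classes are skipped.
    """
    return [
        (text, label, ENTITY_CLASSES[label])
        for text, label in entities
        if label in ENTITY_CLASSES
    ]


# Made-up sample pairs, standing in for the output of any of the four toolkits.
sample = [("FEUP", "UnidadeOrganica"), ("14h30", "Hora"), ("xyz", "Outro")]
print(gloss(sample))
```

Keeping the mapping in one place makes it easier to normalize output across OpenNLP, Stanford CoreNLP, spaCy and NLTK if their models spell the labels differently.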

Data and Resources

Additional Information

Field
Source https://rdm.inesctec.pt/dataset/cs-2017-004
Author André Pires
Last Updated November 15, 2018, 10:38 AM (UTC+00:00)
Created June 13, 2017, 4:16 PM (UTC+00:00)
dc.Coverage.Spatial Portugal
dc.Coverage.Temporal 2016-12-14 to 2017-03-01
dc.Date 2017
dc.Format *.zip
dc.Format.Extent 80MB
dc.Language PT
dc.Publisher INESC TEC
dc.Relation Master's thesis: PIRES, André (2017). Named entity recognition on Portuguese web text. Porto: Faculdade de Engenharia da Universidade do Porto. http://hdl.handle.net/10216/106094
dc.Type Serialized Named Entity Recognition Models