SIGARRA News Corpus
Data och resurser
-
sigarra_news_corpus-1000-20170302T1422CSV
Comma-separated file with the following columns: news id, title, subtitle,...
-
sigarra-news-corpusZIP
Annotated news in the standoff format. Each directory represents an organic...
-
sigarra-news-corpusXML
Merged version of the individually annotated news articles, in XML format...
Mer information
| Fält | Värde |
|---|---|
| Källa | https://sigarra.up.pt |
| Författare | André Pires |
| Senast uppdaterad | September 29, 2017, 14:42 (Europe/Lisbon) |
| Skapad | Juni 13, 2017, 15:35 (Europe/Lisbon) |
| dc.Contributor | José Devezas, Sérgio Nunes |
| dc.Coverage.Spatial | Porto |
| dc.Coverage.Temporal | 2016-12-14 to 2017-03-01 |
| dc.Date | 2017 |
| dc.Format | *.csv; *.xml; *.zip |
| dc.Format.Extent | 4,22MB |
| dc.Language | PT |
| dc.Publisher | INESC TEC |
| dc.Relation | Master´s thesis: PIRES, André (2017).Named entity recognition on Portuguese web text. Porto: Faculdade de Engenharia da Universidade do Porto.http://hdl.handle.net/10216/106094 |
| dc.Type | Entity Annotated News |
