Wikipedia and Simple Wikipedia Lead Section Pairs for Nine Categories
Data and Resources
- Categorized dataset (CSV)
  Contains nine files, each a selection of 10,000 lead section pairs.
- Model responses (CSV)
  Contains nine files, each with the model's raw and processed responses for...
- README (TXT)
- Full Dataset (ZIP)
  Folder containing the complete dataset.
Additional Information
Field | Value |
---|---|
Author | José Frederico Rodrigues, Carla Teixeira Lopes & Henrique Lopes Cardoso |
Maintainer | João A. Castro (joao.a.castro@inesctec.pt) |
Last Updated | August 9, 2024, 14:14 (UTC) |
Created | August 9, 2024, 13:41 (UTC) |
Citation | Rodrigues, J. F., Teixeira Lopes, C., & Lopes Cardoso, H. (2024). Wikipedia and Simple Wikipedia Lead Section Pairs for Nine Categories [Data set]. INESC TEC. https://doi.org/10.25747/4VC9-ZS43 |
Creation Date | May, 2024 |
DOI | doi.org/10.25747/4VC9-ZS43 |
Language | EN |
Relation | Master's thesis: Readability Assessment and Text Simplification through Open-Source Large Language Models |
Size | 454 MB |