Data of real-time prediction of Wikipedia articles' quality
Data and Resources
-
ReadMeTXT
Full dataset structure information.
-
Wikipedia TitlesZIP
Lists all the titles of English Wikipedia's articles, for each quality level....
-
Wikipedia GraphZIP
Nodes and edges of Wikipedia's Network Graph, as of May 2022. Generated from...
-
Default DatasetZIP
Balanced english Wikipedia dataset used to train the prediction models. The...
-
Dataset Construction TimesZIP
This folder fully details the results of the experiments measuring feature...
-
ML Training ReportsZIP
Complete reports of the Machine Learning training phase. Each subfolder...
-
Multi-Language DatasetsZIP
These datasets were designed for assessing and comparing our model's...
Additional Info
Field | Value |
---|---|
Author | Pedro Miguel Moás, Carla Teixeira Lopes |
Last Updated | May 27, 2024, 12:44 (UTC) |
Created | June 27, 2022, 12:53 (UTC) |
Citation | Moás, P. M., Teixeira Lopes, C. (2022). Data of real-time prediction of Wikipedia articles' quality [Data set]. INESC TEC. https://doi.org/10.25747/2RDD-RC08 |
DOI | https://doi.org/10.25747/2rdd-rc08 |
dc.Created | June 2022 |
dc.File.Size | 2.631 Gb |
dc.Type | ML Training datasets and reports, diverse extracted Wikipedia information |