Dataset_2

URL: https://rdm.inesctec.pt/dataset/e8ad6ce4-546a-4205-b4bb-4b50df2ec9a6/resource/b59a34f4-ab55-424e-a980-f81bd9d460f7/download/dataset2.csv

Dataset, with the following attributes:

id: It is assigned incrementally and takes into account ""unprocessed"" articles, i.e. articles that are only part of the links;

lang: the language of that article;

rank: rank position in the top 1000 list;

assess: assess score according to WikiProject Medicine;

Importance: importance according to WikiProject Medicine;

numUniqueEditors: The number of single editors corresponds to the different authors of the article's editions;

numEdits: number of editions made to the article;

connectivity: number of articles linked to a particular article through common editors;

numAnonEdits: number of editions made by anonymous editors;

numRegisteredEdits: number of editions made by registered editors;

numExternalLinks: links present throughout the article that refer to content external to Wikipedia;

numReverts: number of reversals made to editions prior to articles;

numBrokenLinks: links that point to currently unavailable pages;

numInnerLinks: links present in the text of articles and refer to other pages within the Wikipedia;

articleLength: text size in number of characters;

flesch: Flesch reading ease score. [-1] when not assessed;

kincaid: Kincaid score. [-1] when not assessed;

infoNoise: ratio between the amount of information after stemming and stopping and the size of the article before it is processed;

diversity: ratio between the number of unique publishers and the number of edits article totals;

numImages: number of images in article;

adminShare: ratio of edits made by administrators from the total number of editions;

age: article´s age, in days;

medianRevertTime: median of the reversal times of the different editions of the articles;

authority: auhtority computed score;

completeness: completeness computed score;

complexity: complexity computed score;

informativeness: informativeness computed score;

consistency: consistency computed score;

volatility: volatility computed score;

currency: currency computed score;

numInfobox: number of health-related infoboxes in article;

InfoboxValues: number of values in healt-related infoboxes in article;

InfoboxImages: number of images in healt-related infoboxes in article;

numTemplates: number of health-related templates in article;

numWpEdits: number of editions made by WikiProject Medicine admins;

wpmShare: ratio of editions made by WikiProject Medicine admins amomg the total editions;

translated: 1 if article was translated by the Healthcare Translation Task Force, 0 otherwise;

codes: number of medical codes in article's templates;

repLinks: number of links with a reputated source;

HAuthority: HealthAuthority computed score;

HCompleteness: HealthCompleteness computed score;

HInformativeness: HealthInformativeness computed score;

HConsistency: HelathConsistency computed score.

Embed

This resource view is not available at the moment. Click here for more information.

Download resource

Additional Information

Field Value
Last updated September 13, 2021
Created September 13, 2021
Format CSV
License Creative Commons Attribution Share-Alike
created6 days ago
formatCSV
has views1
idb59a34f4-ab55-424e-a980-f81bd9d460f7
last modified6 days ago
on same domain1
package ide8ad6ce4-546a-4205-b4bb-4b50df2ec9a6
revision id93e4736e-a698-4f77-bfc0-b1c43f751d39
stateactive
url typeupload