Labadain-Stopwords: A Curated List of 160 Tetun Stopwords

Labadain-Stopwords is a curated list of 160 Tetun stopwords, compiled from the Labadain-30k+ dataset and validated by native speakers. It is well-suited for various Tetun information retrieval and natural language processing tasks.The list is distributed in plain text format, with one word per line, enabling easy integration into various projects and applications.

Données et ressources

Info additionnelle

Champ Valeur
Producteur Gabriel de Jesus, Sérgio Nunes
Dernière modification avril 15, 2025, 16:23 (TU)
Créé le mars 28, 2025, 10:03 (TU)
Citation de Jesus, G., & Nunes, S. (2025). Labadain-Stopwords: A Curated List of 160 Tetun Stopwords [Data set]. INESC TEC. https://doi.org/10.25747/PG2V-KX70
DOI https://doi.org/10.25747/PG2V-KX70
Langue Tetun
Spatial Coverage Timor-Leste