Labadain-Stopwords: A Curated List of 160 Tetun Stopwords

Labadain-Stopwords is a curated list of 160 Tetun stopwords, compiled from the Labadain-30k+ dataset and validated by native speakers. It is well-suited for various Tetun information retrieval and natural language processing tasks.The list is distributed in plain text format, with one word per line, enabling easy integration into various projects and applications.

ទិន្នន័យ និងធនធាន

Additional Info

Field Value
Author Gabriel de Jesus, Sérgio Nunes
Last Updated មេសា 21, 2025, 09:04 (UTC)
Created មីនា 28, 2025, 10:03 (UTC)
Citation de Jesus, G., & Nunes, S. (2025). Labadain-Stopwords: A Curated List of 160 Tetun Stopwords [Data set]. INESC TEC. https://doi.org/10.25747/PG2V-KX70
DOI https://doi.org/10.25747/PG2V-KX70
ភាសា Tetun
Spatial Coverage Timor-Leste