Labadain-Stopwords: A Curated List of 160 Tetun Stopwords

Labadain-Stopwords is a curated list of 160 Tetun stopwords, compiled from the Labadain-30k+ dataset and validated by native speakers. It is well-suited for various Tetun information retrieval and natural language processing tasks.The list is distributed in plain text format, with one word per line, enabling easy integration into various projects and applications.

البيانات و الموارد

معلومات إضافية

حقل القيمة
المؤلف Gabriel de Jesus, Sérgio Nunes
آخر تحديث أبريل 15, 2025, 16:23 (UTC)
أنشئت مارس 28, 2025, 10:03 (UTC)
Citation de Jesus, G., & Nunes, S. (2025). Labadain-Stopwords: A Curated List of 160 Tetun Stopwords [Data set]. INESC TEC. https://doi.org/10.25747/PG2V-KX70
DOI https://doi.org/10.25747/PG2V-KX70
اللغة Tetun
Spatial Coverage Timor-Leste