1 dataset found
Tags: text corpus
Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
Labadain-30k+ is a monolingual Tetun dataset containing 33,550 documents spanning from June 2001 to September 2023, excluding the years 2004 and 2005, for which no documents are...
Formats: TXT, PYTHON
You can also access this registry using the API (see API Docs).