Skip to main content
Log in
Datasets
Organizacións
Grupos
Sobre
Search Datasets...
Home
Datasets
Order by
Relevancia
Nome ascendente
Nome descendente
Última modificación
Go
1 dataset found
Etiquetas:
text corpus
Filter Results
Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
Labadain-30k+ is a monolingual Tetun dataset containing 33,550 documents spanning from June 2001 to September 2023, excluding the years 2004 and 2005, for which no documents are...
TXT
PYTHON
You can also access this registry using the
API
(see
API Docs
).