Labadain-Avaliadór : A Test Collection for Tetun Ad-hoc Test Retrieval Task

The Labadain-Avaliadór dataset is a test collection developed for the ad-hoc retrieval task. It comprises 59 topics, 33,550 documents, and 5,900 query-document relevance judgments (qrels), with an average of 36.76 relevant documents per query. The queries are sourced from real-world search activity, specifically from two channels: Google Search Console logs for Timor News and internal search logs from the Timor News website. The document collection is derived from the Labadain-30k+ dataset.

Data ja resurssit

Lisätietoja

Kenttä Arvo
Laatija Gabriel de Jesus, Sérgio Nunes
Viimeksi päivitetty huhtikuuta 15, 2025, 16:24 (UTC)
Luotu maaliskuuta 28, 2025, 10:03 (UTC)
Citation de Jesus, G., & Nunes, S. (2025). Labadain-Avaliadór : A Test Collection for Tetun Ad-hoc Test Retrieval Task [Data set]. INESC TEC. https://doi.org/10.25747/2K6S-E518
DOI https://doi.org/10.25747/2K6S-E518
Kieli Tetun
Spatial Coverage Timor-Leste