1
TITLE: Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus
AUTHORS: Gabriel de Jesus ; Sérgio Nunes ;
PUBLISHED: 2024, SOURCE: 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
INDEXED IN: Scopus
2
TITLE: Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
AUTHORS: Gabriel De Jesus ; Sérgio Nunes;
PUBLISHED: 2024, SOURCE: 3rd Annual Meeting of the ELRA-ISCA Special Interest Group on Under-Resourced Languages, SIGUL 2024 in 3rd Annual Meeting of the ELRA-ISCA Special Interest Group on Under-Resourced Languages, SIGUL 2024 at LREC-COLING 2024 - Workshop Proceedings
INDEXED IN: Scopus
3
TITLE: Text Information Retrieval in Tetun
AUTHORS: de Jesus, Gabriel ;
PUBLISHED: 2023, SOURCE: 45th European Conference on Information Retrieval (ECIR) in ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, VOLUME: 13982
INDEXED IN: Scopus WOS