
Corpus building is the task of creating a corpus suitable for various natural language processing tasks. The corpora are often sanitized and labeled, and are made available in a ready-to-use format.

| Paper | Published Year | Code |
| --- | --- | --- |
| Construction and annotation of a corpus of contemporary Nepali | 2008 | N/A |
| Nelralec/Bhasha Sanchar Working Paper 2 Categorisation for automated morphosyntactic analysis of Nepali: introducing the Nelralec Tagset (NT-01) | 2005 | N/A |