Online corpus of spoken Ilokano language

Date
2019
Publisher
IOP Conference Series: Materials Science and Engineering
Abstract
In recent years there has been considerable effort worldwide to collect data on different languages, and the development of online corpora abroad has opened new possibilities for the Philippines. However, resources for the Ilokano language remain limited. This paper introduces the Corpus of Spoken Ilokano Language, an online repository of Ilokano as spoken in the Philippines, specifically in Region 1. The corpus consists of spoken Ilokano, was built specifically for natural language processing, and captures differences in how the language is spoken by Ilokanos across the region. The database covers 160 speakers, 40 from each province of the region, each speaking about 74 statements. The speech was audio-recorded and transcribed, and a web application was developed to make the dataset available online. The corpus was validated to provide a useful data resource for automatic speech recognition models.
Keywords
Ilokano language, Corpus linguistics, Natural language processing, Philippine languages
Citation
Apostol, F. R., & Malicdem, A. R. (2019). Online corpus of spoken Ilokano language. IOP Conference Series: Materials Science and Engineering, 482(012034), 1-8. doi:10.1088/1757-899X/482/1/012034