Keating, Ron Andrew (2011) A specification language for agglutinative aboriginal languages for use with finite-state spelling correction. Masters thesis, Memorial University of Newfoundland.
[English]
PDF
- Accepted Version
Available under License - The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission. Download (9MB) |
Abstract
There are certain North American aboriginal languages which are in danger of becoming extinct. This is partially due to the younger generations learning more major world languages in order to communicate in an increasingly global society. Furthermore, these languages tend to have only developed writing systems relatively recently, and thus do not have a rich legacy of written works to help preserve them. In order to help alleviate this problem, certain tools are being developed to facilitate communication in those languages. One such tool that is expected to help is a digital spelling correction tool. Having such a tool will make it easier to create professional digital documents in those languages, as well as help educate speakers with regard to the proper spellings of words. -- To facilitate the creation of a spelling correction software tool, this thesis proposes a simplified specification language called FSCL. Linguists can use FSCL to specify the details of natural languages in a format that is easily readable by both humans and computers without having to sacrifice any relevant expressive power, thus allowing linguists and even language speakers to build and maintain a working model of the natural language in question. The syntactic and semantic details of FSCL are discussed, and an implementation built on existing finite-state natural language processing algorithms is detailed and tested with respect to a set of actual language data from one such aboriginal language, Innu.
Item Type: | Thesis (Masters) |
---|---|
URI: | http://research.library.mun.ca/id/eprint/9518 |
Item ID: | 9518 |
Additional Information: | Bibliography: leaves 87-92. |
Department(s): | Science, Faculty of > Computer Science |
Date: | 2011 |
Date Type: | Submission |
Library of Congress Subject Heading: | Computational linguistics; Natural language processing (Computer science); Indigenous peoples--Languages--Orthography and spelling; Spelling errors--Software |
Actions (login required)
View Item |