One solution of searching text documents in Serbian language
Abstract: The modern way of life, e-business, a large amount of data available in electronic form imposed the need for analysis of textual documents written in different natural languages. Natural languages have different grammatical rules and a lot of exceptions, which complicate analyze of documents. In this paper, we propose one solution for searching documents on Serbian language. System components have been described. Also, normalization algorithm, preparing algorithm for grouping similar documents, and searching algorithm have been implemented.
engleski
2013
Ovo delo je licencirano pod uslovima licence
Creative Commons CC BY 4.0 - Creative Commons Autorstvo 4.0 International License.
http://creativecommons.org/licenses/by/4.0/legalcode
Keywords: clustering, keyword, n-gram, normalization, similarity searching.