Class IndonesianStemmer


  • public class IndonesianStemmer
    extends java.lang.Object
    Stemmer for Indonesian.

    Stems Indonesian words with the algorithm presented in: A Study of Stemming Effects on Information Retrieval in Bahasa Indonesia, Fadillah Z Tala. http://www.illc.uva.nl/Publications/ResearchReports/MoL-2003-02.text.pdf

    • Constructor Detail

      • IndonesianStemmer

        public IndonesianStemmer()
    • Method Detail

      • stem

        public int stem​(char[] text,
                        int length,
                        boolean stemDerivational)
        Stem a term (returning its new length).

        Use stemDerivational to control whether full stemming or only light inflectional stemming is done.

      • stemDerivational

        private int stemDerivational​(char[] text,
                                     int length)
      • isVowel

        private boolean isVowel​(char ch)
      • removeParticle

        private int removeParticle​(char[] text,
                                   int length)
      • removePossessivePronoun

        private int removePossessivePronoun​(char[] text,
                                            int length)
      • removeFirstOrderPrefix

        private int removeFirstOrderPrefix​(char[] text,
                                           int length)
      • removeSecondOrderPrefix

        private int removeSecondOrderPrefix​(char[] text,
                                            int length)
      • removeSuffix

        private int removeSuffix​(char[] text,
                                 int length)