Page 70 - The-5th-MCAIT2021-eProceeding
P. 70

which containsfinalized positive and negative adjective sentiment bearable words.

















        Fig. 1. Sentiment Lexicon Generation Model Based on Dictionary Approach

        4. Results and Discussion

              The result of the experiment is shown in Table 1.
        Table 1. Total adjectives generated after five iterations.
                                          Items         Total Number
                                      Number of POS        25541
                                      Number of NEG        36064
                                       Total Words
                                        Generated          61605

            The first iteration started with setting a pair of strong seed sets. The initial seed set was choose based on
        our benchmark study, which is “baik” for the positive seed set and “buruk” for the negative seed set. After the
        experiment was run for five iterations, the final iteration produceda total of 61605, with 36064 words that exist
        in negative polarity, meanwhile 25541 words with positive polarity. The result also shows that the final lexicon
        produced not only limited to a single word but also multiple terms such as "bersikapsabar”. It also can be seen
        that some of the terms were generated more than once.  For example, in the positive list, the word 'suci' which
        means clean appears 45 times, and 'bangpak' which means cruel appear 18 times. Even though the same word
        appears  multiple  times,  but  it  was  rooted  in  different  synsets.This  result  also  shows  that  the  synonymy
        generation process for positive sentiment words produced several same lemmas or words that duplicate with
        antonymy words generation in negative lexicon list. This also happened in synonymy generation for negative
        lexicon list, which also generated a few same lemmas in antonymy generation in the positive list.It can be seen
        here that the first iteration collects the related words in the synset mostly, but the number of sentiment words
        increases slowly in the remaining iterations. Due to this reason, the numbers of iterations were run at five (5)
        iterations only as being done in previous research (Alexander & Omar, 2017).

        5. Conclusion and Future Works

              In  this  paper,  it  was  demonstrated  that  the  sentiment  lexicon  generation  algorithm  for  the  Malay
        Languagebased  ondictionary-based  approach  could  lead  to  an  expanded  sentiment  lexiconand  promising
        results. However, this proposed work only contains adjectives, even though it is known that other Part-of-speech
        (POS) also carry the sentiment. Moreover, this work is only suitable to cater sentiment analysis task of formal







        E- Proceedings of The 5th International Multi-Conference on Artificial Intelligence Technology (MCAIT 2021)   [57]
        Artificial Intelligence in the 4th Industrial Revolution
   65   66   67   68   69   70   71   72   73   74   75