Page 52 - The-5th-MCAIT2021-eProceeding
P. 52

An Adjusted BERT Architecture for The Automatic Essay
                                            Scoring Task


                                                                     b
                                           a
                                                                                      c
                   Ridha Hussein Chassab , Lailatul Qadri Zakaria *, Sabrina Tiun
         a,b,c The Asean Natural Language Processing (ASLAN), Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia
                                      43600 Bangi, Selangor Darul Ehsan, Malaysia.
                                         * Email: lailatul.qadri@ukm.edu.my

        Abstract

        Automatic Essay Scoring (AES) is the process of identifying an automatic score for an essay answer. The state-of-the-art in
        AES task relies on word embedding techniques. One of the advanced embedding architecture that seems promising is the
        Bidirectional Encoder Representations from Transformers (BERT). Yet, such an architecture suffers from ‘catastrophic
        forgetting’ problem. This problem occurs because the gradients of fine-tuning BERT quickly forget significant information.
        In order to overcome such a limitation and to adapt the BERT architecture to be fit for the AES task, it is imperative to
        address an adequate adjustment on the learning rate. Therefore, this paper aims at proposing an adjusted BERT architecture
        based on unfreezing fine-tune mechanism in which the BERT architecture can adequately adopted for the AES task.

        Keywords: Automatic Essay Scoring; Automatic Essay Grading; Bidirectional Encoder Representations from Transformers.


        1. Introduction

           Assessment is considered as the main component in the educational domain where student’s abilities are
        being evaluated (Valenti, Neri, & Cucchiarelli, 2003). The last two decades showed a great investment in E-
        learning by utilizing the latest technologies. The dawn of 21 century has witnessed a great progress in terms of
        computerized assessment systems (Valenti et al. 2000). The majority of such systems were concentrating on
        multiple choice questions where the assessment would take the form of storing correct answers in a relational
        database and accommodating a simple matching mechanism to identify whether the student’s answer is correct
        or not. Since the multiple-choice questions have a factual and exact answers, it was to implement an automatic
        assessment system to evaluate answers. Yet, another challenge has been arisen toward the computerized exams,
        such challenge is represented by the assessment of questions that require subjective answers. Automatic Essay
        Scoring (AES) is the process of identifying an automatic score for an essay answer. Subjective or essay answers
        require much more effort in order to articulate an automatic assessment for it.
           The state-of-the-art in AES task relies Word2Vec and pretrained Glove architectures (Chen & Zhou, 2019;
        Hendre, Mukherjee, Preet, & Godse, 2020; Li, Chen, & Nie, 2020; Li et al., 2018; Liu et al., 2019; Wang, Liu,
        & Dong, 2018; Zhang & Litman, 2019). The main limitation behind such architectures lies in its inability to
        handle sentence-level embedding and they suffer from ‘out-of-vocabulary’ problem. This problem occurs when
        a term would have no embedding vector within the training (i.e., unseen). A solution for the aforementioned
        problems depicted by the Bidirectional Encoder Representations from Transformers (BERT) architecture where
        it has a fixed vocabulary  size and a mechanism of rooting the terms. Although BERT showed remarkable
        performance  for  tasks  like  question-answering  yet,  it  showed  incompetency  when  tested  for  the  AES  task
        (Mayfield & Black, 2020; Rodriguez, Jafari, & Ormerod, 2019). The reason behind such miscarriage is due to
        a  well-known limitation behind the BERT architecture  which is ‘catastrophic forgetting’  (Rodriguez et al.,
        2019). This problem occurs because the gradients of fine-tuning BERT quickly forget significant information.







        E- Proceedings of The 5th International Multi-Conference on Artificial Intelligence Technology (MCAIT 2021)   [40]
        Artificial Intelligence in the 4th Industrial Revolution
   47   48   49   50   51   52   53   54   55   56   57