Can computers really mark exams? Benefits of ELT automated assessments

app Languages
Hands typing at a laptop with symbols

Automated assessment, including the use of Artificial Intelligence (AI), is one of the latest education tech solutions. It speeds up exam marking times, removes human biases, and is as accurate and at least as reliable as human examiners. As innovations go, this one is a real game-changer for teachers and students. 

However, it has understandably been met with many questions and sometimes skepticism in the ELT community – can computers really mark speaking and writing exams accurately? 

The answer is a resounding yes. Students from all parts of the world already take AI-graded tests.  aԻ Versanttests – for example – provide unbiased, fair and fast automated scoring for speaking and writing exams – irrespective of where the test takers live, or what their accent or gender is. 

This article will explain the main processes involved in AI automated scoring and make the point that AI technologies are built on the foundations of consistent expert human judgments. So, let’s clear up the confusion around automated scoring and AI and look into how it can help teachers and students alike. 

AI versus traditional automated scoring

First of all, let’s distinguish between traditional automated scoring and AI. When we talk about automated scoring, generally, we mean scoring items that are either multiple-choice or cloze items. You may have to reorder sentences, choose from a drop-down list, insert a missing word- that sort of thing. These question types are designed to test particular skills and automated scoring ensures that they can be marked quickly and accurately every time.

While automatically scored items like these can be used to assess receptive skills such as listening and reading comprehension, they cannot mark the productive skills of writing and speaking. Every student's response in writing and speaking items will be different, so how can computers mark them?

This is where AI comes in. 

We hear a lot about how AI is increasingly being used in areas where there is a need to deal with large amounts of unstructured data, effectively and 100% accurately – like in medical diagnostics, for example. In language testing, AI uses specialized computer software to grade written and oral tests. 

How AI is used to score speaking exams

The first step is to build an acoustic model for each language that can recognize speech and convert it into waveforms and text. While this technology used to be very unusual, most of our smartphones can do this now. 

These acoustic models are then trained to score every single prompt or item on a test. We do this by using human expert raters to score the items first, using double marking. They score hundreds of oral responses for each item, and these ‘Standards’ are then used to train the engine. 

Next, we validate the trained engine by feeding in many more human-marked items, and check that the machine scores are very highly correlated to the human scores. If this doesn’t happen for any item, we remove it, as it must match the standard set by human markers. We expect a correlation of between .95-.99. That means that tests will be marked between 95-99% exactly the same as human-marked samples. 

This is incredibly high compared to the reliability of human-marked speaking tests. In essence, we use a group of highly expert human raters to train the AI engine, and then their standard is replicated time after time.  

How AI is used to score writing exams

Our AI writing scoring uses a technology called . LSA is a natural language processing technique that can analyze and score writing, based on the meaning behind words – and not just their superficial characteristics. 

Similarly to our speech recognition acoustic models, we first establish a language-specific text recognition model. We feed a large amount of text into the system, and LSA uses artificial intelligence to learn the patterns of how words relate to each other and are used in, for example, the English language. 

Once the language model has been established, we train the engine to score every written item on a test. As in speaking items, we do this by using human expert raters to score the items first, using double marking. They score many hundreds of written responses for each item, and these ‘Standards’ are then used to train the engine. We then validate the trained engine by feeding in many more human-marked items, and check that the machine scores are very highly correlated to the human scores. 

The benchmark is always the expert human scores. If our AI system doesn’t closely match the scores given by human markers, we remove the item, as it is essential to match the standard set by human markers.

AI’s ability to mark multiple traits 

One of the challenges human markers face in scoring speaking and written items is assessing many traits on a single item. For example, when assessing and scoring speaking, they may need to give separate scores for content, fluency and pronunciation. 

In written responses, markers may need to score a piece of writing for vocabulary, style and grammar. Effectively, they may need to mark every single item at least three times, maybe more. However, once we have trained the AI systems on every trait score in speaking and writing, they can then mark items on any number of traits instantaneously – and without error. 

AI’s lack of bias

A fundamental premise for any test is that no advantage or disadvantage should be given to any candidate. In other words, there should be no positive or negative bias. This can be very difficult to achieve in human-marked speaking and written assessments. In fact, candidates often feel they may have received a different score if someone else had heard them or read their work.

Our AI systems eradicate the issue of bias. This is done by ensuring our speaking and writing AI systems are trained on an extensive range of human accents and writing types. 

We don’t want perfect native-speaking accents or writing styles to train our engines. We use representative non-native samples from across the world. When we initially set up our AI systems for speaking and writing scoring, we trialed our items and trained our engines using millions of student responses. We continue to do this now as new items are developed.

The benefits of AI automated assessment

There is nothing wrong with hand-marking homework tests and exams. In fact, it is essential for teachers to get to know their students and provide personal feedback and advice. However, manually correcting hundreds of tests, daily or weekly, can be repetitive, time-consuming, not always reliable and takes time away from working alongside students in the classroom. The use of AI in formative and summative assessments can increase assessed practice time for students and reduce the marking load for teachers.

Language learning takes time, lots of time to progress to high levels of proficiency. The blended use of AI can:

  • address the increasing importance of formative assessmentto drive personalized learning and diagnostic assessment feedback 

  • allow students to practice and get instant feedback inside and outside of allocated teaching time

  • address the issue of teacher workload

  • create a virtuous combination between humans and machines, taking advantage of what humans do best and what machines do best. 

  • provide fair, fast and unbiased summative assessment scores in high-stakes testing.

We hope this article has answered a few burning questions about how AI is used to assess speaking and writing in our language tests. An interesting quote from Fei-Fei Li, Chief scientist at Google and Stanford Professor describes AI like this:

“I often tell my students not to be misled by the name ‘artificial intelligence’ — there is nothing artificial about it; A.I. is made by humans, intended to behave [like] humans and, ultimately, to impact human lives and human society.”

AI in formative and summative assessments will never replace the role of teachers. AI will support teachers, provide endless opportunities for students to improve, and provide a solution to slow, unreliable and often unfair high-stakes assessments.

Examples of AI assessments in ELT

At app, we have developed a range of assessments using AI technology.

Versant

The Versant tests are a great tool to help establish language proficiency benchmarks in any school, organization or business. They are specifically designed for placement tests to determine the appropriate level for the learner.

PTE Academic

The  is aimed at those who need to prove their level of English for a university place, a job or a visa. It uses AI to score tests and results are available within five days. 

app English International Certificate (PEIC)

app English International Certificate (PEIC) also uses automated assessment technology. With a two-hour test available on-demand to take at home or at school (or at a secure test center). Using a combination of advanced speech recognition and exam grading technology and the expertise of professional ELT exam markers worldwide, our patented software can measure English language ability.

Read more about the use of AI in our learning and testing here, or if you're wondering which English test is right for your students make sure to check out our post 'Which exam is right for my students?'.

More blogs from app

  • A child sat with a teacher with a tablet

    Writing your own English language materials with the GSE

    Por Billie Jago

    Being an English language teacher means you’re also probably (definitely) a materials writer. You likely tailor or create language materials for your students that are suited to their needs and interests, either as supplements to your course materials or for communicative lessons. Alternatively, you might be a teacher who creates paid, published materials available for students worldwide to enjoy.

    With this in mind, think of the materials you’ve developed and ask yourself the following:

    • How do you level your grammar or vocabulary for the content you write?
    • How do you find topic-related vocabulary to extend your students’ knowledge of language?
    • How do you contextualize new grammar or vocabulary?

    You can use many different resources, from online dictionaries to course workbooks to a Google search. Still, the Global Scale of English is a reference that provides everything you need to write great learning materials, all in one place. It can help save you valuable time as a teacher and materials writer.

    For me, the GSE was a game changer as an English teacher, and it continues to be as I write materials. The GSE is not just a tool; it’s a companion in the complex journey of material development, offering clarity and direction at every step. It can guide you in creating effective, engaging learning resources.

    How to use the GSE toolkit to create your own materials

    1. Establishing clear Learning Objectives

    helps you start with a clear roadmap. It provides detailed descriptors for language proficiency at every level, ensuring your materials align with specific learning objectives. For instance, if you’re creating a beginner-level reading comprehension activity, the GSE descriptors will guide you on the appropriate complexity of vocabulary and sentence structures.

    Take a look at the Learning Objectives tab in the GSE Toolkit to learn more.

    2. Designing level-appropriate content

    Once objectives are set, the GSE assists in tailoring the content difficulty to the targeted proficiency level. Its numerical scale, ranging from 10 to 90, allows you to pinpoint the exact level of language skills required and design your materials accordingly. This precision ensures that learners are neither overwhelmed nor under-challenged.

    You can set the level you are looking for by sliding the bar along the scale, so it corresponds to the appropriate CEFR level or GSE range.

  • People sat in a office, one with a VR headset on smiling and gesturing with the other two watching

    The impact of AI on language learning

    Por

    Recently, Artificial Intelligence (AI) has shown itself to be a useful and breakthrough tool for many things, including language learning. Educators, learners and businesses are constantly seeking effective methods to impart language skills and AI-powered solutions have proved to be invaluable assets in this regard.

    The impact of AI on language learning is multifaceted, presenting opportunities as well as challenges for educators, learners and corporate entities. According to a , many institutions say that AI will be instrumental to improving their competitiveness, funding, and innovation.

    It is a widely discussed topic nowadays. While it has its advantages, there are also some drawbacks to consider.

  • A teacher stood by a student in a classroom smiling

    Online language teaching tools: A comprehensive guide for educators

    Por app Languages

    Online language teaching has become a prominent facet of modern education, especially in light of recent global changes. The ability to instruct and learn languages online has opened up new possibilities for educators and students alike.

    The importance of using tools in online language teaching cannot be overstated. In this digital era, these tools serve as the linchpin for effective and engaging instruction. They enhance the learning experience, promote interaction and provide valuable resources that traditional teaching methods often lack.

    The digital language teaching landscape

    The shift to online education

    As a result of the digital revolution, which has completely changed how information is accessible and disseminated, education is moving toward being done online.

    has several benefits, such as:

    • Accessibility: Online language courses break down geographical barriers, allowing students worldwide to access quality instruction and teaching.
    • Flexibility: Students can tailor their learning schedules to their needs, making education more accommodating for a diverse range of learners.
    • Interactivity: Online platforms offer numerous interactive features, such as virtual classrooms, video conferences and chat tools, enhancing student-teacher and peer-to-peer interactions.
    • Rich resources:Online language teaching can harness a vast array of multimedia resources, making learning more engaging and effective.

    Challenges for language educators

    While the shift to , it also brings challenges that language educators must navigate, including:

    • Adaptation: Educators must adapt to new teaching methods, technologies and platforms, which can be a steep learning curve for some.
    • Engagement: Maintaining student engagement in an online environment requires creative approaches and the integration of various teaching tools.
    • Assessment: Assessing student progress can be more complex online, but it also offers the opportunity for innovative assessment methods.
    • Technical issues: Connectivity problems and technical glitches can disrupt online classes, requiring educators to be tech-savvy troubleshooters.
    • Isolation: Online learning can sometimes lead to feelings of isolation among students, so educators need to foster a sense of community and support.

    Essential online language teaching tools

    The right tools are essential to create engaging and compelling learning experiences. This section explores the key categories of tools that educators can utilize to enhance their online language teaching.

    Tools for interactive lessons and activities

    • Video conferencing platforms: These platforms enable real-time interaction with students, allowing educators to conduct live lessons, engage in discussions and facilitate group activities. Video conferencing helps replicate the classroom experience online. Platforms such as Zoom, Microsoft Teams and Google Meet provide a virtual classroom environment with features for video, audio and chat communication.
    • Interactive whiteboards: Tools like Jamboard and allow educators to write, draw and share resources in real time. This feature enhances engagement and enables educators to illustrate concepts effectively.
    • Screen recorder: software allows educators to capture their lessons, which can be invaluable for students who miss a live session. Recorded lessons can also serve as a reference for students during their self-paced learning.
    • Teleprompter: A helps educators maintain a smooth and organized delivery during recorded lessons. It ensures that lessons stay on track and within a predefined timeframe, improving the clarity and effectiveness of instruction.

    Tools for assessment and feedback

    • Quizzes and surveys: Platforms like and allow educators to create online quizzes and surveys for assessing students' knowledge.
    • Assignment submission tools: Learning management systems (LMSs) such asapp English Connect (PEC)and MyEnglishLab (MEL) enable students to submit assignments electronically, making it easier for educators to grade and provide feedback.
    • Feedback forms: Google Forms and Microsoft Forms can collect student feedback, helping educators improve their teaching methods.

    Tools for content and resource creation

    • Video creation tools: Video editing software empowers educators to create engaging educational videos, lectures and tutorials. This multimedia approach can make complex concepts more accessible to students.
    • Video compressor: tools reduce the file size of videos without compromising quality. This is especially useful for ensuring students can easily access and stream video content, even with limited bandwidth.

    How to implement online language tools

    Implementing online language tools effectively is crucial for creating a dynamic and successful learning environment. Here are some best practices for integrating these tools into language teaching.

    • Alignment with learning objectives: Ensure your tools align with your course objectives. They should enhance the learning experience and help achieve specific educational goals.
    • Teacher training: Invest time in training educators to use these tools effectively. Familiarity with the tools' features and functionalities is essential.
    • Clear instructions: Provide clear and concise instructions on how to use the tools. Create tutorials or to help students navigate them effectively.
    • Accessibility and inclusivity: Ensure that the tools and content you use are accessible to all students, including those with disabilities. This means considering factors like compatibility and providing transcripts for multimedia content.
    • Regular feedback: Encourage students to provide feedback on their experience with the tools. Use this feedback to make improvements and adjustments.

    Explore the tools discussed in this guide, experiment with them and adapt your teaching methods to the digital age. By staying curious, adaptable and open to innovation, you'll be better equipped to provide your students with the best possible language learning experience in the online realm. Happy teaching!

    If you're a teacher looking for more hints and advice to level up your language teaching, check out ourlanguage teaching blog posts. We also have opportunities for professional development.