The role of AI in English assessment

Jennifer Manning
A woman holding a tablet stood in a server room

Digital assessment is becoming more and more widespread in recent years. But what’s the role of digital assessment in teaching today? We’d like to give you some insight into digital assessment and automated scoring.

Just a few years ago, there may have been doubts about the role of AI in English assessment and the ability of a computer to score language tests accurately. But today, thousands of teachers worldwide use automated language tests to assess their students’ language proficiency.

For example, ÃÛÌÒapp’s suite of Versant tests have been delivering automated language assessments for nearly 25 years. And since its launch in 1996, over 350 million tests have been scored. The same technology is used in ÃÛÌÒapp’s Benchmark and Level tests.

So what makes automated scoring systems so reliable?

Huge data sets of exam answers and results are used to train artificial intelligence machine learning technology to score English tests the same way that human markers do. This way, we’re not replacing human judgment; we’re just teaching computers to replicate it.

Of course, computers are much more efficient than humans. They don’t mind monotonous work and don’t make mistakes (the standard marking error of an AI-scored test is lower than that of a human-scored test). So we can get unbiased, accurate, and consistent scores.

The top benefits of automated scoring are speed, reliability, flexibility, and free from bias.

Speed

The main advantage computers have over humans is that they can quickly process complex information. Digital assessments can often provide an instant score turnaround. We can get accurate, reliable results within minutes. And that’s not just for multiple-choice answers but complex responses, too.

The benefit for teachers and institutions is that they can have hundreds, thousands, or tens of thousands of learners taking a test simultaneously and instantly receive a score.

The sooner you have scores, the sooner you can make decisions about placement and students’ language level or benchmark a learner’s strengths and weaknesses and make adjustments to learning that drive improvement and progress.

Flexibility

The next biggest benefit of digital assessment is flexible delivery models. This has become increasingly more important since online learning has become more prominent.

Accessibility became key: how can your institution provide access to assessment for your learners, if you can’t deliver tests on school premises?

The answer is digital assessment.

For example, Versant, our web-based test can be delivered online or offline, on-site or off-site. All test-takers need is a computer and a headset with a microphone. They can take the test anywhere, any time of day, any day of the week, making it very flexible to fit into someone's schedule or situation.Ìý

Free from bias

Impartiality is another important benefit of AI-based scoring. The AI engine used to score digital proficiency tests is completely free from bias. It doesn’t get tired, and it doesn’t have good and bad days like human markers do. And it doesn’t have a personality.

While some human markers are more generous and others are more strict, AI is always equally fair. Thanks to this, automated scoring provides consistent, standardized scores, no matter who’s taking the test.

If you’re testing students from around the world, with different backgrounds, they will be scored solely on their level of English, in a perfectly objective way.

Additional benefits of automated scoring are security and cost.

Security

Digital assessments are more difficult to monitor than in-person tests, so security is a valid concern. One way to deal with this is remote monitoring.

Remote proctoring adds an extra layer of security, so test administrators can be confident that learners taking the test from home don’t cheat.

For example, our software captures a video of test takers, and the AI detection system automatically flags suspicious test-taker behavior. Test administrators can access the video anytime for audits and reviews, and easily find suspicious segments highlighted by our AI.

Here are a few examples of suspicious behavior that our system might flag:

Image monitoring:

  • A different face or multiple faces appearing in the frame
  • Camera blocked

Browser monitoring:

  • Navigating away from the test window or changing tabs multiple times

Video monitoring:

  • Test taker moving out of camera view
  • More than one person in the camera view
  • Looking away from the camera multiple times

Cost

Last but not least, the cost of automated English certifications are a benefit. Indeed, automated scoring can be a more cost-effective way of monitoring tests, primarily because it saves time and resources.

ÃÛÌÒapp English proficiency assessments are highly scalable and don’t require extra time from human scorers, no matter how many test-takers you have.

Plus, there’s no need to spend time and money on training markers or purchasing equipment.

AI is helping to lead the way with efficient, accessible, fair and cost-effective English test marking/management. Given time it should develop even further, becoming even more advanced and being of even more help within the world of English language learning and assessments.Ìý

More blogs from ÃÛÌÒapp

  • A teacher helping a student at a table

    The Global Scale of English and planning: A perfect partnership

    By Leonor Corradi

    As a teacher, I realized that planning had become an 'automatic pilot' routine from which I did not learn much. Like many others, I thought scales such as the Global Scale of English (GSE) or the Common European Framework of Reference are just that; references that are beyond the realities of their lessons.

    However, I've seen that the GSE is a very powerful resource to help us at the level of planning.Ìý

    If you're using a coursebook you may have noticed that, after completing one of the books in the series, students move up one level, such as from elementary to pre-intermediate or from intermediate to upper-intermediate.

    We all understand what it means to be an elementary or intermediate student. These levels are usually defined in terms of structures – conditional sentences, passive voice, and tenses – Simple Past, Future Continuous, etc.

    But why do students want to learn English? Using it means being able to listen or read and understand, interact with others, and communicate in writing. Even if it is parents who enroll their children in language institutes, what they want is for them to use the language. We can see a mismatch between how levels are defined and students' aims to study English.

    Here's how the GSE can help English language teachers

    First, you need the right scale for your group – Pre-primary, Young Learners, Adults, Professionals or Academic, which can be downloaded at:

    /languages/why-pearson/the-global-scale-of-english/resources.html

    Focus on your students' level. There you will see all the learning objectives that students need to achieve to complete the level at which they are and move on in their learning journey.

    What are learning objectives?ÌýThey are can-do statements that clearly describe what students are expected to achieve as the result of instruction. In other words, these objectives guide teachers in our planning to help students learn.

    When we plan our lessons, rather than working at lesson level only, we should reflect on how the activities proposed are referenced against the learning objectives of the level. We may see that some activities need some adapting in order to focus on the selected learning outcomes.

    At the level of planning as well, I also use the GSE to analyze the activities proposed in the materials I am using. Let me tell you what I do. Let's take listening, for instance. You may use the downloaded scales or the Teacher Toolkit that the GSE provides. Let's run through how this works.Ìý

    Ìý