The role of AI in English assessment

Jennifer Manning
A woman holding a tablet stood in a server room

Digital assessment is becoming more and more widespread in recent years. But what’s the role of digital assessment in teaching today? We’d like to give you some insight into digital assessment and automated scoring.

Just a few years ago, there may have been doubts about the role of AI in English assessment and the ability of a computer to score language tests accurately. But today, thousands of teachers worldwide use automated language tests to assess their students’ language proficiency.

For example, ÃÛÌÒapp’s suite of Versant tests have been delivering automated language assessments for nearly 25 years. And since its launch in 1996, over 350 million tests have been scored. The same technology is used in ÃÛÌÒapp’s Benchmark and Level tests.

So what makes automated scoring systems so reliable?

Huge data sets of exam answers and results are used to train artificial intelligence machine learning technology to score English tests the same way that human markers do. This way, we’re not replacing human judgment; we’re just teaching computers to replicate it.

Of course, computers are much more efficient than humans. They don’t mind monotonous work and don’t make mistakes (the standard marking error of an AI-scored test is lower than that of a human-scored test). So we can get unbiased, accurate, and consistent scores.

The top benefits of automated scoring are speed, reliability, flexibility, and free from bias.

Speed

The main advantage computers have over humans is that they can quickly process complex information. Digital assessments can often provide an instant score turnaround. We can get accurate, reliable results within minutes. And that’s not just for multiple-choice answers but complex responses, too.

The benefit for teachers and institutions is that they can have hundreds, thousands, or tens of thousands of learners taking a test simultaneously and instantly receive a score.

The sooner you have scores, the sooner you can make decisions about placement and students’ language level or benchmark a learner’s strengths and weaknesses and make adjustments to learning that drive improvement and progress.

Flexibility

The next biggest benefit of digital assessment is flexible delivery models. This has become increasingly more important since online learning has become more prominent.

Accessibility became key: how can your institution provide access to assessment for your learners, if you can’t deliver tests on school premises?

The answer is digital assessment.

For example, Versant, our web-based test can be delivered online or offline, on-site or off-site. All test-takers need is a computer and a headset with a microphone. They can take the test anywhere, any time of day, any day of the week, making it very flexible to fit into someone's schedule or situation.Ìý

Free from bias

Impartiality is another important benefit of AI-based scoring. The AI engine used to score digital proficiency tests is completely free from bias. It doesn’t get tired, and it doesn’t have good and bad days like human markers do. And it doesn’t have a personality.

While some human markers are more generous and others are more strict, AI is always equally fair. Thanks to this, automated scoring provides consistent, standardized scores, no matter who’s taking the test.

If you’re testing students from around the world, with different backgrounds, they will be scored solely on their level of English, in a perfectly objective way.

Additional benefits of automated scoring are security and cost.

Security

Digital assessments are more difficult to monitor than in-person tests, so security is a valid concern. One way to deal with this is remote monitoring.

Remote proctoring adds an extra layer of security, so test administrators can be confident that learners taking the test from home don’t cheat.

For example, our software captures a video of test takers, and the AI detection system automatically flags suspicious test-taker behavior. Test administrators can access the video anytime for audits and reviews, and easily find suspicious segments highlighted by our AI.

Here are a few examples of suspicious behavior that our system might flag:

Image monitoring:

  • A different face or multiple faces appearing in the frame
  • Camera blocked

Browser monitoring:

  • Navigating away from the test window or changing tabs multiple times

Video monitoring:

  • Test taker moving out of camera view
  • More than one person in the camera view
  • Looking away from the camera multiple times

Cost

Last but not least, the cost of automated English certifications are a benefit. Indeed, automated scoring can be a more cost-effective way of monitoring tests, primarily because it saves time and resources.

ÃÛÌÒapp English proficiency assessments are highly scalable and don’t require extra time from human scorers, no matter how many test-takers you have.

Plus, there’s no need to spend time and money on training markers or purchasing equipment.

AI is helping to lead the way with efficient, accessible, fair and cost-effective English test marking/management. Given time it should develop even further, becoming even more advanced and being of even more help within the world of English language learning and assessments.Ìý

More blogs from ÃÛÌÒapp

  • A child sat with a teacher with a tablet

    Writing your own English language materials with the GSE

    By Billie Jago

    Being an English language teacher means you’re also probably (definitely) a materials writer. You likely tailor or create language materials for your students that are suited to their needs and interests, either as supplements to your course materials or for communicative lessons. Alternatively, you might be a teacher who creates paid, published materials available for students worldwide to enjoy.

    With this in mind, think of the materials you’ve developed and ask yourself the following:

    • How do you level your grammar or vocabulary for the content you write?
    • How do you find topic-related vocabulary to extend your students’ knowledge of language?
    • How do you contextualize new grammar or vocabulary?

    You can use many different resources, from online dictionaries to course workbooks to a Google search. Still, the Global Scale of English is a reference that provides everything you need to write great learning materials, all in one place. It can help save you valuable time as a teacher and materials writer.

    For me, the GSE was a game changer as an English teacher, and it continues to be as I write materials. The GSE is not just a tool; it’s a companion in the complex journey of material development, offering clarity and direction at every step. It can guide you in creating effective, engaging learning resources.

    How to use the GSE toolkit to create your own materials

    1. Establishing clear Learning Objectives

    helps you start with a clear roadmap. It provides detailed descriptors for language proficiency at every level, ensuring your materials align with specific learning objectives. For instance, if you’re creating a beginner-level reading comprehension activity, the GSE descriptors will guide you on the appropriate complexity of vocabulary and sentence structures.

    Take a look at the Learning Objectives tab in the GSE Toolkit to learn more.

    2. Designing level-appropriate content

    Once objectives are set, the GSE assists in tailoring the content difficulty to the targeted proficiency level. Its numerical scale, ranging from 10 to 90, allows you to pinpoint the exact level of language skills required and design your materials accordingly. This precision ensures that learners are neither overwhelmed nor under-challenged.

    You can set the level you are looking for by sliding the bar along the scale, so it corresponds to the appropriate CEFR level or GSE range.