The role of AI in English assessment

Jennifer Manning
A woman holding a tablet stood in a server room

Digital assessment is becoming more and more widespread in recent years. But what’s the role of digital assessment in teaching today? We’d like to give you some insight into digital assessment and automated scoring.

Just a few years ago, there may have been doubts about the role of AI in English assessment and the ability of a computer to score language tests accurately. But today, thousands of teachers worldwide use automated language tests to assess their students’ language proficiency.

For example, ÃÛÌÒapp’s suite of Versant tests have been delivering automated language assessments for nearly 25 years. And since its launch in 1996, over 350 million tests have been scored. The same technology is used in ÃÛÌÒapp’s Benchmark and Level tests.

So what makes automated scoring systems so reliable?

Huge data sets of exam answers and results are used to train artificial intelligence machine learning technology to score English tests the same way that human markers do. This way, we’re not replacing human judgment; we’re just teaching computers to replicate it.

Of course, computers are much more efficient than humans. They don’t mind monotonous work and don’t make mistakes (the standard marking error of an AI-scored test is lower than that of a human-scored test). So we can get unbiased, accurate, and consistent scores.

The top benefits of automated scoring are speed, reliability, flexibility, and free from bias.

Speed

The main advantage computers have over humans is that they can quickly process complex information. Digital assessments can often provide an instant score turnaround. We can get accurate, reliable results within minutes. And that’s not just for multiple-choice answers but complex responses, too.

The benefit for teachers and institutions is that they can have hundreds, thousands, or tens of thousands of learners taking a test simultaneously and instantly receive a score.

The sooner you have scores, the sooner you can make decisions about placement and students’ language level or benchmark a learner’s strengths and weaknesses and make adjustments to learning that drive improvement and progress.

Flexibility

The next biggest benefit of digital assessment is flexible delivery models. This has become increasingly more important since online learning has become more prominent.

Accessibility became key: how can your institution provide access to assessment for your learners, if you can’t deliver tests on school premises?

The answer is digital assessment.

For example, Versant, our web-based test can be delivered online or offline, on-site or off-site. All test-takers need is a computer and a headset with a microphone. They can take the test anywhere, any time of day, any day of the week, making it very flexible to fit into someone's schedule or situation.Ìý

Free from bias

Impartiality is another important benefit of AI-based scoring. The AI engine used to score digital proficiency tests is completely free from bias. It doesn’t get tired, and it doesn’t have good and bad days like human markers do. And it doesn’t have a personality.

While some human markers are more generous and others are more strict, AI is always equally fair. Thanks to this, automated scoring provides consistent, standardized scores, no matter who’s taking the test.

If you’re testing students from around the world, with different backgrounds, they will be scored solely on their level of English, in a perfectly objective way.

Additional benefits of automated scoring are security and cost.

Security

Digital assessments are more difficult to monitor than in-person tests, so security is a valid concern. One way to deal with this is remote monitoring.

Remote proctoring adds an extra layer of security, so test administrators can be confident that learners taking the test from home don’t cheat.

For example, our software captures a video of test takers, and the AI detection system automatically flags suspicious test-taker behavior. Test administrators can access the video anytime for audits and reviews, and easily find suspicious segments highlighted by our AI.

Here are a few examples of suspicious behavior that our system might flag:

Image monitoring:

  • A different face or multiple faces appearing in the frame
  • Camera blocked

Browser monitoring:

  • Navigating away from the test window or changing tabs multiple times

Video monitoring:

  • Test taker moving out of camera view
  • More than one person in the camera view
  • Looking away from the camera multiple times

Cost

Last but not least, the cost of automated English certifications are a benefit. Indeed, automated scoring can be a more cost-effective way of monitoring tests, primarily because it saves time and resources.

ÃÛÌÒapp English proficiency assessments are highly scalable and don’t require extra time from human scorers, no matter how many test-takers you have.

Plus, there’s no need to spend time and money on training markers or purchasing equipment.

AI is helping to lead the way with efficient, accessible, fair and cost-effective English test marking/management. Given time it should develop even further, becoming even more advanced and being of even more help within the world of English language learning and assessments.Ìý

More blogs from ÃÛÌÒapp

  • A parent and their child laying on the floor drawing together on a large peice of paper

    Raising bilingual kids: Sharing your family language at home

    By Charlotte Guest
    Reading time: 3 minutes

    A shared language is central to many families, and this can carry extra meaning when your children are growing up in a country that speaks a different language. It's not just about words; it's about culture, identity and connection. If you'd like to teach your kids the language that holds a special place in your heart, here are some tips to get you started.

  • A group of students stood around a teacher on a laptop

    The ethical challenges of AI in education

    By
    Reading time: 5 minutes

    AI is revolutionising every industry, and language learning is no exception. AI tools can provide students with unprecedented access to things like real-time feedback, instant translation and AI-generated texts, to name but a few.

    AI can be highly beneficial to language education by enhancing our students’ process of learning, rather than simply being used by students to ‘demonstrate’ a product of learning. However, this is easier said than done, and given that AI is an innovative tool in the classroom, it is crucial that educators help students to maintain authenticity in their work and prevent AI-assisted ‘cheating’. With this in mind, striking a balance between AI integration and academic integrity is critical.

    How AI impacts language learning

    Generative AI tools such as ChatGPT and Gemini have made it easier than ever for students to refine and develop their writing. However, these tools also raise concerns about whether submitted texts are student-produced, and if so, to what extent. If students rely on text generation tools instead of their own skills, our understanding of our students’ abilities may not reflect their true proficiency.

    Another issue is that if students continue to use AI for a skill they are capable of doing on their own, they’re likely to eventually lose that skill or become significantly worse at it.

    These points create a significant ethical dilemma:

    • How does AI support learning, or does it (have the potential to) replace the learning process?
    • How can educators differentiate between genuine student ability and AI-assisted responses?

    AI-integration strategies

    There are many ways in which educators can integrate AI responsibly, while encouraging our learners to do so too.

    1.ÌýRedesign tasks to make them more ‘AI-resistant’

    No task can be completely ‘AI-resistant’, but there are ways in which teachers can adapt coursebook tasks or take inspiration from activities in order to make them less susceptible to being completed using AI.

    For example:

    • Adapt writing tasks to be hyperlocal or context-specific. Generative AI is less likely to be able to generate texts that are context-bound. Focus on local issues and developments, as well as school or classroom-related topics. A great example is having students write a report on current facilities in their classroom and suggestions for improving the learning environment.
    • Focus on the process of writing rather than the final product. Have students use mind maps to make plans for their writing, have them highlight notes from this that they use in their text and then reflect on the steps they took once they’ve written their piece.
    • Use multimodal learning. Begin a writing task with a class survey, debate or discussion, then have students write up their findings into a report, essay, article or other task type.
    • Design tasks with skill-building at the core. Have students use their critical thinking skills to analyse what AI produces, creatively adapt its output and problem solve by fact-checking AI-generated text.

    2.ÌýUse AI so that students understand you know how to use it

    Depending on the policies in your institution, if you can use AI in the classroom with your students, they will see that you know about different AI tools and their output. A useful idea is to generate a text as a class, and have students critically analyse the AI-generated text. What do they think was done well? What could be improved? What would they have done differently?

    You can also discuss the ethical implications of AI in education (and other industries) with your students, to understand their view on it and better see in what situations they might see AI as a help or a hindrance.

    3.ÌýUse the GSE Learning Objectives to build confidence in language abilities

    Sometimes, students might turn to AI if they don’t know where to start with a task or lack confidence in their language abilities. With this in mind, it’s important to help your students understand where their language abilities are and what they’re working towards, with tangible evidence of learning. This is where the GSE Learning Objectives can help.

    The Global Scale of English (GSE) provides detailed, skill-specific objectives at every proficiency level, from 10 to 90. These can be used to break down complex skills into achievable steps, allowing students to see exactly what they need to do to improve their language abilities at a granular level.

    • Start by sharing the GSE Learning Objectives with students at the start of class to ensure they know what the expectations and language goals are for the lesson. At the end of the lesson, you can then have students reflect on their learning and find evidence of their achievement through their in-class work and what they’ve produced or demonstrated.
    • Set short-term GSE Learning Objectives for the four key skills – speaking, listening, reading and writing. That way, students will know what they’re working towards and have a clear idea of their language progression.
  • A teacher stood by a long wooden desk where her students are sat smiling at her

    What’s it like to teach English in France?

    By Steffanie Zazulak
    Reading time: 3 minutes

    Kirsty Murray taught English for a year at a collège (the French equivalent of a secondary school) in Villers-Cotterêts: a town in the north of France known for being the birthplace of Alexandre Dumas. She taught mixed-ability groups of 11- to 16-year-olds, with classes ranging in size from 10 to 35 students. Here, she shares the five lessons she learned from the experience.