The role of AI in English assessment

Jennifer Manning
A woman holding a tablet stood in a server room

Digital assessment is becoming more and more widespread in recent years. But what’s the role of digital assessment in teaching today? We’d like to give you some insight into digital assessment and automated scoring.

Just a few years ago, there may have been doubts about the role of AI in English assessment and the ability of a computer to score language tests accurately. But today, thousands of teachers worldwide use automated language tests to assess their students’ language proficiency.

For example, app’s suite of Versant tests have been delivering automated language assessments for nearly 25 years. And since its launch in 1996, over 350 million tests have been scored. The same technology is used in app’s Benchmark and Level tests.

So what makes automated scoring systems so reliable?

Huge data sets of exam answers and results are used to train artificial intelligence machine learning technology to score English tests the same way that human markers do. This way, we’re not replacing human judgment; we’re just teaching computers to replicate it.

Of course, computers are much more efficient than humans. They don’t mind monotonous work and don’t make mistakes (the standard marking error of an AI-scored test is lower than that of a human-scored test). So we can get unbiased, accurate, and consistent scores.

The top benefits of automated scoring are speed, reliability, flexibility, and free from bias.

Speed

The main advantage computers have over humans is that they can quickly process complex information. Digital assessments can often provide an instant score turnaround. We can get accurate, reliable results within minutes. And that’s not just for multiple-choice answers but complex responses, too.

The benefit for teachers and institutions is that they can have hundreds, thousands, or tens of thousands of learners taking a test simultaneously and instantly receive a score.

The sooner you have scores, the sooner you can make decisions about placement and students’ language level or benchmark a learner’s strengths and weaknesses and make adjustments to learning that drive improvement and progress.

Flexibility

The next biggest benefit of digital assessment is flexible delivery models. This has become increasingly more important since online learning has become more prominent.

Accessibility became key: how can your institution provide access to assessment for your learners, if you can’t deliver tests on school premises?

The answer is digital assessment.

For example, Versant, our web-based test can be delivered online or offline, on-site or off-site. All test-takers need is a computer and a headset with a microphone. They can take the test anywhere, any time of day, any day of the week, making it very flexible to fit into someone's schedule or situation.

Free from bias

Impartiality is another important benefit of AI-based scoring. The AI engine used to score digital proficiency tests is completely free from bias. It doesn’t get tired, and it doesn’t have good and bad days like human markers do. And it doesn’t have a personality.

While some human markers are more generous and others are more strict, AI is always equally fair. Thanks to this, automated scoring provides consistent, standardized scores, no matter who’s taking the test.

If you’re testing students from around the world, with different backgrounds, they will be scored solely on their level of English, in a perfectly objective way.

Additional benefits of automated scoring are security and cost.

Security

Digital assessments are more difficult to monitor than in-person tests, so security is a valid concern. One way to deal with this is remote monitoring.

Remote proctoring adds an extra layer of security, so test administrators can be confident that learners taking the test from home don’t cheat.

For example, our software captures a video of test takers, and the AI detection system automatically flags suspicious test-taker behavior. Test administrators can access the video anytime for audits and reviews, and easily find suspicious segments highlighted by our AI.

Here are a few examples of suspicious behavior that our system might flag:

Image monitoring:

  • A different face or multiple faces appearing in the frame
  • Camera blocked

Browser monitoring:

  • Navigating away from the test window or changing tabs multiple times

Video monitoring:

  • Test taker moving out of camera view
  • More than one person in the camera view
  • Looking away from the camera multiple times

Cost

Last but not least, the cost of automated English certifications are a benefit. Indeed, automated scoring can be a more cost-effective way of monitoring tests, primarily because it saves time and resources.

app English proficiency assessments are highly scalable and don’t require extra time from human scorers, no matter how many test-takers you have.

Plus, there’s no need to spend time and money on training markers or purchasing equipment.

AI is helping to lead the way with efficient, accessible, fair and cost-effective English test marking/management. Given time it should develop even further, becoming even more advanced and being of even more help within the world of English language learning and assessments.

More blogs from app

  • a young man sat in a lecture hall with other students behind him

    How the GSE helped Salem State University meet learner needs

    By Sara Davila

    Salem State University is one of the largest and most diverse public teaching universities in Massachusetts. In total, it has about 8,700 students enrolled, 37% of whom are people of color. It also educates 221 international students from 59 different countries – with China, Albania, Brazil, Morocco, Nigeria and Japan among the most represented countries on campus.

    The university runs an intensive English language program. Most students who enrol come from China, Brazil, Albania, Vietnam, and Japan. The program also has a number of part-time English language learners from the local community.

    In 2016, Associate Director Shawn Wolfe and teachers at the American Language and Culture Institute did a review and found that areas for growth included establishing a universal documentation for identifying learner needs, goals and progress.

    “The biggest challenge was that we needed to have a better way of placing students,” Wolfe says. “We also needed to have a way to have our curriculum, our assessment and our student learning outcomes unified.”

    The team lacked programmatic data related to learning gains and outcomes. Additionally, they realized that assessments could be used to inform students about entry requirements at the university and other programs. And that’s where the Global Scale of English (GSE) came in, as a tool which enabled the staff at the American Language and Culture Institute to personalize and diversity their English teaching program to meet learner needs.

    Cultural and linguistic diversity

    David Silva PhD, the Provost and Academic Vice President, highlights the need for this type of personalization when it comes to education.

    “We have to be prepared for an increasing variety of learners and learning contexts. This means we have to make our learning contexts real,” he says. “We have to think about application, and we have to think about how learners will take what they learn and apply it, both in terms of so-called book smarts, but also in terms of soft skills, because they’re so important.”

    Silva makes the point that, as the world gets smaller and technology becomes a bigger part of our lives, we can be anywhere at any time, working with anyone from across the globe. “We need to be prepared,” he says, “for those cultural and linguistic differences that we’re going to face in our day-to-day jobs.”

    The ability to change and adapt

    So how does the curriculum at the American Language and Culture Institute help prepare students for the world of study and work?

    At the Institute, the general review led to the realization that the program needed to be adaptive and flexible. This would provide a balance between general English and academic preparation and would also encompass English for specific purposes (ESP).

    Wolfe says, “The GSE fit with what we were trying to do because it offers three different options; English for academic learners, English for professionals and English for adults, which is another area that we realized we needed to add to our evening program so that we can serve working adults that are English language learners in our community.”

    The English language instructors at the Institute were also impressed with the capabilities of the GSE. Joni Hagigeorges, one of the instructors, found the GSE to be an excellent tool for tracking student progress.

    “What I really like is that you can choose the skill – , listening, speaking – and you’re given the can-do statements, the learning objectives that each student will need to progress to the next level,” she said.

    Wolfe also commented on the GSE Teacher Toolkit and the way that it supports assessment and planning, allowing instructors to get ideas for specific learning objectives for groups or individual students. “It’s enabled us to personalize learning, and it’s changed the way that our teachers are planning their lessons, as well as the way that they are assessing the students.”

    A curriculum that will meet learner needs

    The GSE has allowed the team at the Institute to become more responsive to changing student expectations. The alignment of placement and progress tests to the GSE has allowed instructors to have more input into the courses they are teaching.

    Elizabeth Cullen, an English language instructor at the Institute, said, “The GSE helps us assess the strengths and weaknesses of various textbooks. It has helped us develop a unified curriculum, and a unified assessment mechanism.”

    This unification means that the curriculum can easily be tweaked or redesigned quickly to meet the needs of the students. What’s more, as Elizabeth points out, the students benefit too. “The Global Scale of English provides students with a road map showing them where they are now, where they want to go and how they’re going to get there.”

    Standing out from the crowd

    In this time of global hyper-competition, the challenge for any language program is finding innovative ways to stand out from the crowd while staying true to your identity. At Salem State, the staff found that the GSE was the perfect tool for the modern, data-driven approach to education, inspiring constant inquiry, discussion and innovation. It offers students, instructors and administrators a truly global metric to set and measure goals, and go beyond the ordinary.