The role of AI in English assessment

Jennifer Manning
A woman holding a tablet stood in a server room

Digital assessment is becoming more and more widespread in recent years. But what’s the role of digital assessment in teaching today? We’d like to give you some insight into digital assessment and automated scoring.

Just a few years ago, there may have been doubts about the role of AI in English assessment and the ability of a computer to score language tests accurately. But today, thousands of teachers worldwide use automated language tests to assess their students’ language proficiency.

For example, app’s suite of Versant tests have been delivering automated language assessments for nearly 25 years. And since its launch in 1996, over 350 million tests have been scored. The same technology is used in app’s Benchmark and Level tests.

So what makes automated scoring systems so reliable?

Huge data sets of exam answers and results are used to train artificial intelligence machine learning technology to score English tests the same way that human markers do. This way, we’re not replacing human judgment; we’re just teaching computers to replicate it.

Of course, computers are much more efficient than humans. They don’t mind monotonous work and don’t make mistakes (the standard marking error of an AI-scored test is lower than that of a human-scored test). So we can get unbiased, accurate, and consistent scores.

The top benefits of automated scoring are speed, reliability, flexibility, and free from bias.

Speed

The main advantage computers have over humans is that they can quickly process complex information. Digital assessments can often provide an instant score turnaround. We can get accurate, reliable results within minutes. And that’s not just for multiple-choice answers but complex responses, too.

The benefit for teachers and institutions is that they can have hundreds, thousands, or tens of thousands of learners taking a test simultaneously and instantly receive a score.

The sooner you have scores, the sooner you can make decisions about placement and students’ language level or benchmark a learner’s strengths and weaknesses and make adjustments to learning that drive improvement and progress.

Flexibility

The next biggest benefit of digital assessment is flexible delivery models. This has become increasingly more important since online learning has become more prominent.

Accessibility became key: how can your institution provide access to assessment for your learners, if you can’t deliver tests on school premises?

The answer is digital assessment.

For example, Versant, our web-based test can be delivered online or offline, on-site or off-site. All test-takers need is a computer and a headset with a microphone. They can take the test anywhere, any time of day, any day of the week, making it very flexible to fit into someone's schedule or situation.

Free from bias

Impartiality is another important benefit of AI-based scoring. The AI engine used to score digital proficiency tests is completely free from bias. It doesn’t get tired, and it doesn’t have good and bad days like human markers do. And it doesn’t have a personality.

While some human markers are more generous and others are more strict, AI is always equally fair. Thanks to this, automated scoring provides consistent, standardized scores, no matter who’s taking the test.

If you’re testing students from around the world, with different backgrounds, they will be scored solely on their level of English, in a perfectly objective way.

Additional benefits of automated scoring are security and cost.

Security

Digital assessments are more difficult to monitor than in-person tests, so security is a valid concern. One way to deal with this is remote monitoring.

Remote proctoring adds an extra layer of security, so test administrators can be confident that learners taking the test from home don’t cheat.

For example, our software captures a video of test takers, and the AI detection system automatically flags suspicious test-taker behavior. Test administrators can access the video anytime for audits and reviews, and easily find suspicious segments highlighted by our AI.

Here are a few examples of suspicious behavior that our system might flag:

Image monitoring:

  • A different face or multiple faces appearing in the frame
  • Camera blocked

Browser monitoring:

  • Navigating away from the test window or changing tabs multiple times

Video monitoring:

  • Test taker moving out of camera view
  • More than one person in the camera view
  • Looking away from the camera multiple times

Cost

Last but not least, the cost of automated English certifications are a benefit. Indeed, automated scoring can be a more cost-effective way of monitoring tests, primarily because it saves time and resources.

app English proficiency assessments are highly scalable and don’t require extra time from human scorers, no matter how many test-takers you have.

Plus, there’s no need to spend time and money on training markers or purchasing equipment.

AI is helping to lead the way with efficient, accessible, fair and cost-effective English test marking/management. Given time it should develop even further, becoming even more advanced and being of even more help within the world of English language learning and assessments.

More blogs from app

  • Precision teaching with AI: Aligning GSE objectives with generative AI for targeted materials

    By
    Reading time: 4 minutes

    English teachers today face increasing demands: create engaging content, differentiate instruction and address diverse learner needs – all within a limited time. The rise of Generative AI, like ChatGPT, offers a promising solution. But without proper guidance, AI-generated content can lack educational value. This blog post introduces a practical, research-informed approach to using AI tools aligned with the Global Scale of English (GSE). You will learn how this framework helps educators design accurate, personalized and level-appropriate English teaching materials quickly and confidently.

    Why GSE and AI are a game-changing combination for ELT

    The Global Scale of English (GSE) is a CEFR-aligned framework developed by app, offering detailed "can-do" learning objectives. It includes nearly 4,000 descriptors across speaking, listening, reading and writing skills, offering more precision than traditional level labels like A2 or B1. At the same time, Generative AI tools such as ChatGPT can generate entire lessons, tasks and assessments in seconds. The challenge lies in ensuring this content is aligned with clear pedagogical outcomes.

    Pairing AI’s creative speed with the GSE’s structured outcomes offers a scalable way to meet learner needs without compromising instructional quality.

    Unlocking measurable, differentiated and efficient teaching with GSE and AI

    The GSE makes objectives measurable

    Unlike generic teaching goals, GSE objectives are specific and measurable. For example, a B1-level learner objective might state:

    “Can identify a simple chronological sequence in a recorded narrative or dialogue.” (GSE 43)
    This clarity helps teachers define outcomes and ensure each AI-generated task targets an actual language skill, not just generic content.

    Generative AI enhances productivity

    Teachers using Generative AI can create draft lesson materials in minutes. By inputing a structured prompt such as:

    “Create a B1 reading activity that helps learners summarize the main points of a short article.”
    ChatGPT can instantly generate content that meets the learning goal. When guided by the GSE, AI becomes a collaborative assistant as well as a time-saver.

    The GSE + AI combination supports differentiation

    Because the GSE includes descriptors across a wide proficiency range (from pre-A1 to C2), teachers can tailor AI-generated content to meet the exact needs of their students. Mixed-level classrooms or tutoring contexts benefit especially from this, as teachers can create multiple versions of a task with consistent scaffolding.

    Practical tips

    • Use the GSE Teacher Toolkit to select objectives based on skill, level or function.
    • When prompting ChatGPT, include the GSE descriptor in your input for more precise results.
    • Always review and adapt the AI output to match your learners’ context, culture and curriculum.
    • Create a prompt library mapped to GSE codes to save time in future planning.

    A step-by-step example of the GSE and AI in action

    Here is a typical application of the workflow:

    1. A teacher selects a GSE objective, such as:
      “Can write a basic formal email/letter requesting information.” (GSE 46).
    2. Within seconds, a sample formal email, accompanied by a short reading comprehension task and a vocabulary activity, is generated.
    3. The reading task serves as a model to help learners analyze the structure, tone, and key language features of a well-written email before attempting their own.
    4. The teacher then reviews and refines the output for clarity, appropriateness, and context relevance.

    This process supports targeted teaching while significantly reducing preparation time.

    Overcoming challenges: Ensuring quality and relevance

    Challenge: AI outputs may lack cultural context, level appropriateness or instructional clarity.
    Solution: Always pair AI with professional judgment. Use the GSE to check that skills match the intended outcome, and adjust the complexity of the language as needed.

    Challenge: Teachers may be unfamiliar with how to write effective AI prompts.
    Solution: Start simple with templates like:

    “Create a [skill] activity at [level] that supports this GSE objective: [insert objective].”

    Challenge: Risk of over-relying on AI for instruction.
    Solution: Use AI as a starting point, not the final product. Combine AI-generated content with classroom interaction, feedback and your own creativity.

    Teaching tools that make this easier

    • : for exploring and selecting level-appropriate learning objectives
    • : for generating customizable teaching content
    • GSE Smart Lesson Generator: an AI-powered lesson creation tool developed by app that uses the GSE framework to automatically generate high-quality activities and lesson plans
    • Google Docs or Word: for editing and organizing your materials before class

    Confidently transforming English teaching

    Combining Generative AI with the Global Scale of English allows teachers to design materials that are both fast and focused. The GSE provides the structure; AI provides the speed. Together, they offer a sustainable solution for personalized English instruction that respects both learner needs and instructional quality.

  • A teacher helping a teenage student working at her desk in a library

    How teachers can use the GSE for professional development

    By
    Reading time: 4.5 minutes

    As English teachers, we’re usually the ones helping others grow. We guide learners through challenges, celebrate their progress and push them to reach new heights. But what about our own growth? How do we, as educators, continue to develop and refine our practice?

    The Global Scale of English (GSE) is often seen as a tool for assessing students. However, in my experience, it can also be a powerful guide for teachers who want to become more intentional, reflective, and confident in their teaching. Here's how the GSE has helped me in my own journey as an English teacher and how it can support yours too.

    About the GSE

    The GSE is a proficiency scale developed by app. It measures English ability across four skills – listening, speaking, reading and writing – on a scale from 10 to 90. It’s aligned with the CEFR but offers more detailed learning objectives, which can be incredibly useful in diverse teaching contexts.

    I first encountered the GSE while exploring ways to better personalize learning objectives in my Business English classes. As a teacher in a non-formal education setting in Indonesia, I often work with students who don’t fit neatly into one CEFR level. I needed something more precise, more flexible, and more connected to real classroom practice. That’s when the GSE became a turning point.

    Reflecting on our teaching practice

    The GSE helped me pause and reflect. I started reading through the learning objectives and asking myself important questions. Were my lessons really aligned with what learners at this level needed? Was I challenging them just enough or too much?

    By using the GSE as a mirror, I began to see areas where I could improve. For example, I realized that, although I was confident teaching speaking skills, I wasn’t always giving enough attention to writing development. The GSE didn’t judge me. It simply showed me where I could grow.

    Planning with purpose

    One of the best things about the GSE is that it brings clarity to lesson planning. Instead of guessing whether an activity is suitable for a student’s level, I now check the GSE objectives. If I know a learner is at GSE 50 in speaking, I can design a role-play that matches that level of complexity. If another learner is at GSE 60, I can challenge them with more open-ended tasks.

    Planning becomes easier and more purposeful. I don’t just create lessons, I design learning experiences that truly meet students where they are.

    Collaborating with other teachers

    The GSE has also become a shared language for collaboration. When I run workshops or peer mentoring sessions, I often invite teachers to explore the GSE Toolkit together. We look at learning objectives, discuss how they apply to our learners, and brainstorm ways to adapt materials.

    These sessions are not just about theory: they’re energizing. Teachers leave with new ideas, renewed motivation and a clearer sense of how to bring their teaching to the next level.

    Getting started with the GSE

    If you’re curious about how to start using the GSE for your own growth, here are a few simple steps:

    • Visit the GSE Teacher Toolkit and explore the learning objectives for the skills and levels you teach.
    • Choose one or two objectives that resonate with you and reflect on whether your current lessons address them.
    • Try adapting a familiar activity to better align with a specific GSE range.
    • Use the GSE when planning peer observations or professional learning communities. It gives your discussions a clear focus.

    Case study from my classroom

    I once had a private Business English student preparing for a job interview. Her speaking skills were solid – around GSE 55 – but her writing was more limited, probably around GSE 45. Instead of giving her the same tasks across both skills, I personalized the lesson.

    For speaking, we practiced mock interviews using complex questions. For writing, I supported her with guided sentence frames for email writing. By targeting her actual levels, not just a general CEFR level, she improved faster and felt more confident.

    That experience reminded me that when we teach with clarity, learners respond with progress.

    Challenges and solutions

    Of course, using the GSE can feel overwhelming at first. There are many descriptors, and it can take time to get familiar with the scale. My advice is to start small: focus on one skill or one level. Also, use the Toolkit as a companion, not a checklist.

    Another challenge is integrating the GSE into existing materials, and this is where technology can help. I often use AI tools like ChatGPT to adjust or rewrite tasks so they better match specific GSE levels. This saves time and makes differentiation easier.

    Teachers deserve development too

    Teaching is a lifelong journey. The GSE doesn’t just support our students, it also supports us. It helps us reflect, plan, and collaborate more meaningfully. Most of all, it reminds us that our growth as teachers is just as important as the progress of our learners.

    If you’re looking for a simple, practical, and inspiring way to guide your professional development, give the GSE a try. It helped me grow, and I believe it can help you too.

    Additional resources