Explaining computerized English testing in plain English


Research has shown that automated scoring can give more reliable and objective results than human examiners when evaluating a person’s mastery of English. This is because an automated scoring system is impartial, unlike humans, who can be influenced by irrelevant factors such as a test taker’s appearance or body language. Additionally, automated scoring treats regional accents equally, unlike human examiners, who may favor accents they are more familiar with. Automated scoring also allows individual features of a spoken or written response to be analyzed independently of one another, so that a weakness in one area of language does not affect the scoring of other areas.

PTE Academic was created in response to the demand for a more accurate, objective, secure and relevant test of English. Our automated scoring system is a central feature of the test, and vital to ensuring the delivery of accurate, objective and relevant results – no matter who the test taker is or where the test is taken.

Development and validation of the scoring system to ensure accuracy

PTE Academic’s automated scoring system was developed after extensive research and field testing. A prototype test was developed and administered to a sample of more than 10,000 test takers from 158 different countries, speaking 126 different native languages. This data was collected and used to train the automated scoring engines for both the written and spoken PTE Academic items.

To do this, multiple trained human markers assess each answer. Those ratings are used as the training material for machine learning algorithms, similar to those behind systems like Google Search or Apple’s Siri. The model makes initial guesses at the score each response should receive, consults the actual human scores to see how well it did, adjusts its parameters to reduce the error, then goes through the training set over and over again, adjusting and improving until it arrives at a solution that comes as close as possible to predicting the full set of human ratings.
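
To make that loop concrete, here is a minimal sketch of supervised training against human ratings, assuming each response has already been converted into a numeric feature vector. The data, feature count and simple linear model are illustrative assumptions; PTE Academic’s actual models and features are proprietary.

```python
# A minimal sketch of the training loop described above. All data here is
# synthetic; real systems use far richer features and models.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical training set: 1,000 responses, 12 extracted features each,
# paired with (simulated) averaged human ratings.
X = rng.normal(size=(1000, 12))
human_scores = X @ rng.normal(size=12) + rng.normal(scale=5.0, size=1000)

w = np.zeros(12)            # model weights, adjusted on every pass
lr = 0.01                   # learning rate

for epoch in range(500):    # "goes through the training set over and over"
    predictions = X @ w
    error = predictions - human_scores
    gradient = X.T @ error / len(X)   # direction that reduces the error
    w -= lr * gradient                # adjust and improve

# After training, predictions should sit close to the human ratings.
print("correlation with human scores:",
      np.corrcoef(X @ w, human_scores)[0, 1])
```

Each pass measures the gap between predicted and human scores and nudges the weights in the direction that shrinks it – the "adjusting and improving" described above.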

Once trained and performing at a high level, this model is used as a marking algorithm, able to score new responses just as human markers would. Correlations between scores given by this system and those given by trained human markers are high. The standard error of measurement between app’s system and a human rater is lower than that between one human rater and another – in other words, the machine’s scores agree with a human rater more closely than two human raters agree with each other, because much of the bias and unreliability has been removed. In general, you can think of a machine scoring system as one that distills the most consistent information out of many human ratings, then acts like an idealized human marker.
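
The comparison between machine–human and human–human agreement can be illustrated with a short calculation. The scores below are invented, and the statistics used (Pearson correlation, plus a common estimate of the standard error of measurement from paired score differences) are standard textbook formulas, not app’s actual validation procedure.

```python
# A hedged sketch of how rater agreement might be quantified.
import numpy as np

machine = np.array([72, 65, 80, 58, 90, 77, 61, 84], dtype=float)
human_1 = np.array([70, 66, 82, 55, 88, 75, 63, 85], dtype=float)
human_2 = np.array([68, 70, 78, 60, 86, 72, 60, 88], dtype=float)

def agreement(a, b):
    r = np.corrcoef(a, b)[0, 1]               # correlation between raters
    # One common SEM estimate from paired differences, assuming both
    # raters contribute equal, independent error variance.
    sem = np.std(a - b, ddof=1) / np.sqrt(2)
    return r, sem

print("machine vs human:", agreement(machine, human_1))
print("human vs human:  ", agreement(human_1, human_2))
```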

app conducts scoring validation studies to ensure that the machine scores are consistently comparable to ratings given by skilled human raters. Here, a new set of test-taker responses (never seen by the machine) is scored both by human raters and by the automated scoring system. Research has demonstrated that the automated scoring technology underlying PTE Academic produces scores comparable to those obtained from careful human experts. This means that the automated system “acts” like a human rater when assessing test takers’ language skills, but does so with a machine’s precision, consistency and objectivity.

Scoring speaking responses with app’s Ordinate technology

The spoken portion of PTE Academic is automatically scored using app’s Ordinate technology. Ordinate technology results from years of research in speech recognition, statistical modeling, linguistics and testing theory. The technology uses a proprietary speech processing system that is specifically designed to analyze and automatically score speech from fluent and second-language English speakers. The Ordinate scoring system collects hundreds of pieces of information from a test taker’s spoken response beyond the words themselves, such as pace, timing and rhythm, as well as vocal power, emphasis, intonation and accuracy of pronunciation. It is trained to recognize even somewhat mispronounced words, and quickly evaluates the content, relevance and coherence of the response. In particular, the meaning of the spoken response is evaluated, making it possible for the models to assess whether what was said deserves a high score.
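
As a rough illustration of the timing side of this kind of analysis, the sketch below derives a few simple features (proportion of time spent speaking, pause count) from per-frame energy. The frame size, threshold and features are assumptions chosen for illustration; Ordinate’s real signal processing is proprietary and far more sophisticated.

```python
# A toy extraction of timing features from a mono 16 kHz recording
# represented as a float array. Purely illustrative.
import numpy as np

def timing_features(samples, sr=16000, frame_ms=25, energy_floor=0.02):
    frame = int(sr * frame_ms / 1000)
    n = len(samples) // frame
    # Per-frame RMS energy: a crude voiced/silent detector.
    energy = np.sqrt((samples[:n * frame].reshape(n, frame) ** 2).mean(axis=1))
    voiced = energy > energy_floor
    return {
        "speech_ratio": float(voiced.mean()),   # share of time speaking
        "pause_count": int(np.sum(np.diff(voiced.astype(int)) == -1)),
        "duration_sec": n * frame_ms / 1000,
    }

# Usage with a hypothetical 5-second recording (random stand-in for speech):
audio = np.random.uniform(-0.1, 0.1, size=16000 * 5)
print(timing_features(audio))
```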

Scoring writing responses with Intelligent Essay Assessor™ (IEA)

The written portion of PTE Academic is scored using the Intelligent Essay Assessor™ (IEA), an automated scoring tool powered by app’s state-of-the-art Knowledge Analysis Technologies™ (KAT) engine. Based on more than 20 years of research and development, the KAT engine automatically evaluates the meaning of text, such as an essay written by a student in response to a particular prompt. The KAT engine evaluates writing as accurately as skilled human raters using a proprietary application of the mathematical approach known as Latent Semantic Analysis (LSA). LSA derives the meaning of words and passages by analyzing large bodies of relevant text. Therefore, using LSA, the KAT engine can understand the meaning of text much like a human.
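
For readers curious about the underlying technique, here is a minimal LSA sketch using scikit-learn’s TF-IDF and truncated SVD. It illustrates the general mathematical idea only; the corpus is made up, and the KAT engine’s proprietary application of LSA is far more elaborate.

```python
# A minimal LSA sketch: TF-IDF followed by truncated SVD.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity

corpus = [
    "Rising sea levels threaten coastal cities around the world.",
    "Urban flooding is worsened by climate change and poor drainage.",
    "My favourite dish is pasta with fresh basil and tomatoes.",
]
essay = ["Climate change puts seaside towns at risk of flooding."]

tfidf = TfidfVectorizer()
X = tfidf.fit_transform(corpus + essay)

# Project into a low-dimensional "semantic" space; with such a tiny
# corpus only 2 components make sense, real systems use hundreds.
lsa = TruncatedSVD(n_components=2, random_state=0)
Z = lsa.fit_transform(X)

# The essay should land closest to the texts about the same topic.
print(cosine_similarity(Z[-1:], Z[:-1]))
```

Texts about the same topic end up close together in the reduced semantic space, which is how LSA can judge meaning even without exact word overlap.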

What aspects of English does PTE Academic assess?

Written scoring

  • Word choice
  • Grammar and mechanics
  • Progression of ideas
  • Organization
  • Style, tone
  • Paragraph structure
  • Development, coherence
  • Point of view
  • Task completion

Spoken scoring

  • Sentence mastery
  • Content
  • Vocabulary
  • Accuracy
  • Pronunciation
  • Intonation
  • Fluency
  • Expressiveness
  • Pragmatics

More blogs from app

  • Improve your strategic workforce planning with English language testing

    By Samantha Ball
    Reading time: 3 minutes

    Companies constantly seek methods to optimize workforce productivity and effectiveness. A powerful approach to achieving this goal is through strategic workforce planning bolstered by English language testing. This tactic not only identifies and addresses skills gaps but also reduces attrition and strengthens your workforce for both short-term and long-term success.

  • How to assess your learners using the GSE Assessment Frameworks

    By Billie Jago
    Reading time: 4 minutes

    With language learning, assessing both the quality and the quantity of language use is crucial for accurate proficiency evaluation. While evaluating quantity (for example, the number of words written or the duration of spoken production) can provide insights into a learner’s fluency and engagement in a task, it doesn’t give a full picture of a learner’s language competence. For this, learners would also need to be evaluated on the quality of what they produce (such as the appropriateness, accuracy and complexity of language use). Quality covers factors such as grammatical accuracy, lexical choice, coherence and the ability to convey meaning effectively.

    In order to measure the quality of different language skills, you can use the Global Scale of English (GSE) assessment frameworks.

    Developed in collaboration with assessment experts, the GSE Assessment Frameworks are intended to be used alongside the GSE Learning Objectives to help you assess the proficiency of your learners.

    There are two GSE Assessment Frameworks: one for adults and one for young learners.

    What are the GSE Assessment Frameworks?

    • The GSE Assessment Frameworks are intended to be used alongside the GSE Learning Objectives to help teachers assess their learners’ proficiency across all four skills (speaking, listening, reading and writing).
    • The GSE Learning Objectives focus on the things a learner can do, while the GSE Assessment Frameworks focus on how well a learner can do these things.
    • They provide examples of the proficiencies your learners should be demonstrating.
    • They help teachers pinpoint students’ specific areas of strength and weakness more accurately, facilitating targeted instruction and personalized learning plans.
    • They can also help to motivate your learners, as their progress is evidenced and they can see a clear path for improvement.

    An example of the GSE Assessment Frameworks

    This example is from the Adult Assessment Framework for speaking.

    As you can see, there are sub-skills within speaking (and for the other three main overarching skills – writing, listening and reading). Within speaking, these are production and fluency, spoken interaction, language range and accuracy.

    The GSE range (and corresponding CEFR level) is shown at the top of each column, and there are descriptors that students should ideally demonstrate at that level.

    However, it is important to note that students may sit across different ranges, depending on the sub-skill. For example, your student may show evidence of GSE 43-50 production and fluency and spoken interaction, but they may need to improve their language range and accuracy, and therefore sit in a range of GSE 36-42 for these sub-skills.
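
    A simple way to picture this per-sub-skill reporting is a small lookup from each sub-skill’s GSE score to its range, as sketched below. The band boundaries and CEFR labels are approximate assumptions based on the ranges mentioned above, not an official mapping; the learner’s scores are invented.

    ```python
    # Hypothetical per-sub-skill GSE range reporting. Bands are
    # illustrative assumptions, not an official GSE-CEFR mapping.
    BANDS = [
        (36, 42, "GSE 36-42 (around A2+)"),
        (43, 50, "GSE 43-50 (around B1)"),
        (51, 58, "GSE 51-58 (around B1+)"),
    ]

    def band_for(score):
        for low, high, label in BANDS:
            if low <= score <= high:
                return label
        return "outside illustrated bands"

    # Invented learner: strong fluency, weaker range and accuracy.
    learner = {
        "production and fluency": 46,
        "spoken interaction": 44,
        "language range": 38,
        "accuracy": 37,
    }
    for sub_skill, score in learner.items():
        print(f"{sub_skill}: {band_for(score)}")
    ```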

  • English Teacher Awards 2024: Understanding the categories

    By Thomas Gardner
    Reading time: 4 minutes

    Teachers shape every aspect of our learning experience, especially when it comes to language learning. Great teachers give learners not only the skills but also the confidence to go out into the world, speak up and discover new opportunities.

    We’re celebrating those exceptional educators with the app English Teacher Awards 2024.

    With five different categories and a Gold, Silver and Bronze winner in each, there are 15 chances to take home thousands of pounds worth of top prizes for the winning teachers and their schools.

    Find out more about who can enter and the different categories in this article.