Explaining computerized English testing in plain English

Pearson Languages
a pair of hands typing at a laptop

Research has shown that automated scoring can give more reliable and objective results than human examiners when evaluating a person’s mastery of English. This is because an automated scoring system is impartial, unlike humans, who can be influenced by irrelevant factors such as a test taker’s appearance or body language. Additionally, automated scoring treats regional accents equally, unlike human examiners who may favor accents they are more familiar with. Automated scoring also allows individual features of a spoken or written test question response to be analyzed independent of one another, so that a weakness in one area of language does not affect the scoring of other areas.

PTE Academic was created in response to the demand for a more accurate, objective, secure and relevant test of English. Our automated scoring system is a central feature of the test, and vital to ensuring the delivery of accurate, objective and relevant results – no matter who the test-taker is or where the test is taken.

Development and validation of the scoring system to ensure accuracy

PTE Academic’s automated scoring system was developed after extensive research and field testing. A prototype test was developed and administered to a sample of more than 10,000 test takers from 158 different countries, speaking 126 different native languages. This data was collected and used to train the automated scoring engines for both the written and spoken PTE Academic items.

To do this, multiple trained human markers assess each answer. Those results are used as the training material for machine learning algorithms, similar to those used by systems like Google Search or Apple’s Siri. The model makes initial guesses as to the scores each response should get, then consults the actual scores to see well how it did, adjusts itself in a few directions, then goes through the training set over and over again, adjusting and improving until it arrives at a maximally correct solution – a solution that ideally gets very close to predicting the set of human ratings.

Once trained up and performing at a high level, this model is used as a marking algorithm, able to score new responses just like human markers would. Correlations between scores given by this system and trained human markers are quite high. The standard error of measurement between Pearson’s system and a human rater is less than that between one human rater and another – in other words, the machine scores are more accurate than those given by a pair of human raters, because much of the bias and unreliability has been squeezed out of them. In general, you can think of a machine scoring system as one that takes the best stuff out of human ratings, then acts like an idealized human marker.

Pearson conducts scoring validation studies to ensure that the machine scores are consistently comparable to ratings given by skilled human raters. Here, a new set of test-taker responses (never seen by the machine) are scored by both human raters and by the automated scoring system. Research has demonstrated that the automated scoring technology underlying PTE Academic produces scores comparable to those obtained from careful human experts. This means that the automated system “acts” like a human rater when assessing test takers’ language skills, but does so with a machine's precision, consistency and objectivity.

Scoring speaking responses with Pearson’s Ordinate technology

The spoken portion of PTE Academic is automatically scored using Pearson’s Ordinate technology. Ordinate technology results from years of research in speech recognition, statistical modeling, linguistics and testing theory. The technology uses a proprietary speech processing system that is specifically designed to analyze and automatically score speech from fluent and second-language English speakers. The Ordinate scoring system collects hundreds of pieces of information from the test takers’ spoken responses in addition to just the words, such as pace, timing and rhythm, as well as the power of their voice, emphasis, intonation and accuracy of pronunciation. It is trained to recognize even somewhat mispronounced words, and quickly evaluates the content, relevance and coherence of the response. In particular, the meaning of the spoken response is evaluated, making it possible for these models to assess whether or not what was said deserves a high score.

Scoring writing responses with Intelligent Essay Assessor™ (IEA)

The written portion of PTE Academic is scored using the Intelligent Essay Assessor™ (IEA), an automated scoring tool powered by Pearson’s state-of-the-art Knowledge Analysis Technologies™ (KAT) engine. Based on more than 20 years of research and development, the KAT engine automatically evaluates the meaning of text, such as an essay written by a student in response to a particular prompt. The KAT engine evaluates writing as accurately as skilled human raters using a proprietary application of the mathematical approach known as Latent Semantic Analysis (LSA). LSA evaluates the meaning of language by analyzing large bodies of relevant text and their meanings. Therefore, using LSA, the KAT engine can understand the meaning of text much like a human.

What aspects of English does PTE Academic assess?

Written scoring

Spoken scoring

  • Word choice
  • Grammar and mechanics
  • Progression of ideas
  • Organization
  • Style, tone
  • Paragraph structure
  • Development, coherence
  • Point of view
  • Task completion
  • Sentence mastery
  • Content
  • Vocabulary
  • Accuracy
  • Pronunciation
  • Intonation
  • Fluency
  • Expressiveness
  • Pragmatics

More blogs from Pearson

  • A business woman sat at a table in a office writing notes

    Hard skills vs. soft skills: The impact of language learning

    By Charlotte Guest
    Reading time: 6 minutes

    Hard skills and soft skills play a crucial role in defining career success and progression. The difference between hard skills and soft skills is that hard skills are teachable, technical, measurable abilities specific to particular jobs, while soft skills are more interpersonal, universal and related to personality traits. While hard skills refer to the technical knowledge and specific abilities required to perform a job, soft skills are more intangible. They encompass the interpersonal attributes and personality traits that enable individuals to communicate effectively, work collaboratively and adapt to changes in the workplace environment.

    In this blog post, we will explore how learning a new language can significantly enhance both hard and soft skills, making you a more versatile and effective professional in today’s multifaceted work environment.

    Understanding the balance of hard and soft skills

    Hard skills might get your foot in the door, showcasing your qualifications for a position. Developing hard skills to stand out from other job seekers is crucial; take advantage of classes, webinars and workshops offered by your current employer to develop hard skills and learn new technical skills. Examples include proficiency in a particular software, certification in a specific field, or mastery of a technical domain. However, it’s the soft skills, such as effective communication, collaboration, critical thinking and emotional intelligence, that propel you through the door and into the realms of career advancement. Recent research underscores the growing importance of English proficiency as a pivotal element in this dynamic, equally vital for enhancing both sets of skills.

    What are examples of soft skills?

    Soft skills encompass a wide range of attributes that can significantly impact workplace efficiency and harmony. Examples of essential soft skills include:

    Communication: The ability to convey information clearly and effectively is paramount. This includes both verbal and written communication, as well as active listening skills.

    Teamwork: Collaborating well with others, often with diverse backgrounds and perspectives, to achieve common goals.

    Problem-solving: The capability to analyze situations, identify problems and devise effective solutions.

    Adaptability: The readiness to adjust to new conditions, workflows, or technologies, demonstrating flexibility in the face of change.

    Critical thinking: The process of objectively analyzing information to make informed decisions.

    Emotional intelligence: The ability to understand, manage and utilize one's emotions constructively while also recognizing and influencing the emotions of others.

    What are examples of hard skills?

    Hard skills are quantifiable, teachable abilities specific to a job or industry. These skills are typically acquired through formal education, training programs and practical experience. Some examples of essential hard skills include:

    Computer programming: Proficiency in coding and programming languages, such as Python, Java, C++, or HTML/CSS is crucial for software development and web design roles.

    Data analysis: The ability to interpret complex data sets using tools like Excel, SQL, or R, providing valuable insights and informing decision-making processes.

    Graphic design: Mastery of design software such as Adobe Photoshop, Illustrator, and InDesign, enabling the creation of visual content for various media.

    Foreign language proficiency: Fluency in a second language can be an asset in international business, for example, in translation services or customer support roles.

    Project management: Knowledge of project management methodologies (e.g., Agile, Scrum) and tools (e.g., Microsoft Project, Jira) to plan, execute and oversee projects effectively.

    Technical writing: The skill of crafting clear, precise documentation and instructional materials, essential in industries such as engineering, IT and pharmaceuticals.

  • Business people sat and waiting in a row

    Boost the quality of your hires with English proficiency testing

    By Samantha Yates
    Reading time: 6.5 minutes

    Hire quality is top of the agenda for recruiters and talent acquisition leaders. Discover the impact of English skill testing on hiring fit-for-role employees.

    The results are in… thousands of recruiting professionals and top talent acquisition leaders say that sourcing high-quality candidates is their number one objective in 2024 and beyond.

    54% of recruiters are now prioritizing quality of hire above all else, according to LinkedIn’s Talent Solutions report The Future of Recruiting 2024. The report also highlights that 73% are using a skills-based approach to find top-quality hires, faster, with skills that fit the business both now and in future.

    Getting recruitment right can drastically impact productivity. In the UK alone, effective recruitment boosts productivity by £7.7bn each year, according to the Recruitment and Employment Confederation (REC). Conversely, the direct and indirect costs of mistake hires are a constant concern to organizations, not just in the UK but around the world. According to a survey of 400 hiring decision-makers by CareerBuilder, 75% have hired the wrong person and say that one bad hire costs them nearly $17,000 on average. It’s no surprise then that skills-based quality hiring is such a top priority for recruiters.

    It’s harder than it might seem to systematically increase the quality of your hires, especially when you’re recruiting at scale. But the rewards are high when you get it right and a skills-first approach increases your chances of success – particularly when you focus on core skills like English proficiency that underpin communication. As an added bonus, skills-based testing can speed up the recruitment process significantly.

  • A group of women celebrating with confetti

    The Global Scale of English: A decade of innovation in language education

    By Pearson Languages
    Reading time: 4 minutes

    This month marks 10 years since the launch of the Global Scale of English (GSE) and what a journey it has been. As we celebrate this important milestone, it’s time to reflect on everything that has been achieved over the past decade and to take pride in the work that has contributed to the advancement of language learning, teaching and assessment around the world.