Summative tests may be seen as assessment of learning. This form of assessment usually occurs toward the end of a unit or period of learning in order to describe the standard reached by the learner. Often, it takes place so that appropriate decisions about future learning or job suitability can be made.

Summative usability testing has its roots in the controlled experiments of the social sciences and psychology. In such an experiment, hypotheses are tested by modifying an independent variable in a controlled environment. The effects of this modification on one or several dependent variables are then measured and statistically analyzed.
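As a rough illustration of such an experiment, the sketch below treats the design variant as the independent variable and task completion time as the dependent variable. The task times are made-up numbers, the function name `permutation_p_value` is hypothetical, and the exact permutation test is only one possible analysis; the text does not prescribe a particular statistical test.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(group_a, group_b):
    """Two-sided exact permutation test on the difference in means."""
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    total = 0
    # Enumerate every way of relabelling the pooled measurements.
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in idx]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        # Small tolerance guards against floating-point rounding.
        if abs(mean(a) - mean(b)) >= observed - 1e-9:
            extreme += 1
        total += 1
    return extreme / total


# Hypothetical task completion times in seconds (assumed data).
times_design_a = [41.0, 38.5, 45.2, 39.9, 43.1, 40.4]
times_design_b = [52.3, 49.8, 55.0, 47.6, 51.2, 53.9]

p = permutation_p_value(times_design_a, times_design_b)
print(f"p-value: {p:.4f}")
```

A small p-value suggests that the observed difference in task time would be unlikely if the design variant had no effect. Full enumeration is only practical for small samples; larger studies would typically sample random relabellings or use a parametric test instead.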

In the early 1980s such experiments were first transferred to usability testing, and it is quite hard to mark the exact point in time when 'summative testing' was formally developed out of these methods.

An important aspect is the separation from a user test, which tries to identify usability problems but does not qualify for statistical analysis of quantitative measurements.

Such a test is often referred to as informal testing or formative evaluation. Summative usability testing is sometimes also referred to as user performance testing or formal evaluation and tries to fulfil the requirements of scientific experiments.

History

As described above, the history of this method can be found in social sciences and psychology and therefore goes back a long time in human history.

The adaptation of the method to usability testing began in the early 80s. It has been a long journey since then, and as a result many different definitions and slightly different approaches exist.

The MUSiC project can be seen as one important step to formalize the method with respect to software evaluation.

International standards

ISO standardizes usability measures and also provides a general procedure for summative usability testing.

ISO parts 2, 3 and 4 contain summative test methods to measure the ease of operation and installation of everyday products.

Benefits, Advantages and Disadvantages

The method offers empirically reliable data and can therefore be used to test hypotheses.

The central usability measures (effectiveness, efficiency and user satisfaction) can be measured. Furthermore, the method offers the possibility of detecting more complex usability flaws that less formalized methods would hardly detect.
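A minimal sketch of how these three measures might be computed from recorded test sessions is shown here. The `Session` record, the 1-to-5 rating scale and the specific definitions (completion rate for effectiveness, mean time on successful tasks for efficiency, mean rating for satisfaction) are assumptions made for illustration; a real study would fix its own operational definitions in advance.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Session:
    completed: bool   # did the participant finish the task?
    seconds: float    # time spent on the task
    rating: int       # post-task satisfaction, 1 (low) to 5 (high), assumed scale


def summarize(sessions):
    """Return simple effectiveness, efficiency and satisfaction figures."""
    successful = [s.seconds for s in sessions if s.completed]
    return {
        # Share of sessions in which the task was completed.
        "effectiveness": len(successful) / len(sessions),
        # Mean time on task, counting successful sessions only.
        "efficiency_seconds": mean(successful) if successful else None,
        # Mean satisfaction rating across all sessions.
        "satisfaction": mean(s.rating for s in sessions),
    }


# Hypothetical sessions (assumed data).
sessions = [
    Session(True, 60.0, 4),
    Session(True, 90.0, 5),
    Session(False, 120.0, 2),
    Session(True, 30.0, 5),
]
print(summarize(sessions))
```

Restricting the efficiency figure to successful sessions is a design choice: the time spent on an abandoned task says little about how quickly the task can be done.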

A correctly carried out summative test can simulate the real use of a product. It can be used to underline marketing statements with empirical evidence.

Drawbacks

The large number of participants required to get reliable data can be time consuming and expensive.

The method does not provide much support for enhancing a product, since finding usability flaws is not its main focus. The reliability of the results depends to a large extent on correct planning, execution and analysis.

It can be difficult for people not involved in the study to rate the reliability and validity of a summative test.

Appropriate Uses

The method is appropriate for finding out whether requirements have been achieved, and for comparing results with a competing product, interaction technique or earlier version.

The main goal of the method is to measure the usability of a product. As the term summative evaluation suggests, the method should be mainly applied in later stages of development. This allows integrating real tasks and, since the evaluation object is completed or nears completion, excluding possible interfering variables such as system crashes or incomplete functionality.

It is also used in post development. A more detailed description of how to do it can be found in Usability testing.

The requirements for a usability lab to run a summative test vary greatly. A mobile lab, meaning a laptop computer with recording software and a webcam, can be sufficient. However, big laboratories that include observation rooms for usability experts and developers can have advantages. The most important requirement is a controlled environment in which the experiment takes place.

Different software products exist for recording purposes, such as TechSmith Morae or Noldus Observer, which record audio, video and screen for detailed post-analysis. It is important to select participants from the expected target group. In many cases, university researchers will rely on students as participants because of cost issues.

It's not a stretch to say that assessment is a hot button issue in education; however, you'd be hard pressed to find an educator who doesn't see the value in measuring student progress.

Summative assessments are traditionally more structured and standardized than formative assessments. Still, you have a few options to shake things up that go beyond a pen-and-paper test.

1. Instructional design. Summative assessment is used as an evaluation technique in instructional design.

