13 Automated Usability Testing

Elizabeth Chang and Tharam S. Dillon

School of Computer Science and Computer Engineering, La Trobe University, Bundoora, Melbourne, Australia. chang@latcs1.lat.oz.au; tharam@latcs1.lat.oz.au

ABSTRACT: Laboratory Testing for Usability evaluation currently requires external monitoring and recording devices, such as video and audio, as well as evaluator observation of the user and the user's actions. It then requires review of these recordings, a time consuming and tedious process. We describe an automatic Usability Testing and Evaluation tool that we have developed. This consists of a piece of software called AUS (Automatic Usability Software), running at the operating system level, that collects all the information related to user actions when s/he is using a particular application. It then displays this information in various useful formats as well as calculating suitable measures of Usability. With this system there is no need for any external recording devices. Patent protection has been applied for in respect of the inventive aspects of this system.

KEY WORDS: Usability Testing, Usability Evaluation, Usability Measurement.

1. INTRODUCTION

A variety of techniques for carrying out Usability Evaluation are explained in the literature, and these include: (a) Empirical testing [Molich and Nielsen 1990], (b) Inspection [Nielsen and Philips 1993], (c) Comparative Usability measures [Dillon and Maquire 1993], (d) Formal complexity based measures [Thimbleby 1994], (e) The MUSIC methodology [Bevan & Macleod 1994].

Empirical testing consists of testing an implementation of the software in a fairly controlled situation so as to ascertain the problems that the user is experiencing with the user interface. It can be carried out in three different environments: (1) Fixed Laboratory Testing, (2) Portable Usability Laboratory Testing, (3) Remote Usability Testing.

Fixed Laboratory Testing involves a test room and a separate observation room, which are linked together usually through a one way mirror. Drawbacks of fixed laboratories include: (i) they are relatively expensive, costing typically around US$50,000 to $100,000; (ii) the test is performed away from the work environment and could be artificial; (iii) it is expensive to move observation staff and test staff to the test site.

Portable Laboratory Testing moves equipment to the user site so that the testing can be carried out in the user's company, close to or at the actual work site where the user is likely to be using the software [Dorward 1994, Rowley 1994]. Advantages of Portable Usability Testing include: (a) it is much cheaper, costing about US$20,000 for a Portable Laboratory; (b) it involves less cost for the corporation, as no people are moved to the test site; (c) the tests are carried out in a more realistic environment. This approach still relies on extensive video recording which has to be reviewed, a time consuming process. Further, there are only a limited number of things that are actually captured by

Human-Computer Interaction: INTERACT'97, S. Howard, J. Hammond & G. Lindgaard (editors). Published by Chapman & Hall ©IFIP 1997. Part Two: Technical Sessions.

the video camera. In addition, it still has the problem that test personnel and test equipment have to be sent to the site where the test subjects are, in order to carry out the testing.

Remote Usability Testing uses computer networks and modem connections to monitor what the user is doing, so neither the Usability tester nor the test subjects have to travel. However, the disadvantages are that the Usability tester cannot observe the test subject's reactions, nor does s/he have access to the dialogue that the user is encouraged to enter into to provide some feedback.

All these Usability laboratory tests have been found to be very effective in picking up specific usability problems that relate to the designated tasks. There could still be Usability problems with the User Interface which are not covered by the designated tasks. Furthermore, these approaches do not give an overall figure of merit either for the whole user interface or for significant aspects of it. Thus testing might uncover a specific problem, but it does not give an overall measure of the goodness of the Usability of the user interface.

These difficulties with Usability Testing have meant that, whilst it might be utilised for some expensive projects, it is not in widespread use for a large proportion of the software being produced today. However, the need to make software user friendly is becoming greater as time passes. There is clearly, therefore, a need for a more convenient approach to Usability Testing that would lead to more widespread Usability Testing.

2. NEW PROPOSAL

In the proposed method, a piece of software known as the Automated Usability Testing System is installed on the user's computer together with the software that is actually tested. This Usability Software is automatically linked into the operating system; it runs as part of the operating system. It works quietly in the background of the user's application, and the user will not notice that Automatic Usability Testing software is running. It is like many other tools in Windows, such as Print Manager, because it works in the background: you can continue working with your applications while the Print Manager manages the printing queue to print many files from many applications. AUS can be run in several ways, i.e. by clicking its icon or by starting automatically with the StartUp program. The Usability software opens and closes when the Windows Operating System starts up and shuts down. It always auto-saves its data at every time interval (which can be specified) and protects the data when the computer is turned off. AUS is only run and used by UI analysts or UI evaluators, not by the user. AUS automatically collects all data concerned with UI analysis and Usability evaluation.

This Usability software should be able to monitor all the user actions when s/he is carrying out the specified series of tasks. It should be able to store all the information that it has gathered, allowing the test personnel to gather various statistics or frequencies. The Automated Usability Testing software should also permit test personnel to replay user actions and hence be able to review a user's test session, at normal speed and fast speed. This Usability Testing software must also be able to generate the required Usability Metrics.

3. SPECIFICATION OF AUTOMATED USABILITY SOFTWARE (AUS)

The AUS should be capable of eventually capturing all user interactions with the system through any user interface device. The Automated Usability System needs to be able to:
• Monitor multiple applications that the user is interacting with at the same time;
• Monitor multiple windows rather than just the currently activated window.
For the purpose of the prototype system, the following functions should be provided:
(I) UI data collection needs to occur for:
=> Keystrokes within each application, within each window and within each time interval:
o Input and output from a port;
o Help system use;
o Delete (string, number ...) function use;
o Select, search or find (a menu, submenu or an item in a multiple choice field) function use;
o Modification to what the user has entered;
o Retrieve function use;
o Copy function use;

o Edit function use;
o Undo function use;
o Cancel function use;
o Resizing of windows;
o Moving of windows;
o Scroll button use;
o Network UI data transfer (sound receiver, video recording).
=> Mouse movements within each application, within each window and within each time interval:
o Left mouse key down-up;
o Right mouse key down-up;
o Middle mouse key down-up;
o Double click;
o Drag and drop;
o Mouse travel distance.
(II) The following information needs to be stored for later retrieval by the AUS:
o Keystrokes with window number and timer;
o Mouse location with window number and timer;
o System messages with timer.
(III) The Automatic Usability Software (AUS) needs to carry out the following analysis:
=> Task time measurement data are collected as below:
o Total task finishing time;
o Total computer idle time (unproductive or wasted time);
o Total computer user dialogue time;
o Total computer response time;
o Average operation time.
=> Error measurements determined include:
o Number of dialogue errors;
o Number of user requests from or to the system;
o Number of user requests that failed due to user error;
o Number of user requests that failed due to system error (eg not enough memory);
o Number of times the application was terminated by the user (user control to quit the system);
o Number of times the application crashed (out of user control).
=> Statistical data analysis of the following types is carried out:
o Fuzzy Logic Based Metric;
o Standard deviation among 4-100 users;
o Normal distribution fits to data;
o Learning curve (correlation and regression);
o FFT and spectral analysis;
o Best Case / Worst Case analysis.
(IV) The Usability Results Output should display:
o Historical keystrokes (virtual keys, extended keys, and system keys) within each predefined time interval;
o Total density of mouse clicks within each predefined time interval;
o Density of mouse clicks in each window opened within a selected time interval;
o Total number of application windows opened (created) (eg 20 WD opened);
o Number of times each window is opened (activated or deactivated) (eg WD1 opened 10 times);
o Number of times each menu (system menu, pop-up menu) is selected;
o Number of times each button is pressed (system button, speed button, dialogue button);
o Time of every user action;
o Time of task completion;
o User action playback with selected speed, pause, fast forward, backward, stop and continue (this allows a complete playback of all user actions carried out on the system within a specified time interval);
o Overall Usability figure of merit (Fuzzy Model);
o Statistical graph and metric displays;
o A playback operation of all user interactions with the system at different speeds of display;
o User action sequence diagram;
o Mouse movement pattern;
o Three categories of user file creation and choice (Novice, Intermediate, Expert);
o Up to 24 window partitions of the screen for visual comparison of all outputs for up to 24 users.

The AUS has two main components: (1) UI data collection; (2) Usability analyser and UI metrics. For the purposes of the prototype system, the following implementation has been done:
• Collect the messages the user sends to the application;
• Collect the messages the system sends to the user;
• Collect every user action for user performance playback;

• Collect historical keystrokes (virtual keys, extended keys, and system keys) and time stamps;
• Collect density of mouse clicks and mouse click time stamps;
• Collect density of mouse clicks in each window (WD) opened, with open time stamps;
• Calculate total number of windows opened (created) (eg 20 windows opened);
• Calculate the number of times each window (WD) is opened (activated or deactivated) (eg WD1 opened 10 times);
• Calculate the number of times each menu (system menu, pop-up menu) is selected;
• Calculate the number of times each button is pressed (system button, speed button, dialogue button);
• Compare mouse density for up to 24 users;
• Compare keystrokes for up to 24 users using bar and line graphs;
• Draw mouse movement diagram;
• Output overall figure of merit.

4. SYSTEM DISPLAY AND RESULTS

This section presents some typical user input screens, results screens and displays obtained using the Automated Usability Testing prototype system. The program can be placed in any Windows group and executed by double clicking on the AUS icon. Figure 1 shows AUS running in a minimised fashion, like Print Manager, in the background of the user's application software.

Figure 1. AUS is running in a minimised fashion.

The main control of the AUS Prototype is shown in Figure 2. It allows the evaluator to view (i) the statistical distribution of keystrokes with timer; (ii) mouse clicks with timer, each window opened and the number of times each window is opened; (iii) playback of user actions; (iv) the overall figure of merit. It also allows the evaluator to pre-set the test time and the data store time interval, as well as transferring the UI data file across platforms. The file name is automatically assigned from the date and time, and no user control is needed to run this program. The time is never duplicated, so when the user for some reason reboots the system, AUS will continue working with a new time stamp on the file. The default recording time is 240 minutes (4 hours). This recording time can be reassigned by the evaluator for up to hundreds of hours. When the 4 hour limit is reached, AUS is closed automatically.

If the 4 hour limit is not reached, but the user wishes to take a break or simply to reboot the system, this poses no difficulties for AUS. It automatically saves the intermediate results gathered in every predefined time interval, such as every 30 seconds, every 1 minute or every 5 minutes. This time interval can be assigned by the evaluator and depends on the type of software being tested: for a big system there is no need to save every 30 seconds, but for some small software it is necessary.

Figure 2. AUS main menu.
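The timestamped file naming and periodic autosave behaviour described above can be sketched as follows. This is a minimal illustration only: the `aus` prefix, `.dat` extension and line-per-event layout are assumptions, since the paper does not publish the actual AUS file format.

```python
import os
import tempfile
from datetime import datetime

def session_filename(prefix="aus", when=None):
    # Derive the file name from the date and time so that no two recording
    # sessions ever collide, even after a reboot (names are assumed here).
    when = when or datetime.now()
    return f"{prefix}_{when.strftime('%Y%m%d_%H%M%S')}.dat"

def autosave(events, directory, when=None):
    """Write the intermediate results gathered so far to a timestamped file."""
    path = os.path.join(directory, session_filename(when=when))
    with open(path, "w") as f:
        for e in events:
            f.write(f"{e}\n")
    return path

# Demonstration with a fixed timestamp so the name is reproducible:
with tempfile.TemporaryDirectory() as d:
    p = autosave(["0.0 key A", "1.5 click left"], d,
                 when=datetime(1997, 1, 1, 12, 0, 0))
    print(os.path.basename(p))  # → aus_19970101_120000.dat
```

In AUS this save would be triggered on the evaluator's chosen interval (30 seconds, 1 minute, 5 minutes); here the caller decides when to invoke it.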

Once the user finishes using the computer, the evaluator can either copy or transfer the data file to disk or to another machine (Figure 3) and review the test results in their own time, or can simply run AUS directly on the user's machine. AUS has great potential to summarise the UI metrics.

We carried out some sample tests with the prototype. The following screen dumps from AUS are the results of testing the Peter Coad OO Tools, which are used for drawing OO conceptual models. The user had 3 to 4 hours of learning time with this tool before we started the recording. We asked the user to draw four different diagrams using this tool, from the projects: (i) Authors and Referee; (ii) Course Booking System; (iii) ICE Company Database; and (iv) HP Call Dispatcher.

Figure 4. Keystrokes for a single task.
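The task-time measures listed under (III) in Section 3 can be derived from a recording like the one above. A minimal sketch, assuming events arrive as plain timestamps (in seconds) and that any inter-event gap longer than a threshold counts as idle time; the threshold and data layout are illustrative assumptions, not AUS's actual rules.

```python
def task_time_measures(timestamps, idle_threshold=30.0):
    """Compute total task time, idle time and active time from event timestamps.

    timestamps: sorted list of event times in seconds from recording start.
    idle_threshold: gaps longer than this are treated as unproductive time
    (an assumed heuristic for illustration).
    """
    if len(timestamps) < 2:
        return {"total": 0.0, "idle": 0.0, "active": 0.0}
    total = timestamps[-1] - timestamps[0]
    gaps = [b - a for a, b in zip(timestamps, timestamps[1:])]
    idle = sum(g for g in gaps if g > idle_threshold)
    return {"total": total, "idle": idle, "active": total - idle}

m = task_time_measures([0.0, 10.0, 20.0, 80.0, 90.0], idle_threshold=30.0)
print(m)  # → {'total': 90.0, 'idle': 60.0, 'active': 30.0}
```

Average operation time and the error counts would fall out of the same event stream once each event also carries its kind (request, error, crash, quit).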


Figure 3. UI data transfer or backup window.

Figure 5. Mouse click density for a single task.

Figure 4 shows that at the 30 minute mark the user had finished a single task, together with the keystroke statistics and the mouse clicks. Again, the time interval can be set by the evaluator. The arrow buttons here are used to switch the time forward and backward. As can be seen, there is not a great variety of keys that the user used. This is because the test software is an OO Diagramming Tool, a piece of modern software that should not need much keyboard use; rather, it should use the mouse.

Figure 5 shows the total mouse clicks for one predefined task, which finishes within 30 minutes. From the mouse density, we look for the regions where the number of clicks is largest, and search for the relevant menus, buttons or other widgets which have caused the user difficulties. The arrow buttons on the window selection allow the evaluator to switch between several window images to match against the mouse clicks.

A multiple comparison is shown in Figure 6. The user had to use the same package and enter the four diagrams on four different dates and times. The number of Ctrl, Shift and BackSpace keystrokes reduces as time goes on.
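The density view described above amounts to bucketing recorded click positions into screen regions and looking for the hottest cells. A hedged sketch; the grid cell size and the coordinate samples are illustrative assumptions, not the resolution AUS actually uses.

```python
from collections import Counter

def click_density(clicks, cell=100):
    """Bucket (x, y) click positions into a coarse grid of cell*cell pixels.

    Returns a Counter mapping grid cell -> number of clicks, so the evaluator
    can spot the regions with the highest click density, as in Figure 5.
    """
    return Counter((x // cell, y // cell) for x, y in clicks)

# Simulated click positions: two near the top-left, three in one hot region.
clicks = [(10, 10), (20, 15), (410, 305), (415, 310), (412, 308)]
density = click_density(clicks)
hottest_cell, count = density.most_common(1)[0]
print(hottest_cell, count)  # → (4, 3) 3
```

The hottest cell can then be matched against the window image on screen to find the menu or widget under it.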

Figure 6. Multiple window comparison of keystrokes.

Figure 8. Match mouse clicks with menus/sub-menus.

Figure 7. Multiple window comparison of mouse clicks.

Figure 9 displays the mouse travel pattern. Multiple windows can be shown comparing different users doing the same tasks, or one user doing one task with different software packages.

Figure 10 shows the menu used for playback of user operations or actions. It is like a video recorder. The playback speed can be set to fast: typically, if the user used the software for two hours, the playback time in fast mode is about 10 minutes. The slow mode is the speed at which the user actually worked on the computer and carried out the predefined tasks. This is a better option than video, since one can see exactly what the user clicks on, which is not clear in a fast speed playback on video.
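The video-recorder-style playback just described can be sketched as replaying timestamped events while scaling the real inter-event delays by a speed factor (1.0 for the speed the user actually worked; roughly 12 to compress two hours into ten minutes). The `render` and `sleep` parameters stand in for whatever actually redraws actions on screen; their names are assumptions for this sketch.

```python
import time

def playback(events, speed=1.0, render=print, sleep=time.sleep):
    """Replay (timestamp, action) pairs, shrinking delays by `speed`.

    events must be sorted by timestamp; speed > 1 gives fast-forward.
    """
    prev = events[0][0] if events else 0.0
    for ts, action in events:
        sleep((ts - prev) / speed)  # wait the (scaled) gap since last event
        render(action)              # redraw/show the recorded user action
        prev = ts

# Demonstration: replay instantly by stubbing out the delay and collecting
# the rendered actions instead of drawing them.
seen = []
playback([(0.0, "open menu"), (2.0, "click Draw")],
         speed=12.0, render=seen.append, sleep=lambda s: None)
print(seen)  # → ['open menu', 'click Draw']
```

Pause, stop and backward stepping would be layered on top of this loop by controlling the iteration rather than the delay.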

Figure 7 shows that the user entered the four conceptual diagrams using the OO tools at different times. The mouse clicks show that the user's skills have improved: the mouse click density is getting lighter. Because we capture each window opened, we can match the mouse clicks with each particular window, menu or button. Figure 8 shows the OO Tool main window image brought up; the database shows that there are 182 mouse clicks on the menu called "Drawing". The database stores the exact number of clicks on a particular window, or a menu, or a button.
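Because each click is stored together with the widget that received it, totals like "182 clicks on the 'Drawing' menu" fall out of a simple count over the event stream. The tuple layout below is an assumed representation for illustration, not the actual AUS database schema.

```python
from collections import Counter

# Simulated event stream: (kind, widget) pairs as an assumed layout.
events = [
    ("click", "Drawing"), ("click", "Drawing"), ("click", "File"),
    ("key", "Ctrl"), ("click", "Drawing"),
]

# Count clicks per window/menu/button, ignoring keystrokes.
clicks_per_widget = Counter(w for kind, w in events if kind == "click")
print(clicks_per_widget["Drawing"])  # → 3
```

The same counting, keyed on window identifiers instead of menu names, yields the per-window open counts shown later in Figure 13.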

Figure 9. Mouse travel pattern.
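The "mouse travel distance" item from the collection list, and the travel pattern of Figure 9, can be read off the recorded pointer positions by summing the straight-line distance between consecutive samples. The sample coordinates below are invented for illustration.

```python
import math

def mouse_travel_distance(positions):
    """Total pointer travel: sum of distances between consecutive samples."""
    return sum(math.dist(a, b) for a, b in zip(positions, positions[1:]))

# A short simulated pointer trace (pixels); the repeated point is a pause.
path = [(0, 0), (3, 4), (3, 4), (6, 8)]
print(mouse_travel_distance(path))  # → 10.0
```

Drawing the pattern itself is then just plotting the same position list as a polyline.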


Figure 10. User action playback.

Some software utilities such as Word Macro, Excel Macro or the Macro-recorder allow a replay of keystrokes, but not of mouse clicks or movements, and for a single window or a single application only [Sybex 1993]. AUS does not have these restrictions and is capable of playback for multiple windows and multiple applications.

Figure 12. Total mouse clicks in a single task.

This prototype AUS can also record each window the user opened and the number of keystrokes in that window. If a window is opened again and again, it calculates the number of times that window is opened. This is shown in Figure 13.

In Figure 11, we show the overall figure of merit obtained using the fuzzy system approach [Chang, Dillon & Cook 1996].

Figure 13. The number of times each window is opened.

Figure 11. Overall figure of merit.

The keystroke statistics for all 256 virtual keys can be recorded in AUS. In the future, it is intended to include the extended keys, which form another 256 keys, such as Ctrl-Shift-A etc. All statistical data are saved in ASCII format, which can be viewed in any word processor. Figure 12 shows, for the example in Figure 4, that in the first 30 minutes there were 705 mouse clicks. The number of clicks is listed in 30 minute intervals, and this time interval can be set.

The keystrokes database file is about 500 bytes of ASCII for a 30 minute session; the mouse density file is 20 to 30 bytes, and the mouse clicks with timer and window pop-up information amount to about 2K-3K bytes. The size of the executable code for the AUS prototype is only 620K. The dynamic link library that we created, which is used at run-time and does all the major work, is only about 158K. This DLL is designed to be compatible across different platforms for C++, such as the X Window environment.
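Because the statistics are stored as small plain ASCII files, any tool can post-process them. The paper does not give the exact layout, so the reader below assumes one "key<TAB>count" pair per line purely for illustration; the real AUS file format may differ.

```python
def read_keystroke_stats(text):
    """Parse an assumed key<TAB>count ASCII keystroke-statistics file."""
    stats = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, count = line.rsplit("\t", 1)
        stats[key] = int(count)
    return stats

# A made-up 30-minute-session file in the assumed format:
sample = "Ctrl\t12\nShift\t7\nBackSpace\t31\n"
stats = read_keystroke_stats(sample)
print(sum(stats.values()))  # → 50
```

At a few hundred bytes per session, files like this stay trivially portable across the platforms the DLL targets.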

5. RECAPITULATION

In this paper, we presented a Computer Aided Usability Testing and Evaluation Software Tool. This overcomes a number of the difficulties associated with more manual methods of Usability Testing.

The AUS permits replay of user actions such as mouse movements, screen opening, menu selection etc. for multiple screens and multiple applications. Some software utilities such as Word Macro, Excel Macro or the Microsoft Macro-recorder allow a replay of keystrokes, but not of mouse clicks or movements, and for a single window or a single application only. Also, there are many restrictions required when running such tools, such as closing all unused windows, not moving, dragging or re-sizing a window, and not being able to use the mouse [Sybex 1993]. The AUS system is capable of replaying for multiple windows and multiple applications.

Note that the several tool developments [Eldh 1996] that keep track of keystroke information are intended for: (1) debugging purposes, and (2) repeated tasks or demonstration. The debugging process uses the recorded key information to determine bugs or errors in the system. The Macro uses keyed-in sequences for repeated tasks, such as automatic re-configuration of serial ports to specific settings for use with a variety of external hardware, or changing the screen colours from one set of saved colours to another; the playback function can also be used to demonstrate some flow of presentation. Most Macros are built into the application software to display a sequence of information that a user has pre-typed within the software itself. Such tools are not capable of being used or run with any other commercial software. In her survey and evaluation, Eldh [Eldh 1996] pointed out that tools for use in debugging are not mature and cause difficulties for industry engineers in in-house software development. The software that we have developed has a totally different purpose and goal, and can be used for User Interface Testing. It runs with any software, application, tool or prototype, and is capable of analysing the interaction between multiple windows and multiple applications, and between the user and the system, at the same time, for the purpose of Usability Testing and Evaluation.

REFERENCES

[1] Bevan, N. & Macleod, M. (1994) "Usability measurement in context". Behaviour and Information Technology, pp 132-145.
[2] Chang, E., Dillon, T.S. & Cook, D. (1996) "Measurement of Usability of Software Using a Fuzzy System Approach". Invited paper, Proc. of 8th Int. Conf. on Software Eng. and Knowledge Eng., pp 69-76, Lake Tahoe, USA.
[3] Dillon, A. and Maquire, M. (1993) "Usability Measurement: Its Practical Value to the Computer Industry". Proc. of ACM/IFIP Human Factors in Comp. Sys., INTERCHI'93, pp 145-148.
[4] Dorward, F. (1994) "The advantages of portable Usability Testing". Proc. of OZCHI'94, pp 125-126.
[5] Eldh, S. (1996) "The Challenge of Test Automation: a Process Change". Research report, Ericsson Hewlett-Packard Telecommunications AB, Sweden.
[6] Molich, R. and Nielsen, J. (1990) "Heuristic evaluation of User Interfaces". Proc. of ACM Human Factors in Comp. Sys., CHI'90, pp 249-256.
[7] Nielsen, J. and Philips, V. (1993) "Estimating the Relative Usability of Two Interfaces: Heuristic, Formal, and Empirical Methods Compared". Proc. of ACM/IFIP Human Factors in Comp. Sys., INTERCHI'93, pp 214-221.
[8] Nielsen, J. (1993) "Enhancing the Explanatory Power of Usability Heuristics". Proc. of ACM/IFIP Human Factors in Comp. Sys., INTERCHI'93, pp 152-158.
[9] Rowley, D.E. (1994) "Usability Testing in the Field: Bringing the Laboratory to the User". Proc. of ACM/SIGCHI Human Factors in Comp. Sys., CHI'94, pp 252-257.
[10] Thimbleby, H. (1994) "Formulating Usability". SIGCHI Bulletin, April 1994, pp 59-64.
[11] Treu, S. (1994) "User Interface Evaluation: A Structured Approach", pp 175-195.