{"id":118,"date":"2012-05-30T23:50:00","date_gmt":"2012-05-30T23:50:00","guid":{"rendered":"http:\/\/measuringu.com\/sum\/"},"modified":"2022-03-21T18:24:58","modified_gmt":"2022-03-22T00:24:58","slug":"sum","status":"publish","type":"post","link":"https:\/\/measuringu.com\/sum\/","title":{"rendered":"10 Things to Know about the Single Usability Metric (SUM)"},"content":{"rendered":"

There is no usability thermometer to tell you how easy a website or software application is to use.<\/p>\n

Instead, we rely on the outcomes of good and bad experiences, which provide evidence for the construct of usability<\/a>.<\/p>\n

Combining multiple usability metrics into a single usability metric (SUM) is something we proposed seven years ago<\/a> [PDF]<\/span> and wrote about in Chapter 9 of Quantifying the User Experience<\/a>.<\/p>\n

Here are 10 things to know about single measures of usability.<\/p>\n

    \n
  1. Usability is the intersection of effectiveness, efficiency and satisfaction (ISO 9241 pt 11<\/a>). One of the best measures of usability is a combination of metrics that describes each of these aspects.<\/li>\n
  2. The most common usability metrics<\/a> are completion rates and errors (effectiveness), task-times (efficiency) and task-level satisfaction (satisfaction). These metrics tend to have a moderate correlation[PDF]<\/span><\/a> with each other of r<\/span> = .3 to .5. The correlation is strong enough to suggest overlap (e.g., users who commit more errors tend to take longer) but not strong enough for one metric to substitute for another.<\/li>\n
  3. By averaging together standardized versions of completion rates, task-times, task-level satisfaction and errors, you generate a Single Usability Metric (SUM) that summarizes the majority of the information in all four measures. Averaging weights each metric equally. Despite many discussions about which metric should “count” more, our analysis found that a simple average is the least subjective and reflects the data best (from a principal components analysis[PDF]<\/span><\/a>). Keep in mind that if you weight one metric heavily, you must reduce the weight of another, often to the point where the additional metric contributes little.<\/li>\n
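The equal-weight averaging described above can be sketched in a few lines. The four component scores below are made-up percentages for illustration, not data from the article:

```python
from statistics import fmean

# Hypothetical standardized scores (as proportions) for one task;
# these numbers are illustrative only.
completion = 0.80     # completion rate
time_score = 0.72     # task time, converted to a percentage
satisfaction = 0.85   # task-level satisfaction, converted to a percentage
errors = 0.90         # error-free rate

# SUM is the simple (equal-weight) mean of the four components.
sum_score = fmean([completion, time_score, satisfaction, errors])
print(round(sum_score, 4))  # 0.8175
```

Weighting one component (say, doubling completion) would force the other three to share the remaining weight, which is the dilution effect the point above describes.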
  4. You can use three-metric or four-metric versions of SUM: Errors<\/a> are usually the most time-consuming and difficult metric to collect (especially in unmoderated<\/a> testing), so completion rates, task-times and task-satisfaction provide the minimum description of effectiveness, efficiency and satisfaction for a single usability metric.<\/li>\n
  5. A single usability metric doesn’t replace the individual metrics; it summarizes them in a more condensed way, much as an abstract summarizes a long paper or a mean summarizes a large set of numbers. Any summarization involves some loss of information, but the gain in interpretability usually far outweighs it, especially since you don’t truly “lose” anything: you can always dive into the individual metrics (just as you can read the details of a paper).<\/li>\n
  6. There are a number of reasonable ways to combine usability metrics. One of the best we’ve found is to convert everything to a percentage. For discrete metrics (completion rates and errors), this means computing a proportion; for continuous metrics (time and satisfaction), we compute a normalized “z-score<\/a>,” convert it to a percentage, and then average the metrics together.<\/li>\n
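A minimal sketch of the conversion-then-average approach, assuming each continuous metric is judged against a specification limit (a common way to anchor the z-score); all data and spec values below are hypothetical:

```python
from statistics import NormalDist, fmean, stdev

def pct_meeting_spec(values, spec, lower_is_better=True):
    """Standardize the specification limit against the observed mean and
    standard deviation, then map the z-score through the normal CDF to
    estimate the proportion of users meeting the spec."""
    z = (spec - fmean(values)) / stdev(values)
    p = NormalDist().cdf(z)
    return p if lower_is_better else 1.0 - p

# Hypothetical data for one task (n = 5 users).
times = [42.0, 55.0, 38.0, 61.0, 47.0]  # seconds; spec: finish within 60 s
sat = [5.2, 4.8, 6.1, 5.5, 4.9]         # 1-7 scale; spec: rating above 4
completed = 4 / 5                        # discrete metric: a simple proportion

time_pct = pct_meeting_spec(times, spec=60, lower_is_better=True)
sat_pct = pct_meeting_spec(sat, spec=4, lower_is_better=False)

# Three-metric SUM: equal-weight average of the converted percentages.
sum_score = fmean([completed, time_pct, sat_pct])
```

Once every metric lives on a 0-100% scale, the discrete proportions and the CDF-converted continuous metrics average together directly.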
  7. To convert discrete data so they are amenable to combining:\n