Randomization

What a Randomization Test Is and How to Run One in R

The two-sample t-test is one of the most widely used statistical tests, assessing whether mean differences between two samples are statistically significant. It can be used to compare two samples of many UX metrics, such as SUS scores, SEQ scores, and task times. The t-test, like most statistical tests, has certain requirements (assumptions) for its

Read More »
Change Verbs

From Soared to Plummeted: Can We Quantify Change Verbs?

Cases spike, home prices surge, and stock prices tank: we read headlines like these daily. But what is a spike and how much is a surge? When does something crater versus tank or just fall? Headlines are meant to grab our attention. They often communicate the dramatic story the author wants to tell rather than

Read More »
Rating Scales

Rating Scale Best Practices: 8 Topics Examined

Rating scales have been around for close to a century. It’s no wonder there are many questions about best practices and pitfalls to avoid. And like any topic that’s been around for that long, there are urban legends, partial truths, context-dependent findings, and just plain misconceptions about the “right” and “wrong” way to use and

Read More »
Sliders

Are Sliders More Sensitive than Numeric Rating Scales?

Sliders are a type of visual analog scale that can be used with many online survey tools such as our MUIQ platform. The literature on their overall effectiveness is mixed (Roster et al., 2015). On the positive side, evidence indicates that sliders might be more engaging to respondents. On the negative side, evidence also indicates

Read More »

Improving the Prediction of the Number of Usability Problems

Paraphrasing the statistician George Box, all models are wrong, some are useful, and some can be improved. In a recent article, we reviewed the most common way of modeling problem discovery, which is based on a straightforward application of the cumulative binomial probability formula: P(x≥1) = 1 – (1-p)n. Well, it’s straightforward if you like

Read More »

The UX of Vacation Rental Websites

The COVID-19 pandemic has led to significant changes in how people have vacationed in 2020. To get away from it all without spending time in crowded places, vacationers have turned to vacation rental websites and have planned longer stays. For example, Airbnb recently reported a year-to-year doubling of long-term (>28 days) rentals and a shift

Read More »

What Do You Gain from Larger-Sample Usability Tests?

We typically recommend small sample sizes (5–10) for conducting iterative usability testing meant to find and fix problems (formative evaluations). For benchmark or comparative studies, where the focus is on detecting differences or estimating population parameters (summative evaluations), we recommend using larger sample sizes (20–100+). Usability testing can be used to uncover problems and assess the

Read More »

Sample Size Recommendations for Benchmark Studies

One of the primary goals of measuring the user experience is to see whether design efforts actually make a quantifiable difference over time. A regular benchmark study is a great way to institutionalize the idea of quantifiable differences. Benchmarks are most effective when done at regular intervals (e.g., quarterly or yearly) or after significant design

Read More »

Sample Size in Usability Studies: How Well Does the Math Match Reality?

We’ve written extensively about how to determine the right sample size for UX studies. There isn’t one sample size that will work for all studies. The optimal sample size is based on the type of study, which can be classified into three groups: Comparison studies: Comparing metrics for statistical differences Standalone studies: Estimates a population

Read More »
0
    0
    Your Cart
    Your cart is emptyReturn to Shop
    Scroll to Top