You can’t beat a user test
Here at iQ Content we do lots of ‘traditional’ user testing. This involves recording the on-screen actions of a user as they interact with a website. During each test, we ask the users to do what is called a ‘concurrent think aloud’. The user tells us what they are doing and thinking during the user test. This helps us identify usability issues on a site. We see great value in this, in what we consider to be a tried and tested approach. It gives us insights into user behaviour that we may not get if the user did not perform a think aloud.
Eye Tracking, another kind of user test
Last week I attended EyeTrackUX 2009 in Frankfurt. I was at some great (and some not so great) seminars, all related to the Eye Tracking method. In a nutshell, eye tracking allows analysts to track and record a users’ eye gaze whilst they interact with a website. This enables whoever is doing the research to see what the user looked at, what attracted their attention the most, and what elements on a page they completely missed. One of the outputs of a user test done using eye tracking is called a gaze replay. This is basically a replay of what the user looked at during the test. The video below is an example of this:
Is silence really golden?
At the end of the second day, I was lucky enough to speak with Rob Stevens from Bunnyfoot, and Guy Redwood from SimpleUsability. Both firmly agree, that when we are testing a website using Eye Tracking, the user should remain silent during the initial test. As Rob and Guy described, this is natural browsing behaviour, but by getting them to think aloud during the test, we are interfering with this natural behaviour.
Rob says “Generally, we don’t want users to think during their interaction with our websites, but that is exactly what we are forcing them to do when we get them to give concurrent protocols… and as Steve Krug so famously once said, “Dont make me think’’. With concurrent protocols, we may not be getting the right information from the user, and more critically, may be missing some some vital information all together.
Tell me later: Retrospective protocols
Once the eye tracking test is finished, a gaze replay is shown to the user. It is at this point that they get the user to give a ‘retrospective protocol’’, requiring the user to narrate their experience, as they see it being played back. The gaze replay reminds the user exactly what they did, what they looked at, and what they thought, at every point throughout the test. Guy Redwood said, “You get an insight into just how amazing the human brain is after watching a retrospective protocol… users correct themselves, and can instantly recall very precisely what happened”.
Having investigated Eye Tracking as a potential method for iQ, and having talked to many people at EyeTrackUX 09, I think it holds some merit over traditional user testing. Here are the reasons why;
- It eliminates the cognitive strain we place on the users when we force them to think aloud during a user test. This allows us to observe more natural browsing and interaction behaviours.
- It facilitates a more natural setting for the user test, where quite often, the analyst isn’t even required to be in the same room as the user. This reduces (although doesn’t eliminate) the likelihood of the Hawthorne effect taking place.
- It provides irrefutable proof about what our users looked at during the session which helps to reduce any suspicion that we, as analysts, are giving our subjective opinion about why or what a user did or said…. ‘here is the proof’!
With these in mind, I look forward to posting some insights and findings from our upcoming eye tracking studies here at iQ.
What do you think? Is it time to move on completely from the traditional testing approach? Or can we still get just as much insight from traditional user testing?
May 5, 2009 at 7:47 pm
We’ve done some research on this, inspired by the early experiences of Rob Stevens and colleagues. The results are encouraging, see http://www.jemr.org/online/2/4/5
May 6, 2009 at 9:24 am
Hi Kario-Jourko,
Thanks for the link, looks interesting. I’ll be sure to check it out.
May 6, 2009 at 10:42 am
Hi Kari-Jouko, great paper thanks for bringing to our attention. We have been developing the PEEP methodology over the years and have started to employ it with samples sizes of 30. One benefit is that we can harvest valid eyetracking metrics and visualisation. There is a good overview of it here: http://bunnyfoot.com/blog/2009.....r-testing/
May 6, 2009 at 11:33 am
Hey Ruari,
As a web analytics person trying to learn more about usability I have more general questions about this methodology.
How do you get a statistically significant sample size (try saying that quickly)? If you get tens of thousands of visitors a month how do you conduct enough eye tracks to make sure you are covering a large enough % of your user base.
How do you get participants that are likely to be part of your sites target audience? Presumably you don’t want to have all grannies doing your eye tracks if you sell skateboards.
May 6, 2009 at 12:00 pm
Hi Brian,
With traditional user testing, we find that testing with 5 – 12 participants usually yields the vast majority of usability issues on a site.
More participants are required for eye tracking studies in order to produce valid outputs (i.e., the heat map). It is a wide open debate on exactly how many users suffice. Most researchers would say that anywhere between 20 – 60 users is enough.
To identify the correct user group of a site, we find the most accurate method is persona development. This starts with a persona workshop (involving all the stakeholders of the site) earlier in the project. The output of this is a number of fictitious characters (with names, ages, goals, & behaviours) that we can then begin designing for. When recruiting participants for user testing, we try to match the profile of the users we recruit, to the personas.
I hope this answers your questions, Brian. Please don’t hesitate to contact me if you have any more questions on the matter.
May 6, 2009 at 2:09 pm
Ruairi,
It’s good that you’re inviting a discussion about this because it’s a controversial topic – some people love eye tracking, others hate it and consider it a form of research snake-oil.
Personally, I stand somewhere in the middle. It’s never a good idea to invest too heavily in a single research method.
May 6, 2009 at 2:16 pm
Hi Harry,
Thanks for the comment. Your right, debate rages on over this topic.
May 6, 2009 at 2:51 pm
Ruairi,
Interesting discussion you have going here. One concern I have is your bullet three to “why” – “It provides irrefutable proof about what our users looked at during the session…”
What can be said irrefutably is what the users didn’t look at. When making points about where users have looked when using the eye tracking tools you have to include the caveat that you may not know what they have attended to or what they have processed. A gaze on a particular area or item does not translate into a known cognitive processing of that information.
Don’t get me wrong, I work with eye tracking and strongly believe that all of the eye tracking tools are supplements to better understanding the users process. Our camp would argue that heat maps and gaze plots add the “sexy sprinkles” but the gaze replay is the best tool for re-watching your sessions to extrapolate web user behaviors.
May 6, 2009 at 3:02 pm
Hi Elizabeth,
It was nice meeting you at the conference, & thanks for the comment.
Good point about bullet 3 and thanks for your input. It is interesting that the gaze replay can suggest that a user looked at something but they actually didn’t ‘attend’ to it. Do you think this affects the ‘honesty’ with which they give their retrospective protocol? i.e., if they see their gaze on a certain area of the page, but don’t remember attending to it, they might make up something to match the replay?
May 6, 2009 at 3:26 pm
Unfortunately, I haven’t had the opportunity to do retrospectives. I have only done “think aloud” and post-session recorded debriefs where I ask the participant to take me back through and recall what they did. As you can imagine, it can be a bit limiting but this way we have clean data from the original non-verbal testing.
I am interested in trying the new Studio 1.5 tool to do a retrospective during my next project. I will refrain from speculating and will give you real feedback once I have concluded those sessions.
May 6, 2009 at 3:28 pm
Hi Ruiri,
What kind of equipment was showcased in Frankfurt? I see that the old clunky and intrusive headgear is generally being replaced with specialized monitors. Is there any newer technologies and is there any trends to make the price of the equipment less inhibitive?
May 6, 2009 at 3:31 pm
Thanks Elizabeth,
In look forward to it.
May 6, 2009 at 3:36 pm
Hi Darragh,
The equipment showcased in Frankfurt was the T120, and the the T60 XL (on a 24″ Monitor). Your right, the head mounted stuff is on the way out. There are cheaper alternatives to Tobii out there, but none seem as good as Tobii. The new software (Studio 1.5.2) was also showcased. Unfortunately, the price for the hardware and software remains as high as ever.
May 7, 2009 at 8:58 am
Cheers Ruairi, can i have a lend of yours when you get one. I’ll drop it back within the week, promise.
May 7, 2009 at 2:13 pm
No Problem Darragh..
Any time