Talk presented at the 1st Workshop on Replication in the Language Sciences, Frankfurt, 2025
1. Universidad Nebrija; 2. University of Valencia
Presentation available at: https://bangele.quarto.pub/worela2025. Scan QR code for link.
General issue of WEIRD research (Henrich et al., 2010): Western participants may not be representative of all readers or even the majority of readers
English, German, French, Spanish, Italian etc. are similar languages in many aspects and all share the same writing system
Studying Chinese reading has forced us to think about new issues such as word segmentation, processing of character components and many more.
32 participants read 400 sentences in Spanish
Eye movements are recorded by an SR Research Eyelink Portable Duo
Four sampling rates 250 Hz, 500 Hz, 1000 Hz, and 2000 Hz (100 sentences each)
Frequency manipulation: each sentence has a target word that was manipulated to be either
high frequency (mean frequency 47/million)
low frequency (mean frequency 2/million)
The context up to the target word was identical for both versions of the sentence.
Crop out end of trial
With Eyelink detected fixations
For each participant and sampling rate, we extract the fixations detected by the Eyelink saccade detection algorithm and aggregate them into word-based fixation time measures for the target word
First fixation duration (FFD)
Gaze duration (GD)
In order to evaluate the strength of evidence for the frequency effect, we then fitted Bayesian linear and generalized linear mixed models using the brms package (Bürkner, 2017).
Fixed effect: frequency condition (coded as present = -.5
; absent = .5
)
Random effects: all possible (intercepts and frequency condition by participant and item)
As a rule of thumb, we considered an effect credible if more than 95% of the distribution are on one side of 0
250 and 500 Hz | ||||
---|---|---|---|---|
First fixation duration
|
Gaze duration
|
|||
Mean | SD | Mean | SD | |
250 Hz | ||||
high frequency | 242 | 85 | 332 | 169 |
low frequency | 245 | 85 | 373 | 209 |
Effect | 3 | 40 | ||
500 Hz | ||||
high frequency | 238 | 84 | 329 | 182 |
low frequency | 246 | 92 | 366 | 229 |
Effect | 8 | 36 |
1000 and 2000 Hz | ||||
---|---|---|---|---|
First fixation duration
|
Gaze duration
|
|||
Mean | SD | Mean | SD | |
1000 Hz | ||||
high frequency | 234 | 83 | 319 | 171 |
low frequency | 244 | 85 | 365 | 222 |
Effect | 10 | 45 | ||
2000 Hz | ||||
high frequency | 230 | 80 | 313 | 165 |
low frequency | 236 | 85 | 344 | 197 |
Effect | 7 | 31 |
400 trials per subject
400 trials per subject
The frequency effect is very robust, especially in GD and TVT
The effect is more sensitive to sampling rate in FFD
This is encouraging for researchers who cannot afford a 1000 Hz eye-tracker!
Simulating low sampling rates from data collected using a very accurate eye-tracker is not the same as actually using an affordable eye tracker.
But we have shown that sampling rate is not a hard bottleneck for studying reading.
You can compensate for low sampling rates by increasing sample size
If you are not sure about whether your eye-tracker is good enough to study reading, maybe just do a pilot study looking for the frequency effect in GD
Now in press at Behavior Research Methods!
Presentation available at: https://bangele.quarto.pub/using-affordable-eye-tracking-methods-to-study-reading-the-role-of-sampling-rate or scan QR code
With fixations according to the Engbert & Kliegl (2003) algorithm
With Eyelink and Engbert & Kliegl (2004) fixations plotted on top of each other
100 trials/subject
100 trials/subject
400 trials/subject
400 trials/subject
Simulated low sampling rates | ||||
---|---|---|---|---|
First fixation duration
|
Gaze duration
|
|||
Mean | SD | Mean | SD | |
31.25 Hz | ||||
high frequency | 214 | 101 | 269 | 143 |
low frequency | 222 | 107 | 307 | 189 |
Effect | 8 | 39 | ||
50 Hz | ||||
high frequency | 221 | 88 | 291 | 151 |
low frequency | 228 | 92 | 333 | 200 |
Effect | 7 | 42 | ||
125 Hz | ||||
high frequency | 233 | 81 | 314 | 158 |
low frequency | 237 | 83 | 356 | 213 |
Effect | 4 | 42 |
100 trials per subject
100 trials per subject
Error in UseMethod("group_by"): no applicable method for 'group_by' applied to an object of class "function"
Error: object 'filtered_data' not found
Error in `left_join()`:
! Join columns in `x` must be present in the data.
✖ Problem with `sentence_nr`, `word_nr`, and `cond`.
Simulated low sampling rates | ||||
---|---|---|---|---|
First fixation duration
|
Gaze duration
|
|||
Mean | SD | Mean | SD | |
31.25 Hz | ||||
high frequency | 214 | 99 | 274 | 154 |
low frequency | 225 | 108 | 313 | 193 |
Effect | 12 | 39 | ||
50 Hz | ||||
high frequency | 223 | 89 | 299 | 162 |
low frequency | 233 | 96 | 337 | 205 |
Effect | 9 | 38 | ||
125 Hz | ||||
high frequency | 234 | 84 | 322 | 170 |
low frequency | 241 | 88 | 363 | 219 |
Effect | 7 | 41 |