23  Reliability of ROAR-Fonema

23.1 Background: Published studies

Bhat et al. (2024) reported a marginal reliability of 0.85 for ROAR-Fonema (see Section 21.1 for more detail).

23.2 Reliability of fixed-length ROAR-Fonema

Table 23.1 reports marginal reliability, computed under the IRT model from data on 4128 students, for two fixed-length versions of ROAR-Fonema: the short, 20-item version and the long, 54-item version. A computer-adaptive version of ROAR-Fonema is planned for release in fall 2024 and is expected to be more efficient and more reliable with fewer items. Reliability (\(\rho_{xx^\prime}\)) is computed from the estimated variance of \(\hat{\theta}\) relative to the estimated error variance, that is, the squared standard error (\(\widehat{SE}(\hat{\theta})^2\)), using Equation 23.1:

\[ \hat{\rho}_{xx^\prime} = \frac{\widehat{VAR}(\hat{\theta})}{\widehat{VAR}(\hat{\theta}) + \widehat{SE}(\hat{\theta})^2}. \tag{23.1}\]
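As a minimal sketch of how Equation 23.1 could be evaluated, the Python snippet below computes the ratio from per-student ability estimates and standard errors. It is an illustration, not the code used to produce the tables: the function name `empirical_reliability`, the input values, the use of the sample variance, and the choice to average the squared standard errors across students are all assumptions.

```python
from statistics import fmean, variance


def empirical_reliability(theta_hat, se_hat):
    """Evaluate Equation 23.1 for one group of students.

    theta_hat: per-student ability estimates.
    se_hat: per-student standard errors of those estimates.
    """
    if len(theta_hat) != len(se_hat):
        raise ValueError("theta_hat and se_hat must have the same length")
    if len(theta_hat) < 2:
        raise ValueError("at least two students are needed to estimate a variance")

    # Sample variance of the ability estimates (assumption: n - 1 denominator).
    var_theta = variance(theta_hat)
    # Assumption: the error term is the mean of the squared standard errors.
    mean_error_var = fmean([se ** 2 for se in se_hat])

    return var_theta / (var_theta + mean_error_var)


# Made-up values for illustration only; these are not ROAR-Fonema data.
theta_hat = [-1.0, 0.0, 1.0]
se_hat = [0.5, 0.5, 0.5]
rho = empirical_reliability(theta_hat, se_hat)
print(round(rho, 2))  # 0.8
```

With these toy inputs the variance of the estimates is 1.0 and the mean squared standard error is 0.25, so the ratio is 1.0 / 1.25 = 0.8.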

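| Version | Grade | Empirical Reliability | N |
|---|---|---|---|
| All | All | 0.82 | 4128 |
| long | K | 0.35 | 389 |
| long | 1 | 0.52 | 513 |
| long | 2 | 0.72 | 578 |
| long | 3 | 0.78 | 561 |
| long | 4 | 0.81 | 519 |
| long | 5 | 0.87 | 577 |
| long | 6 | 0.88 | 592 |
| short | K | 0.51 | 206 |
| short | 1 | 0.60 | 62 |
| short | 2 | 0.74 | 131 |

Table 23.1: Reliability of ROAR-Fonema by Grade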

To evaluate whether ROAR-Fonema is fair and equitable across demographic groups, we also report reliability by gender (Table 23.2), eligibility for free or reduced-price lunch (FRL; Table 23.3), English learner (EL) status based on State of California designations (Table 23.4), primary language spoken (Table 23.5), special education status (Table 23.6), Hispanic ethnicity (Table 23.7), and race (Table 23.8).
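A subgroup breakdown of this kind can be sketched by applying the reliability ratio separately within each group. The snippet below is a hypothetical illustration: the record layout, the function name `reliability_by_group`, and the `min_n` cutoff are assumptions. In particular, the source does not state why some cells in the tables are NULL, so the cutoff here is only a placeholder for whatever rule was actually applied.

```python
from collections import defaultdict
from statistics import fmean, variance


def reliability_by_group(records, min_n=2):
    """Return {group: (reliability or None, n)} from (group, theta_hat, se_hat) records."""
    grouped = defaultdict(list)
    for group, theta, se in records:
        grouped[group].append((theta, se))

    results = {}
    for group, pairs in grouped.items():
        n = len(pairs)
        if n < min_n:
            # Hypothetical rule: leave the estimate empty for very small groups.
            results[group] = (None, n)
            continue
        var_theta = variance([theta for theta, _ in pairs])
        # Assumption: the error term is the mean of the squared standard errors.
        mean_error_var = fmean([se ** 2 for _, se in pairs])
        results[group] = (var_theta / (var_theta + mean_error_var), n)
    return results


# Made-up records for illustration only; these are not ROAR-Fonema data.
records = [
    ("F", -1.0, 0.5), ("F", 0.0, 0.5), ("F", 1.0, 0.5),
    ("M", -2.0, 1.0), ("M", 2.0, 1.0),
    ("X", 0.3, 0.4),
]
by_group = reliability_by_group(records)
for group, (rho, n) in sorted(by_group.items()):
    print(group, "NULL" if rho is None else round(rho, 2), n)
```

Reporting the group size alongside each estimate, as the tables do, helps readers judge how much weight a subgroup value can bear.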

| Version | Gender | Empirical Reliability | N |
|---|---|---|---|
| All | All | 0.81 | 3517 |
| long | F | 0.83 | 1484 |
| long | M | 0.82 | 1640 |
| short | F | 0.67 | 178 |
| short | M | 0.66 | 215 |

Table 23.2: Reliability of ROAR-Fonema by Gender

| Free/Reduced Lunch Status | Empirical Reliability | N |
|---|---|---|
| All | 0.66 | 393 |
| Free | 0.59 | 191 |
| Paid | 0.71 | 138 |
| Reduced | 0.71 | 64 |

Table 23.3: Reliability of ROAR-Fonema by FRL (California Sub-sample Only)

| English Learner Status | Empirical Reliability | N |
|---|---|---|
| All | 0.66 | 393 |
| English Learner | 0.58 | 257 |
| English Only | 0.73 | 99 |
| Initial Fluent English Proficient | NULL | 28 |
| Reclassified Fluent English Proficient | NULL | 9 |

Table 23.4: Reliability of ROAR-Fonema by EL Status (California Sub-sample Only)

| Primary Language | Empirical Reliability | N |
|---|---|---|
| All | 0.67 | 360 |
| English | 0.72 | 156 |
| Spanish | 0.59 | 204 |

Table 23.5: Reliability of ROAR-Fonema by Primary Language (California Sub-sample Only)

| Special Education Status | Empirical Reliability | N |
|---|---|---|
| All | 0.66 | 393 |
| No | 0.67 | 357 |
| Yes | 0.48 | 36 |

Table 23.6: Reliability of ROAR-Fonema by Special Education Status (California Sub-sample Only)

| Hispanic Ethnicity | Empirical Reliability | N |
|---|---|---|
| All | 0.63 | 374 |
| No | NULL | 28 |
| Yes | 0.63 | 346 |

Table 23.7: Reliability of ROAR-Fonema by Hispanic Ethnicity (California Sub-sample Only)

| Race | Empirical Reliability | N |
|---|---|---|
| All | 0.63 | 374 |
| Hispanic | 0.63 | 346 |
| White | NULL | 26 |

Table 23.8: Reliability of ROAR-Fonema by Race (California Sub-sample Only)

References

Bhat, Kruttika G., Alexa Mogan, Ana Saavedra, Mia Fuentes-Jimenez, Julian M. Siebert, Wanjing Anya Ma, Carrie Townley-Flores, et al. 2024. “Shared and Unique Influences of Phonological Processing on Reading and Math.” OSF Preprints. https://doi.org/10.31219/osf.io/em3bg.