23 Reliability of ROAR-Fonema

23.1 Background: Published studies

Bhat et al. (2024) reported marginal reliability of 0.85 for ROAR-Fonema (see more detail in Section 21.1).

23.2 Reliability of fixed-length ROAR-Fonema

Table 23.1 reports marginal reliability (labeled "Empirical Reliability" in the tables), computed under the IRT model from data on 4128 students, for two fixed-length versions of ROAR-Fonema: the short, 20-item version and the long, 54-item version. A computer-adaptive version of ROAR-Fonema is planned for release in fall 2024 and is expected to be more efficient, reaching higher reliability with fewer items. Reliability (\(\rho_{xx^\prime}\)) is computed as the estimated variance of the ability estimates \(\hat{\theta}\) divided by the sum of that variance and the estimated error variance, that is, the squared standard error (\(\widehat{SE}(\hat{\theta})^2\)), using Equation 23.1:

\[ \hat{\rho}_{xx^\prime} = \frac{\widehat{VAR}(\hat{\theta})}{\widehat{VAR}(\hat{\theta}) + \widehat{SE}(\hat{\theta})^2}. \tag{23.1}\]
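
As an illustration of Equation 23.1, the following is a minimal sketch rather than the code used to produce the tables in this chapter. It assumes that the error-variance term is the mean of the squared standard errors across students and that \(\widehat{VAR}(\hat{\theta})\) is the sample variance of the ability estimates; the text above does not specify either choice. The function name `empirical_reliability` and the example values are hypothetical.

```python
from statistics import mean, variance


def empirical_reliability(theta_hat, se):
    """Reliability as in Equation 23.1 (illustrative sketch).

    theta_hat: per-student ability estimates.
    se: per-student standard errors of those estimates.
    Assumption: the error variance is the mean of the squared standard errors.
    Assumption: VAR(theta_hat) is the sample variance (n - 1 denominator).
    """
    if len(theta_hat) != len(se):
        raise ValueError("theta_hat and se must have the same length")
    if len(theta_hat) < 2:
        raise ValueError("at least two students are needed to estimate a variance")
    var_theta = variance(theta_hat)            # observed variance of the estimates
    error_var = mean([s ** 2 for s in se])     # average squared standard error
    return var_theta / (var_theta + error_var)


# Made-up example values, not ROAR-Fonema data.
example_theta = [-1.0, 0.0, 1.0]
example_se = [0.5, 0.5, 0.5]
example_reliability = empirical_reliability(example_theta, example_se)
```

Under these assumptions, a group whose ability estimates vary little relative to their standard errors yields a low value, so a restricted range of ability within a single grade can lower the estimate even when measurement precision is unchanged.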

Version	Grade	Empirical Reliability	N
All	All	0.82	4128
long	K	0.35	389
long	1	0.52	513
long	2	0.72	578
long	3	0.78	561
long	4	0.81	519
long	5	0.87	577
long	6	0.88	592
short	K	0.51	206
short	1	0.60	62
short	2	0.74	131

Table 23.1: Reliability of ROAR-Fonema by Grade

To evaluate whether ROAR-Fonema is fair and equitable across demographic groups, we also report reliability by gender (Table 23.2), eligibility for free or reduced-price lunch (Table 23.3), English learner status based on State of California designations (Table 23.4), primary language spoken (Table 23.5), special education status (Table 23.6), ethnicity (Table 23.7), and race (Table 23.8).

Version	Gender	Empirical Reliability	N
All	All	0.81	3517
long	F	0.83	1484
long	M	0.82	1640
short	F	0.67	178
short	M	0.66	215

Table 23.2: Reliability of ROAR-Fonema by Gender

Free/Reduced Lunch Status	Empirical Reliability	N
All	0.66	393
Free	0.59	191
Paid	0.71	138
Reduced	0.71	64

Table 23.3: Reliability of ROAR-Fonema by FRL (California Sub-sample Only)

English Learner Status	Empirical Reliability	N
All	0.66	393
English Learner	0.58	257
English Only	0.73	99
Initial Fluent English Proficient	NULL	28
Reclassified Fluent English Proficient	NULL	9

Table 23.4: Reliability of ROAR-Fonema by EL Status (California Sub-sample Only)

Primary Language	Empirical Reliability	N
All	0.67	360
English	0.72	156
Spanish	0.59	204

Table 23.5: Reliability of ROAR-Fonema by Primary Language (California Sub-sample Only)

Special Education Status	Empirical Reliability	N
All	0.66	393
No	0.67	357
Yes	0.48	36

Table 23.6: Reliability of ROAR-Fonema by Special Education Status (California Sub-sample Only)

Hispanic Ethnicity	Empirical Reliability	N
All	0.63	374
No	NULL	28
Yes	0.63	346

Table 23.7: Reliability of ROAR-Fonema by Hispanic Ethnicity (California Sub-sample Only)

Race	Empirical Reliability	N
All	0.63	374
Hispanic	0.63	346
White	NULL	26

Table 23.8: Reliability of ROAR-Fonema by Race (California Sub-sample Only)

References

Bhat, Kruttika G., Alexa Mogan, Ana Saavedra, Mia Fuentes-Jimenez, Julian M. Siebert, Wanjing Anya Ma, Carrie Townley-Flores, et al. 2024. “Shared and Unique Influences of Phonological Processing on Reading and Math.” OSF Preprints. https://doi.org/10.31219/osf.io/em3bg.