INDEX
Explanations
references to "reading" and associated activities or concepts related to literacy
New Auto-Interp
Negative Logits
sten
-0.17
ck
-0.16
ats
-0.16
ped
-0.15
ated
-0.15
oc
-0.15
iv
-0.15
FactoryBot
-0.15
iling
-0.14
udad
-0.14
POSITIVE LOGITS
/view
0.26
/list
0.25
just
0.24
/watch
0.22
comprehension
0.22
mitted
0.20
/w
0.19
iness
0.19
/write
0.18
aloud
0.18
Activations Density 0.061%