Explanations
phrases indicating reading time or duration
Negative Logits
Token    Logit
Pou      -0.15
ili      -0.14
Polic    -0.14
ally     -0.14
sbin     -0.13
216      -0.13
imat     -0.13
Sist     -0.13
Colon    -0.13
vr       -0.13
Positive Logits
Token    Logit
read     0.22
读       0.20
reading  0.20
讀       0.19
reads    0.18
-read    0.18
Reading  0.17
reads    0.17
reo      0.16
Reading  0.16
Activations Density: 0.018%
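
For orientation, the density figure above can be read as the fraction of sampled token positions on which the feature fires. The sketch below illustrates that reading only: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the dashboard may compute its figure differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature is
    active at all; the threshold of 0.0 is an illustrative choice.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 18 of which fire.
sample = [0.0] * 99_982 + [1.5] * 18
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.018% corresponds to roughly 18 active positions per 100,000 tokens, so the feature is sparse.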