INDEX
Explanations
sentences instructing to read more on a particular topic
references to reading-related actions or discussions
Negative Logits
oult       -0.75
TEXTURE    -0.70
ascal      -0.68
cffffcc    -0.66
heel       -0.66
USSR       -0.63
aviour     -0.62
pload      -0.62
VP         -0.61
WWF        -0.61
Positive Logits
Write      0.96
aloud      0.90
Read       0.86
just       0.81
ahead      0.81
gon        0.79
iances     0.79
dress      0.78
Continued  0.76
Read       0.75
Activations Density: 0.019%