INDEX
Explanations
occurrences of the word "the" and its variants within the text
New Auto-Interp
Negative Logits
insky
-0.15
alled
-0.15
reading
-0.14
Campos
-0.14
icipant
-0.13
est
-0.13
269
-0.13
147
-0.13
ronic
-0.13
268
-0.13
POSITIVE LOGITS
ufen
0.17
ocratic
0.16
ologically
0.16
opsy
0.15
ToF
0.15
sembles
0.14
tük
0.14
ocs
0.14
VERTISEMENT
0.14
ovna
0.14
Activations Density 0.180%