INDEX
Explanations
the occurrence of the letter 'f' in the text
New Auto-Interp
Negative Logits
oster
-0.16
unger
-0.14
Cunning
-0.14
arb
-0.14
alse
-0.14
(«
-0.14
sidel
-0.14
/sidebar
-0.13
ILLE
-0.13
Stephens
-0.13
POSITIVE LOGITS
982
0.16
STALL
0.15
Shaft
0.15
ouro
0.15
ladder
0.14
nis
0.13
CRET
0.13
rag
0.13
longleftrightarrow
0.13
etic
0.13
Activations Density 0.018%