INDEX
Explanations
instances of the letter "o" in various contexts
New Auto-Interp
Negative Logits
rt
-0.21
b
-0.20
h
-0.20
rist
-0.19
pel
-0.19
bs
-0.18
p
-0.18
ris
-0.18
pv
-0.18
hle
-0.17
POSITIVE LOGITS
lymp
0.27
'clock
0.26
vens
0.25
phthalm
0.24
missions
0.23
regon
0.21
aths
0.21
curring
0.20
vals
0.20
tor
0.20
Activations Density 0.012%