INDEX
Explanations
variations of the word "cough."
Negative Logits
Token    Logit
eva      -0.17
cth      -0.16
ynth     -0.15
aque     -0.14
suma     -0.14
chaft    -0.14
zw       -0.14
kaar     -0.14
zeň      -0.14
peč      -0.14
Positive Logits
Token    Logit
erty     0.30
lin      0.27
ough     0.22
borough  0.21
orne     0.20
lan      0.20
nut      0.20
ought    0.19
erb      0.17
nuts     0.17
Activations Density: 0.018%
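
To put the density figure in concrete terms, here is a minimal sketch. It assumes "activations density" means the fraction of sampled tokens on which the feature's activation is above zero; the page does not state this definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (total tokens sampled).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 18 of which activate the feature.
sample = [0.0] * 99_982 + [1.5] * 18
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.018% corresponds to roughly 18 active tokens per 100,000 sampled.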