INDEX
Explanations
various forms of the word "crazy" or its variations
New Auto-Interp
Negative Logits
ÅĻÃŃm
-0.18
aidu
-0.17
озем
-0.16
gard
-0.15
idon
-0.15
isoft
-0.15
leine
-0.15
grim
-0.15
Https
-0.15
ello
-0.14
POSITIVE LOGITS
ze
0.40
igslist
0.30
fter
0.27
zi
0.26
ZY
0.26
cra
0.23
ZE
0.23
iyon
0.23
azy
0.23
Ze
0.22
Activations Density 0.009%