INDEX
Explanations
references to the concept of craziness or insanity
New Auto-Interp
Negative Logits
Hallmark
-0.44
upol
-0.38
WPS
-0.36
hlm
-0.35
thenReturn
-0.35
PSS
-0.34
AED
-0.34
PCC
-0.34
PPD
-0.33
behalf
-0.33
POSITIVE LOGITS
crazy
1.80
Crazy
1.79
Crazy
1.77
crazy
1.75
craz
1.16
insane
1.03
Insane
0.93
insane
0.92
Insane
0.91
locura
0.90
Activations Density 0.003%