INDEX
Explanations
references to incompetence and immaturity in individuals
NEGATIVE LOGITS
à¸Ńà¸Ķ    -0.16
raž       -0.15
Severity  -0.14
severity  -0.14
');?>     -0.14
erland    -0.14
.oracle   -0.14
Tyto      -0.13
');?>↵    -0.13
æ¹        -0.13
POSITIVE LOGITS
mor      0.36
retard   0.31
jerk     0.30
douche   0.30
MOR      0.28
asshole  0.28
dip      0.28
assh     0.27
idi      0.27
dumb     0.27
Activations Density 0.355%