INDEX
Explanations
words related to criticism or disapproval
variations of the prefix "unc," indicating negation or absence
New Auto-Interp
Negative Logits
å§«
-0.89
uyomi
-0.87
Ò
-0.85
tery
-0.80
geist
-0.70
=-=-=-=-
-0.69
=-=-=-=-=-=-=-=-
-0.69
Fo
-0.68
Dynamics
-0.67
MENT
-0.66
POSITIVE LOGITS
redited
1.17
ooked
1.16
ursed
1.16
reated
1.08
aleb
1.05
umbered
1.04
ount
1.03
ivil
1.00
onduct
0.97
irc
0.97
Activations Density 0.010%