INDEX
Explanations
terms related to negative judgment or condemnation
instances of the word "foul" in various contexts
Negative Logits
Token         Logit
_>            -0.74
edia          -0.73
DCS           -0.71
Downloadha    -0.71
zl            -0.70
姫            -0.69
Airl          -0.68
itan          -0.68
etsk          -0.68
iku           -0.67
Positive Logits
Token         Logit
foul          0.84
cery          0.81
smelling      0.76
terness       0.70
poisons       0.66
stains        0.65
misc          0.64
mire          0.64
fully         0.64
mson          0.64
Activations Density: 0.006%
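
For a sense of scale, a density of 0.006% corresponds to roughly 6 active positions per 100,000 tokens. The sketch below shows one plausible way such a figure could be computed. It assumes density means the share of token positions where the feature's activation is above zero; the source does not define the metric. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" is the fraction of token positions on which the
    feature fires. The dashboard itself does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 6 active positions among 100,000 tokens.
sample = [0.0] * 100_000
for i in range(6):
    sample[i * 1000] = 1.5

density = activation_density(sample)
print(f"{density:.3f}%")  # prints "0.006%"
```

A nonzero `threshold` would count only stronger activations, which lowers the reported density.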