INDEX
Explanations
instances of contempt-related language or actions
instances of the word "contempt" and its variations in the context of legal or social issues
New Auto-Interp
Negative Logits
eor
-0.82
mith
-0.66
ramid
-0.65
akeru
-0.62
jc
-0.62
laus
-0.60
ded
-0.59
sund
-0.59
Lans
-0.58
arist
-0.57
POSITIVE LOGITS
uously
1.45
uous
1.44
contempt
1.04
naire
0.98
ible
0.94
uality
0.86
fully
0.85
ibly
0.82
antly
0.81
isons
0.79
Activations Density 0.010%