INDEX
Explanations
proper nouns, particularly related to vilification and character assassination
words related to vilification or negative labeling of individuals
New Auto-Interp
Negative Logits
soDeliveryDate
-0.76
cropped
-0.74
ĸļ
-0.71
unct
-0.70
riad
-0.67
uyomi
-0.66
Brill
-0.65
hr
-0.64
ItemTracker
-0.63
advertisement
-0.63
POSITIVE LOGITS
Vil
1.39
eness
0.94
vil
0.86
uv
0.80
icious
0.80
estone
0.78
estones
0.77
apo
0.77
vil
0.77
atan
0.76
Activations Density 0.014%