INDEX
Explanations
exclamations and announcements in text
exclamatory statements or emphatic expressions
New Auto-Interp
Negative Logits
mistrust
-0.89
dehuman
-0.84
distrust
-0.84
outwe
-0.80
contrasted
-0.80
discour
-0.80
outweigh
-0.79
bias
-0.78
contrasts
-0.78
oneself
-0.78
POSITIVE LOGITS
Atl
1.27
Yesterday
1.25
Sony
1.23
Congratulations
1.20
TOR
1.18
Following
1.14
Rum
1.13
UPDATE
1.13
Today
1.11
Starting
1.09
Activations Density 0.373%