INDEX
Explanations
action verbs in past tense with '-ed' suffix
negative statements or assessments related to social issues
New Auto-Interp
Negative Logits
Mississ
-0.73
Thrones
-0.73
Canaver
-0.73
Metall
-0.71
Stones
-0.70
Schwar
-0.69
Rolls
-0.69
Isle
-0.69
Skywalker
-0.68
Corps
-0.67
POSITIVE LOGITS
terday
1.26
theless
1.24
anwhile
1.10
etheless
1.09
maxwell
0.89
guiActiveUn
0.80
bsite
0.80
mosp
0.78
ividual
0.77
ickr
0.77
Activations Density 0.206%