INDEX
Explanations
words related to legal or political actions and decisions
action verbs that indicate support or endorsement
New Auto-Interp
Negative Logits
been
-0.80
yond
-0.72
é¾įå¥ij士
-0.71
coin
-0.70
isable
-0.69
wcsstore
-0.67
yet
-0.65
borne
-0.65
pic
-0.64
():
-0.63
POSITIVE LOGITS
hement
0.77
unanimously
0.74
unsuccessfully
0.74
him
0.67
nervously
0.66
enthusiastically
0.66
valiant
0.66
furiously
0.64
Wem
0.63
tremend
0.63
Activations Density 0.525%