INDEX
Explanations
words related to legal or formal terms
words related to judgment or evaluation
Negative Logits
hess         -0.76
Grayson      -0.72
Cheong       -0.70
ingham       -0.67
hyde         -0.67
Highlands    -0.65
Sutherland   -0.64
enhagen      -0.62
Shepherd     -0.62
Trace        -0.61
Positive Logits
propos       0.64
pestic       0.64
predec       0.64
comprom      0.62
manent       0.61
bureaucr     0.61
abor         0.60
embassies    0.59
taxp         0.59
anc          0.59
Activations Density: 0.535%