INDEX
Explanations
verbs related to actions or behaviors
references to serious crimes or allegations
Negative Logits
unison          -0.78
selves          -0.65
respectively    -0.63
angular         -0.59
VERTISEMENT     -0.57
were            -0.56
Ware            -0.56
constituted     -0.55
respective      -0.55
merce           -0.54
Positive Logits
himself         1.51
Himself         1.15
his             1.11
herself         1.07
HIS             0.91
his             0.85
His             0.83
His             0.78
reelection      0.68
tweeting        0.66
Activations Density: 1.514%