INDEX
Explanations
words related to various actions or concepts in discussions or debates
terms associated with controversy and political maneuvers
Negative Logits
Token        Logit
*.           -0.72
Conduct      -0.62
Supported    -0.61
Medline      -0.61
Due          -0.60
":["         -0.60
Sind         -0.60
.''.         -0.60
consecut     -0.59
Prior        -0.59
Positive Logits
Token        Logit
ocracy       1.10
ocratic      1.05
iest         1.03
ariat        0.89
horse        0.88
leader       0.86
less         0.84
ometer       0.82
book         0.81
packed       0.81
Activations Density: 0.545%
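
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is computed: the fraction of sampled tokens on which the feature's activation is above a threshold. This is an illustration under stated assumptions, not the dashboard's actual code. The function name activation_density, the threshold of 0.0, and the toy activation values are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold. The dashboard may use a different rule.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 tokens, of which 5 have a nonzero activation.
toy_activations = [0.0] * 995 + [1.2, 0.3, 2.5, 0.7, 0.1]

density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints 0.500%
```

Under this definition, a density of 0.545% would mean the feature fires on roughly 5 or 6 out of every 1000 sampled tokens.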