INDEX
Explanations
phrases indicating discussions, negotiations, or agreements involving multiple parties
references to opposing groups or factions
New Auto-Interp
Negative Logits
pta
-0.69
DERR
-0.68
ķ
-0.65
nce
-0.64
Rapids
-0.59
ogle
-0.58
arton
-0.57
Prospect
-0.57
stad
-0.57
Tray
-0.56
POSITIVE LOGITS
alike
1.07
sides
0.99
mutually
0.98
sexes
0.98
simultaneously
0.97
vying
0.97
equally
0.96
parties
0.86
'
0.84
agree
0.83
Activations Density 0.157%