INDEX
Explanations
phrases indicating types of actions, events, or conditions
NEGATIVE LOGITS
aceae           -0.71
âĶľ             -0.68
agree           -0.66
âĶľâĶĢâĶĢ       -0.64
lords           -0.63
ngth            -0.63
livious         -0.63
somet           -0.62
grounds         -0.61
nown            -0.61
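Entries such as âĶľ look like raw vocabulary strings from a byte-level BPE tokenizer rather than damaged text. The sketch below assumes a GPT-2-style byte-to-character table, which this page does not confirm; the helper names `byte_level_table` and `decode_byte_level_token` are hypothetical and introduced only for illustration.

```python
def byte_level_table() -> dict:
    """Build the byte -> printable-character map used by GPT-2-style tokenizers.

    Assumption: the tokens on this page come from such a tokenizer.
    """
    # Bytes that are already printable map to themselves.
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to an unused code point starting at 256.
    offset = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + offset)
            offset += 1
    return table


# Inverse map: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_level_table().items()}


def decode_byte_level_token(token: str) -> str:
    """Turn a displayed vocabulary string back into the text it encodes."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # A token may hold only part of a multi-byte character, so do not fail hard.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["âĶľ", "âĶľâĶĢâĶĢ", "agree"]:
        print(repr(tok), "->", repr(decode_byte_level_token(tok)))
```

Under that assumption, âĶľ decodes to the box-drawing character ├ and âĶľâĶĢâĶĢ to ├──, the kind of glyphs used to draw directory trees, while plain ASCII tokens such as agree decode to themselves.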
POSITIVE LOGITS
moratorium      1.19
boycott         1.13
halt            1.06
truce           0.91
referendum      0.87
barrage         0.85
roundup         0.83
dismissal       0.83
demonstration   0.83
crackdown       0.82
Activations Density 0.078%
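To put the density figure in concrete terms, the short sketch below converts the percentage into a rate. It assumes the density is the fraction of sampled tokens on which the feature has a nonzero activation, which this page does not state; `density_to_rate` is a hypothetical helper.

```python
def density_to_rate(density_percent: float) -> tuple:
    """Convert a density percentage into (activations per 100k tokens, one-in-N)."""
    fraction = density_percent / 100.0   # 0.078% -> 0.00078
    per_100k = fraction * 100_000        # expected activating tokens per 100,000
    one_in = 1.0 / fraction              # average spacing between activating tokens
    return per_100k, one_in


if __name__ == "__main__":
    per_100k, one_in = density_to_rate(0.078)
    print(f"about {per_100k:.0f} per 100,000 tokens, roughly 1 in {one_in:.0f}")
```

Read this way, 0.078% corresponds to about 78 activating tokens per 100,000, or roughly one token in 1,282, so the feature is sparse.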