INDEX
Explanations
references to governmental or official actions and positions
New Auto-Interp
Negative Logits
ipo
-0.14
okus
-0.14
ç
-0.14
ifest
-0.13
sis
-0.13
eleri
-0.13
itud
-0.13
ds
-0.13
ahoma
-0.13
bero
-0.13
POSITIVE LOGITS
move
0.33
moves
0.24
move
0.23
spokesman
0.22
statement
0.22
spokeswoman
0.21
Move
0.20
spokesperson
0.20
-move
0.20
exact
0.20
Activations Density 0.164%