INDEX
Explanations
news events and incidents involving violence or conflict
New Auto-Interp
Negative Logits
respons
-0.75
thereafter
-0.70
xxxx
-0.70
thereof
-0.67
thereto
-0.63
antage
-0.62
nown
-0.62
olicy
-0.61
_.
-0.61
代
-0.60
POSITIVE LOGITS
zbollah
0.89
oÄŁan
0.79
Expand
0.75
Vegan
0.71
Wiki
0.70
][
0.69
Description
0.68
reetings
0.67
ONDON
0.66
resa
0.65
Activations Density 0.270%