INDEX
Explanations
mentions of political conflicts and international events related to war and refugee crises
New Auto-Interp
Negative Logits
''.
-0.53
".
-0.50
thood
-0.50
$.
-0.49
".
-0.46
boil
-0.45
EStreamFrame
-0.45
.''.
-0.45
'.
-0.44
bluff
-0.44
POSITIVE LOGITS
meanwhile
0.57
countered
0.56
wrote
0.54
reacted
0.53
commented
0.53
Fr
0.53
echoed
0.52
WARN
0.51
Vital
0.50
Ori
0.49
Activations Density 0.733%