INDEX
Explanations
locations and events related to news headlines
references to controversial social and political events or entities
New Auto-Interp
Negative Logits
Emer
-0.70
à¨
-0.66
":"/
-0.66
odder
-0.63
keyes
-0.63
actionGroup
-0.62
erenn
-0.60
Ô
-0.60
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.59
Defin
-0.59
POSITIVE LOGITS
clashed
0.81
alleging
0.75
amid
0.72
sparked
0.69
violates
0.69
blamed
0.67
slammed
0.66
criticised
0.65
leaked
0.65
Belfast
0.64
Activations Density 0.438%