INDEX
Explanations
mentions of specific locations and entities in news articles
references to specific nationalities or groups related to significant events
New Auto-Interp
Negative Logits
_.
-0.61
nonetheless
-0.55
accordingly
-0.53
however
-0.50
descriptive
-0.48
*.
-0.48
herer
-0.48
foremost
-0.48
moreover
-0.47
+.
-0.47
POSITIVE LOGITS
Belfast
0.65
zbollah
0.65
ocaust
0.60
chwitz
0.57
Rohingya
0.55
tyres
0.53
debt
0.48
illegally
0.48
DoS
0.48
rubbish
0.48
Activations Density 1.163%