INDEX
Explanations
mentions of locations or events
sentences and phrases that indicate reports or statements related to incidents or events
New Auto-Interp
Negative Logits
exha
-0.83
undermin
-0.78
councill
-0.74
£ı
-0.74
bryce
-0.73
mbuds
-0.73
byss
-0.68
ataka
-0.68
hitherto
-0.64
surplus
-0.64
POSITIVE LOGITS
JUST
1.46
CNN
1.22
Photos
1.14
CNN
0.96
Replay
0.94
Media
0.80
BBC
0.75
IMAGES
0.75
NPR
0.75
"...
0.74
Activations Density 0.291%