INDEX
Explanations
keywords related to news articles and political events
occurrences of the abbreviation "PA" related to news agency references
New Auto-Interp
Negative Logits
lings
-0.89
tie
-0.85
naire
-0.75
worms
-0.72
worldly
-0.71
nets
-0.69
line
-0.68
icity
-0.68
nets
-0.67
lers
-0.66
POSITIVE LOGITS
UL
1.06
WN
1.06
INT
0.98
BLE
0.94
GE
0.90
BILITY
0.89
IRED
0.88
ignt
0.84
veyard
0.82
UNCH
0.81
Activations Density 0.020%