INDEX
Explanations
proper nouns or phrases related to news agencies or organizations
instances of the abbreviation "PA" typically related to press releases or news articles
New Auto-Interp
Negative Logits
lings
-0.85
tie
-0.84
worms
-0.82
naire
-0.75
worldly
-0.71
selves
-0.67
minded
-0.66
seeing
-0.66
measles
-0.65
worm
-0.65
POSITIVE LOGITS
UL
1.11
WN
1.08
INT
0.94
BILITY
0.89
ignt
0.89
IRED
0.88
BLE
0.87
GE
0.86
USE
0.85
veyard
0.81
Activations Density 0.024%