INDEX
Explanations
proper nouns related to politics and individuals associated with them
mentions of specific individuals or entities within a context of accountability or newsworthiness
New Auto-Interp
Negative Logits
uckland
-0.83
ingen
-0.78
atively
-0.69
osion
-0.69
ayers
-0.68
falls
-0.68
oppy
-0.66
aying
-0.65
uring
-0.65
olation
-0.64
POSITIVE LOGITS
illac
0.90
rov
0.84
jet
0.75
SHIP
0.75
arov
0.73
Stanton
0.72
WARE
0.72
mbol
0.72
meric
0.72
GOODMAN
0.70
Activations Density 0.012%