INDEX
Explanations
mentions of specific organizations or entities related to government and international affairs
New Auto-Interp
Negative Logits
rencies
-0.84
ities
-0.80
paces
-0.79
ernels
-0.79
riages
-0.78
aters
-0.77
ularity
-0.77
ages
-0.76
Romans
-0.76
ads
-0.76
POSITIVE LOGITS
staffer
1.21
spokeswoman
1.20
spokesman
1.15
spokesperson
1.15
colleague
1.13
employee
1.04
representative
0.97
broch
0.96
official
0.95
aide
0.94
Activations Density 0.222%