INDEX
Explanations
mentions of a specific country or collective entity
occurrences of the word "nation" in various contexts
Negative Logits
omething      -0.82
urations      -0.71
hift          -0.71
IFT           -0.69
wrapper       -0.68
ension        -0.67
err           -0.66
pread         -0.66
Vaj           -0.65
Adds          -0.65
Positive Logits
wide          1.07
States        0.75
sovere        0.75
matically     0.74
ographically  0.73
legislature   0.72
agog          0.71
ically        0.71
urally        0.70
ographer      0.70
Activations Density 0.028%