INDEX
Explanations
references to a specific country or nation
mentions of the concept of "nation"
New Auto-Interp
Negative Logits
urations
-0.77
omething
-0.77
IFT
-0.72
ension
-0.72
ection
-0.66
Adds
-0.66
TERN
-0.66
Accessory
-0.66
Ware
-0.64
ctory
-0.64
POSITIVE LOGITS
wide
1.07
States
0.76
matically
0.74
legislature
0.74
sovere
0.73
Grid
0.73
ographically
0.70
anthem
0.67
skyline
0.67
mable
0.66
Activations Density 0.020%