INDEX
Explanations
words related to national matters or entities
references to national significance or context
New Auto-Interp
Negative Logits
xual
-0.91
omething
-0.86
YE
-0.81
MODE
-0.79
herty
-0.76
enhagen
-0.75
hops
-0.75
chnology
-0.73
rett
-0.73
eva
-0.73
POSITIVE LOGITS
ized
0.94
ities
0.90
anthem
0.89
Geographic
0.89
wide
0.87
ised
0.84
ITY
0.82
security
0.81
izations
0.81
ization
0.80
Activations Density 0.038%