INDEX
Explanations
countries and proper nouns
references to countries or geographical locations
Negative Logits
xual       -0.63
tremend    -0.60
citiz      -0.58
Lauder     -0.57
prayer     -0.52
ĸļ         -0.50
osexual    -0.47
worldly    -0.47
urga       -0.47
dden       -0.46
Positive Logits
thur             0.65
iasco            0.63
SpaceEngineers   0.58
ombat            0.55
emp              0.55
à¦               0.55
Äį               0.53
ourgeois         0.52
ucket            0.52
ĵĺ               0.50
Activations Density 1.073%