INDEX
Explanations
words related to locations or names
words related to elements of identity or nationality
New Auto-Interp
Negative Logits
OWS
-0.70
Ashes
-0.67
Subject
-0.65
drawn
-0.65
Normandy
-0.64
ILCS
-0.64
space
-0.64
AAP
-0.63
rences
-0.61
mark
-0.61
POSITIVE LOGITS
zzi
1.33
vernment
1.26
zzo
1.08
oco
1.03
zzle
0.96
onga
0.92
zo
0.90
cephal
0.89
opsis
0.89
berto
0.88
Activations Density 0.055%