INDEX
Explanations
proper nouns related to various locations and organizations
geographical names and locations
Negative Logits
Token      Logit
Americ     -0.65
omsky      -0.64
aci        -0.64
aer        -0.61
ente       -0.60
estyles    -0.60
adobe      -0.59
uers       -0.59
geoning    -0.59
sych       -0.59
Positive Logits
Token          Logit
outper         0.72
counterpart    0.72
Stadium        0.71
's             0.68
lacks          0.64
itself         0.62
fans           0.62
substitution   0.60
monop          0.60
ians           0.59
Activations Density 0.383%
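
As a rough illustration of what a density figure like this may represent, the sketch below assumes it is the fraction of sampled token positions on which the feature's activation is above zero. That reading is an assumption, not something the page states. The function name `activation_density` and the toy activation values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 4 of which activate the feature.
toy_activations = [0.0] * 996 + [0.8, 1.3, 0.2, 2.1]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.383% would mean the feature fires on roughly 4 of every 1,000 sampled tokens.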