INDEX
Explanations
proper nouns related to geographical locations
proper nouns, particularly names of places and individuals
New Auto-Interp
Negative Logits
Plex
-0.76
well
-0.66
rhy
-0.63
Trend
-0.63
Pastebin
-0.61
ĸļ
-0.60
bondage
-0.60
Veronica
-0.60
yond
-0.59
nce
-0.59
POSITIVE LOGITS
etz
0.89
orts
0.89
nikov
0.88
ocalyptic
0.84
olitan
0.82
etts
0.81
ics
0.79
etric
0.77
athy
0.77
eln
0.77
Activations Density 0.045%