INDEX
Explanations
the last names of political figures
proper nouns related to political figures and organizations
New Auto-Interp
Negative Logits
cannabin
-0.67
atern
-0.66
nect
-0.64
NAS
-0.63
alog
-0.62
olicy
-0.62
asonic
-0.60
awarding
-0.59
angible
-0.59
administering
-0.58
POSITIVE LOGITS
tsky
0.90
ttes
0.77
geoning
0.75
ez
0.75
yre
0.74
vez
0.73
gence
0.73
dit
0.72
illance
0.72
tsy
0.72
Activations Density 0.054%