INDEX
Explanations
terms related to geographical locations and political events
New Auto-Interp
Negative Logits
plet
-0.92
rentice
-0.88
ertation
-0.86
psey
-0.83
otom
-0.82
erva
-0.81
olicy
-0.81
lot
-0.81
odder
-0.80
otion
-0.78
POSITIVE LOGITS
ledged
0.90
л
0.85
ties
0.81
cut
0.79
abbrevi
0.78
itarian
0.77
Known
0.75
abouts
0.72
cuts
0.71
iary
0.71
Activations Density 9.497%