INDEX
Explanations
years, locations, and names related to politics and elections
New Auto-Interp
Negative Logits
inates
-0.92
ctica
-0.86
ional
-0.85
uate
-0.82
inations
-0.80
orie
-0.78
ion
-0.76
rary
-0.75
ogy
-0.74
uations
-0.73
POSITIVE LOGITS
nesday
1.17
edge
0.90
fare
0.83
tip
0.83
nut
0.82
esome
0.77
ington
0.77
combe
0.76
nuts
0.76
yer
0.74
Activations Density 1.433%