Explanations
proper nouns related to political figures
Negative Logits
Token     Logit
exempt    -0.83
road      -0.78
nir       -0.78
rl        -0.77
agra      -0.76
ahime     -0.76
ebook     -0.74
yk        -0.74
nexus     -0.74
rf        -0.74
Positive Logits
Token     Logit
Rodham    1.07
Trump     1.07
Reagan    1.03
Wallace   0.99
Trudeau   0.99
Clinton   0.98
Johnson   0.98
Gates     0.98
Kennedy   0.97
Obama     0.95
Activations Density 1.202%