INDEX
Explanations
names of political figures
names of individuals and organizations
New Auto-Interp
Negative Logits
ultimate
-0.88
words
-0.79
dies
-0.78
alore
-0.73
itars
-0.73
romeda
-0.73
sci
-0.72
teenth
-0.72
mates
-0.71
istar
-0.70
POSITIVE LOGITS
Baldwin
0.90
Reed
0.88
Bennett
0.87
Gib
0.85
Stewart
0.85
Gomez
0.85
Howard
0.85
Kir
0.84
Kurt
0.84
Lopez
0.83
Activations Density 0.238%