INDEX
Explanations
specific names associated with prominent public figures and their actions
New Auto-Interp
Negative Logits
letico
-0.70
pregn
-0.69
erala
-0.68
Pixie
-0.68
loads
-0.66
alach
-0.64
population
-0.64
ATHER
-0.63
plays
-0.63
dolphin
-0.63
POSITIVE LOGITS
feld
1.09
hetti
0.93
heimer
0.91
owitz
0.88
stein
0.86
bard
0.86
bach
0.84
baum
0.84
wald
0.84
berg
0.83
Activations Density 0.003%