INDEX
Explanations
comparisons of public figures in terms of their motivations and contributions
New Auto-Interp
Negative Logits
ldr
-0.16
ÑĢоÑĪ
-0.16
tam
-0.15
mpr
-0.15
/popper
-0.14
è²´
-0.14
tery
-0.14
eam
-0.14
LOB
-0.14
ournée
-0.14
POSITIVE LOGITS
Trump
0.20
Trump
0.18
Golf
0.17
-Trump
0.17
.enumer
0.17
unning
0.17
Chow
0.16
Gol
0.16
golf
0.16
NY
0.15
Activations Density 0.080%