INDEX
Explanations
proper nouns related to individuals and their attributes
Negative Logits
ButtonItem   -0.20
ayi          -0.19
jspb         -0.16
ington       -0.16
ewan         -0.16
neau         -0.14
bane         -0.14
uke          -0.14
zier         -0.14
inq          -0.13
Positive Logits
of           0.31
của          0.24
of           0.22
mili         0.17
ustil        0.16
Nicholson    0.15
hometown     0.15
uner         0.15
ủa           0.15
icky         0.14
Activations Density 0.054%