Explanations
words related to specific names
names or identifiers, particularly those related to people and locations
Negative Logits
backer      -0.72
kefeller    -0.69
govtrack    -0.67
ruary       -0.65
Xi          -0.64
<+          -0.63
isance      -0.62
pack        -0.62
ascript     -0.62
issance     -0.60
Positive Logits
oshi        0.86
azi         0.82
anga        0.79
chuk        0.76
ugu         0.74
iko         0.73
hya         0.71
abi         0.70
obiles      0.70
awk         0.70
Activations Density 0.155%