Explanations
names of people and potentially organizations
proper nouns and names
Negative Logits
Token           Logit
Carnage         -0.67
サーティワン    -0.66
lesbians        -0.65
Masquerade      -0.63
Somali          -0.63
Sigma           -0.62
[&              -0.62
cknowled        -0.62
Skydragon       -0.62
initialized     -0.62
Positive Logits
Token           Logit
odore           0.75
dinand          0.75
romeda          0.69
alin            0.69
ison            0.68
entin           0.67
berman          0.67
rick            0.67
undy            0.65
espie           0.65
Activations Density 0.217%
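
The density figure above is not defined on this page. A minimal sketch of one common reading, assuming it means the fraction of sampled token positions on which the feature activates above zero, is shown here. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled token positions
    with a non-zero (above-threshold) activation for this feature.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 3 active positions out of 1000 tokens.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Under this reading, a density of 0.217% would correspond to roughly 2 active positions per 1000 sampled tokens.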