Explanations
words related to titles or names ending in "anc"
proper nouns or specific names
Negative Logits
ãĥīãĥ©    -0.80
ĪĴ        -0.79
ãĢIJ       -0.68
åĤ        -0.68
nings     -0.66
knife     -0.66
Gors      -0.66
Fraz      -0.64
IMAGES    -0.63
nces      -0.63
Positive Logits
orp       1.04
ourt      1.01
uten      0.92
ulty      0.92
ipation   0.89
eteria    0.88
isco      0.88
ultural   0.88
risis     0.84
henko     0.83
Activations Density: 0.018%