INDEX
Explanations
proper nouns related to personal names
references to titles or ranks associated with individuals or entities
New Auto-Interp
Negative Logits
xon
-0.85
Hack
-0.76
hya
-0.76
berman
-0.72
mble
-0.72
hn
-0.68
gel
-0.64
ADRA
-0.63
ĸļ
-0.61
plet
-0.61
POSITIVE LOGITS
awks
1.07
awk
1.06
ttp
1.05
orse
0.84
mares
0.84
ouses
0.82
ighth
0.81
azer
0.78
unction
0.77
olly
0.77
Activations Density 0.019%