INDEX
Explanations
names or partial names related to individuals or places
proper nouns, particularly names
New Auto-Interp
Negative Logits
BILITIES
-0.82
entimes
-0.71
FML
-0.71
leneck
-0.68
rency
-0.66
farious
-0.65
decay
-0.64
cled
-0.64
bably
-0.62
mentation
-0.62
POSITIVE LOGITS
ovych
1.11
oi
0.78
Hir
0.74
aya
0.72
Ara
0.72
ji
0.71
Hatt
0.70
wo
0.69
chev
0.69
Yug
0.68
Activations Density 0.255%