INDEX
Explanations
proper names of individuals
New Auto-Interp
Negative Logits
ailles
-0.17
uforia
-0.16
Äĩe
-0.15
egov
-0.15
olls
-0.14
esian
-0.14
endale
-0.14
å®ħ
-0.14
roker
-0.14
laÄį
-0.14
POSITIVE LOGITS
islav
0.26
oslav
0.23
fried
0.20
éric
0.20
ÅĻich
0.20
fred
0.18
imir
0.17
bert
0.17
loyd
0.15
ildo
0.15
Activations Density 0.296%