INDEX
Explanations
names of individuals, particularly those involved in professional or public roles
Negative Logits
Token    Logit
amer     -0.17
çŃĨ      -0.15
ib       -0.14
=cut     -0.14
udy      -0.14
hon      -0.13
imir     -0.13
Trib     -0.13
nostr    -0.13
966      -0.13
Positive Logits
Token    Logit
кид      0.17
èle      0.15
kea      0.15
kest     0.15
ungal    0.15
cuador   0.15
ãĥ¼ãĥĵ   0.15
ambre    0.14
вад      0.14
ιθ       0.14
Activations Density 0.713%