Explanations
proper nouns, particularly names of individuals and entities
Negative Logits
će       -0.15
uforia   -0.15
lač      -0.14
egov     -0.14
емати    -0.14
ailles   -0.14
olls     -0.14
msp      -0.13
endale   -0.13
roker    -0.13
Positive Logits
islav    0.25
oslav    0.24
fried    0.23
bert     0.18
fred     0.18
řich     0.17
éric     0.17
odore    0.17
ko       0.17
ildo     0.16
Activations Density 0.324%
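
The density figure above is most naturally read as the share of sampled token positions on which this feature fires. The sketch below shows that reading under stated assumptions: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the page itself does not say how its figure was computed.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature is active; this is an illustrative definition, not taken from
    the page.
    """
    if len(activations) == 0:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: 324 active positions out of 100,000, chosen only to show
# what a density of 0.324% would correspond to under this definition.
sample = [1.0] * 324 + [0.0] * (100_000 - 324)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this definition a feature with a density this low is sparse: it is silent on the vast majority of tokens, which fits a feature tied to a narrow class such as name fragments.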