INDEX
Explanations
proper nouns, particularly names of people and places
New Auto-Interp
Negative Logits
Gallimard
-0.69
Cæsar
-0.66
:]:
-0.65
hvem
-0.65
revanche
-0.65
appé
-0.64
andaag
-0.64
zaś
-0.62
tersebut
-0.62
epam
-0.62
POSITIVE LOGITS
تضيفلها
0.93
McN
0.77
CopyWith
0.69
Familienname
0.64
leh
0.63
Савезне
0.62
szóci
0.61
PreferredItem
0.60
SharedDtor
0.59
oul
0.58
Activations Density 0.837%