INDEX
Explanations
biographical information about individuals, particularly actresses and public figures
New Auto-Interp
Negative Logits
293
-0.17
Frank
-0.16
ens
-0.15
834
-0.14
ilo
-0.14
екÑĤоÑĢ
-0.14
essen
-0.14
unt
-0.14
uis
-0.13
arResult
-0.13
POSITIVE LOGITS
ÑģеÑĢ
0.14
uÄŁ
0.14
ANJI
0.14
ycastle
0.14
$MESS
0.14
yster
0.14
leston
0.14
OKIE
0.14
Ïĩν
0.13
sol
0.13
Activations Density 0.012%