Explanations
proper nouns, particularly names of individuals
Negative Logits
Token            Logit
ecies            -0.17
гов              -0.17
.MixedReality    -0.16
рам              -0.16
terdam           -0.15
porto            -0.15
edis             -0.15
onas             -0.15
urovision        -0.15
зи               -0.15
Positive Logits
Token            Logit
iac              0.16
dew              0.16
TT               0.15
901              0.15
ans              0.15
igh              0.14
or               0.14
аду              0.14
Chap             0.14
CG               0.14
Activations Density: 0.026%