INDEX
Explanations
names of people and their activities or contributions
New Auto-Interp
Negative Logits
alars
-0.17
iland
-0.16
oland
-0.15
iž
-0.15
ontrol
-0.15
alah
-0.15
Warnings
-0.15
ovo
-0.15
adro
-0.15
rico
-0.15
POSITIVE LOGITS
ainen
0.15
สà¸ģ
0.15
vala
0.15
cz
0.14
.jd
0.14
ì±Ħ
0.14
alsy
0.13
icz
0.13
III
0.13
Carp
0.13
Activations Density 0.329%